This project trains a decoder-only transformer model in PyTorch to predict the imaginary parts of the non-trivial zeros of the Riemann zeta function (rzz). The data is provided via a function interface that returns the zeros in high-precision format. The model is trained on a next-token prediction task over a small vocabulary.
- **Hyperparameter Tuning**
  Run `python src/tune_hyper_parameters.py` to grid-search over a subset of the data (the first 10,000 zeros) and select the best hyperparameters (a sketch of such a grid-search loop follows this list).
- **Training**
  Run `python src/train.py` to train the model on successive million-zero intervals. The script saves checkpoints, so training can be resumed (see the checkpointing sketch below).
- **Evaluation**
  Run `python src/evaluate.py` to evaluate the fully trained model on the test set using nucleus sampling (p = 0.9) and report the mean squared error (a sampling sketch is given after this list).
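The actual search space and training loop live in `src/tune_hyper_parameters.py`. Purely as an illustration of the structure of such a grid search, here is a minimal sketch with hypothetical hyperparameter names and a hypothetical `train_and_validate` helper:

```python
# Minimal grid-search sketch. The hyperparameter names and the
# train_and_validate helper are hypothetical; the real search space is
# defined in src/tune_hyper_parameters.py.
from itertools import product

search_space = {
    "n_layers": [2, 4],
    "n_heads": [2, 4],
    "d_model": [64, 128],
    "learning_rate": [1e-3, 3e-4],
}

best_loss, best_config = float("inf"), None
for values in product(*search_space.values()):
    config = dict(zip(search_space.keys(), values))
    # Hypothetical helper: train briefly on the first 10,000 zeros and
    # return the validation loss for this configuration.
    val_loss = train_and_validate(config, n_zeros=10_000)
    if val_loss < best_loss:
        best_loss, best_config = val_loss, config

print("best hyperparameters:", best_config, "val loss:", best_loss)
```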
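The checkpoint format used by `src/train.py` is not shown here; the snippet below is a generic PyTorch save/resume pattern under assumed names (`model`, `optimizer`, and the file `checkpoint.pt`):

```python
import os
import torch

CKPT_PATH = "checkpoint.pt"  # assumed path; src/train.py may use a different one

def save_checkpoint(model, optimizer, step):
    # Persist model and optimizer state together with the current step.
    torch.save(
        {"model": model.state_dict(),
         "optimizer": optimizer.state_dict(),
         "step": step},
        CKPT_PATH,
    )

def load_checkpoint(model, optimizer):
    # Resume from the saved state if a checkpoint exists; otherwise start at step 0.
    if not os.path.exists(CKPT_PATH):
        return 0
    state = torch.load(CKPT_PATH, map_location="cpu")
    model.load_state_dict(state["model"])
    optimizer.load_state_dict(state["optimizer"])
    return state["step"]
```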
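For reference, nucleus (top-p) sampling keeps only the smallest set of tokens whose cumulative probability reaches p and renormalizes before sampling. The function below is a generic sketch over a logits tensor, not necessarily the exact code in `src/evaluate.py`:

```python
import torch

def nucleus_sample(logits, p=0.9):
    """Sample one token id from logits using nucleus (top-p) sampling."""
    probs = torch.softmax(logits, dim=-1)
    sorted_probs, sorted_ids = torch.sort(probs, descending=True)
    cumulative = torch.cumsum(sorted_probs, dim=-1)
    # Keep the smallest prefix of tokens whose cumulative probability reaches p.
    keep = cumulative - sorted_probs < p
    keep[..., 0] = True  # always keep the most likely token
    filtered = torch.where(keep, sorted_probs, torch.zeros_like(sorted_probs))
    filtered = filtered / filtered.sum(dim=-1, keepdim=True)
    choice = torch.multinomial(filtered, num_samples=1)
    return sorted_ids.gather(-1, choice)
```

Decoded predictions can then be converted back to floats and compared against the true zeros with a plain mean of squared differences.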
- The data is loaded via the function `zeros_starting_at_N(N, number_of_zeros)` from `zeros_db.py`.
- The tokens are defined over the vocabulary
  `['0', '1', '2', '3', '4', '5', '6', '7', '8', '9', '.', ':', 'b', 'e', ' ', 'p']`.
- The context window is fixed at 32 tokens; sequences shorter than 32 tokens are padded with `'p'` (see the tokenization sketch after this list).
- The transformer is implemented from basic building blocks in PyTorch (a minimal attention block is sketched below).
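The vocabulary, the 32-token context, the `'p'` padding token, and the `zeros_starting_at_N` interface come from the repository; the exact serialization of a zero (in particular how `':'`, `'b'`, and `'e'` are used) is defined in the project's data pipeline, so the `encode` helper below is only an assumed illustration of the general pattern:

```python
# Hypothetical tokenization sketch; the real format (roles of ':', 'b', 'e')
# is defined by the project's data pipeline.
VOCAB = ['0', '1', '2', '3', '4', '5', '6', '7', '8', '9', '.', ':', 'b', 'e', ' ', 'p']
STOI = {ch: i for i, ch in enumerate(VOCAB)}
CONTEXT = 32

def encode(text: str) -> list[int]:
    """Map a string over the vocabulary to token ids, padded with 'p' to 32 tokens."""
    ids = [STOI[ch] for ch in text]
    ids += [STOI['p']] * (CONTEXT - len(ids))
    return ids[:CONTEXT]

# Data loading uses the documented interface, e.g.:
#   from zeros_db import zeros_starting_at_N
#   zeros = zeros_starting_at_N(1, 10_000)   # first 10,000 zeros, high precision

# Example: the first non-trivial zero has imaginary part ~14.134725141...
print(encode("14.134725141"))
```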
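As a reference for what a "basic building block" can look like, here is a compact causal self-attention module written directly in PyTorch. The dimensions and layout are placeholders, not the project's actual architecture:

```python
import math
import torch
import torch.nn as nn
import torch.nn.functional as F

class CausalSelfAttention(nn.Module):
    """Minimal multi-head causal self-attention block (illustrative sizes)."""
    def __init__(self, d_model=64, n_heads=4, context=32):
        super().__init__()
        assert d_model % n_heads == 0
        self.n_heads = n_heads
        self.qkv = nn.Linear(d_model, 3 * d_model)   # joint query/key/value projection
        self.proj = nn.Linear(d_model, d_model)       # output projection
        # Lower-triangular mask: position i may only attend to positions <= i.
        mask = torch.tril(torch.ones(context, context)).view(1, 1, context, context)
        self.register_buffer("mask", mask)

    def forward(self, x):
        B, T, C = x.shape
        q, k, v = self.qkv(x).split(C, dim=-1)
        # Reshape to (batch, heads, time, head_dim).
        q = q.view(B, T, self.n_heads, C // self.n_heads).transpose(1, 2)
        k = k.view(B, T, self.n_heads, C // self.n_heads).transpose(1, 2)
        v = v.view(B, T, self.n_heads, C // self.n_heads).transpose(1, 2)
        att = (q @ k.transpose(-2, -1)) / math.sqrt(k.size(-1))
        att = att.masked_fill(self.mask[:, :, :T, :T] == 0, float("-inf"))
        att = F.softmax(att, dim=-1)
        y = (att @ v).transpose(1, 2).contiguous().view(B, T, C)
        return self.proj(y)
```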
Feel free to customize the code further to suit your hardware and runtime (Mac mini M4 or cloud GPUs/TPUs).
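For example, on a Mac mini M4 the MPS backend can be used when available, falling back to CUDA or CPU elsewhere. The snippet below is a generic device-selection pattern, not necessarily what the scripts already do:

```python
import torch

# Pick the best available backend: Apple-silicon MPS, then CUDA, then CPU.
if torch.backends.mps.is_available():
    device = torch.device("mps")
elif torch.cuda.is_available():
    device = torch.device("cuda")
else:
    device = torch.device("cpu")
```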