Random-Crypto

License: MIT | arXiv: 2506.02048

The Random-Crypto Benchmark is a procedurally generated dataset of cryptographic CTF challenges, designed for reinforcement learning of LLM-based agents.

The benchmark's website can be visited here.

  • ✅ 50 human-verified challenges for evaluation (link)
  • ⚙️ 5,000 non-verified challenges for training (link)

⚙️ Generating New Challenges

Set up the environment:

# Create a virtual environment (recommended)
python -m venv venv
source venv/bin/activate 

# Install dependencies
pip install -r requirements.txt

Make sure to set your OpenAI API key in a .env file at the root of this folder:

OPENAI_API_KEY=your-key-here
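At runtime the generator reads this key from the environment. The repository likely relies on a library such as python-dotenv for this; the mechanics can be sketched with the standard library alone (the `load_env` helper below is illustrative, not part of the repo):

```python
import os

def load_env(path=".env"):
    # Illustrative helper: parse KEY=VALUE lines from a .env file
    # into os.environ, skipping blank lines and comments.
    if not os.path.exists(path):
        return
    with open(path) as f:
        for line in f:
            line = line.strip()
            if not line or line.startswith("#") or "=" not in line:
                continue
            key, _, value = line.partition("=")
            # setdefault: an already-exported variable wins over the file
            os.environ.setdefault(key.strip(), value.strip())

load_env()
api_key = os.environ.get("OPENAI_API_KEY")  # None if the file or key is missing
```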

Example Usage

This command generates 50 challenges, one of each type:

python main.py --variants 1 --output_folder my_generated_challenges

This command generates 5,000 challenges, one hundred of each type:

python main.py --variants 100 --output_folder my_generated_challenges
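The totals follow directly from the counts above: the benchmark covers 50 challenge types, and `--variants` requests that many instances of each type. A one-line sanity check of that relationship (`total_challenges` is an illustrative helper, not part of the repo):

```python
NUM_CHALLENGE_TYPES = 50  # one generator per challenge type, per the examples above

def total_challenges(variants: int) -> int:
    # --variants N produces N instances of each of the 50 challenge types
    return variants * NUM_CHALLENGE_TYPES
```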

How To Cite

@article{muzsai2025improving,
  title={Improving LLM Agents with Reinforcement Learning on Cryptographic CTF Challenges},
  author={Muzsai, Lajos and Imolai, David and Luk{\'a}cs, Andr{\'a}s},
  journal={arXiv preprint arXiv:2506.02048},
  year={2025}
}
