A working implementation of a transformer language model, built from scratch following *Build a Large Language Model From Scratch* by Sebastian Raschka.
- `src/` — Core implementation (tokenizer, attention, transformer blocks, training loop)
- `notebooks/` — Exploratory work and chapter exercises
- `data/` — Training data (gitignored if large)
- `tests/` — Unit tests for core components
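As a taste of the attention component listed above, here is a minimal NumPy sketch of single-head scaled dot-product attention. This is illustrative only — the function name and shapes are assumptions for the example, not the repo's actual `src/` code:

```python
import numpy as np

def scaled_dot_product_attention(q, k, v):
    """Minimal single-head attention: softmax(q k^T / sqrt(d)) @ v."""
    d = q.shape[-1]
    scores = q @ k.T / np.sqrt(d)                         # (seq, seq) similarity scores
    scores = scores - scores.max(axis=-1, keepdims=True)  # subtract row max for numerical stability
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=-1, keepdims=True)  # rows sum to 1
    return weights @ v                                    # weighted sum of value vectors

# Toy self-attention: 3 tokens with 4-dimensional embeddings, q = k = v = x
rng = np.random.default_rng(0)
x = rng.normal(size=(3, 4))
out = scaled_dot_product_attention(x, x, x)
print(out.shape)  # (3, 4)
```

The real implementation adds learned query/key/value projections, a causal mask, and multiple heads, but the core computation is the same.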
```sh
python3.12 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
```

Developed on Apple Silicon (MPS backend). Cross-architecture experiments on NVIDIA (CUDA) and AMD (ROCm) GPUs are documented separately.
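Since the code runs across MPS and CUDA (ROCm builds of PyTorch also report as `cuda`), backend selection might be sketched like this — `pick_device` is a hypothetical helper for illustration, not part of this repo:

```python
import torch

def pick_device():
    """Prefer CUDA (covers NVIDIA and ROCm builds), then Apple MPS, else CPU."""
    if torch.cuda.is_available():
        return torch.device("cuda")
    if torch.backends.mps.is_available():
        return torch.device("mps")
    return torch.device("cpu")

device = pick_device()
print(device)  # e.g. mps on Apple Silicon
```

Moving the model and batches with `.to(device)` then keeps the training loop identical across machines.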