Skip to content

Kurtiscwright/llm-from-scratch

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

LLM From Scratch

Working implementation of a transformer language model, built from scratch following Build a Large Language Model From Scratch by Sebastian Raschka.

Structure

  • src/ — Core implementation (tokenizer, attention, transformer blocks, training loop)
  • notebooks/ — Exploratory work and chapter exercises
  • data/ — Training data (gitignored if large)
  • tests/ — Unit tests for core components

Setup

python3.12 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt

Hardware

Developed on Apple Silicon (MPS backend). Cross-architecture experiments on NVIDIA (CUDA) and AMD (ROCm) GPUs documented separately.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages