Non-record: Basis Block Interpolation (novel negative result) + Hyperparameter Sweep (MATRIX_LR=0.03 improves SOTA by 0.059 bpb) by j420 · Pull Request #530 · openai/parameter-golf

j420 · 2026-03-23T13:11:31Z

Novel architecture exploration + systematic hyperparameter optimization.

Key contributions:

- Basis Block Interpolation: 5 basis blocks × 3 unrolls = 15 effective layers at dim=576. Documented as an informative negative result: block reuse is bottlenecked by the torch.compile(fullgraph=False) speed penalty.
- Hyperparameter sweep: 15+ controlled experiments on 1xH100 SXM, identifying MATRIX_LR=0.03 as a 0.059 bpb improvement over the default 0.02.
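The depth arithmetic behind the first contribution (5 basis blocks reused over 3 unrolls, giving 15 effective layers) can be illustrated with a toy sketch. This is only an illustration of weight reuse: the PR text does not describe how the blocks are interpolated or in what order they run, so the cyclic ordering, the function names `unrolled_schedule` and `apply_unrolled`, and the toy blocks are all assumptions.

```python
from typing import Callable, List


def unrolled_schedule(num_basis_blocks: int = 5, num_unrolls: int = 3) -> List[int]:
    """Return the index of the basis block used at each effective layer.

    Assumption: the blocks are cycled in order on every unroll. The PR
    does not state the actual ordering or the interpolation scheme.
    """
    return [b for _ in range(num_unrolls) for b in range(num_basis_blocks)]


def apply_unrolled(x, blocks: List[Callable], num_unrolls: int):
    """Apply the shared blocks repeatedly, reusing the same parameters."""
    for idx in unrolled_schedule(len(blocks), num_unrolls):
        x = blocks[idx](x)
    return x


# Toy stand-ins for transformer blocks: block k simply adds k.
toy_blocks = [lambda v, k=k: v + k for k in range(5)]

schedule = unrolled_schedule()
effective_layers = len(schedule)       # 5 blocks x 3 unrolls
unique_blocks = len(set(schedule))     # only 5 distinct parameter sets
toy_output = apply_unrolled(0, toy_blocks, 3)
```

The point of the sketch is that effective depth grows with the number of unrolls while the number of distinct parameter sets stays at the number of basis blocks.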

Best val_bpb: 1.4963 (1xH100, standard eval)
Track: non-record

LeakyReLU(0.5)^2: zero extra params, proven -0.003 BPB vs relu^2. Addresses dead neuron problem. LEAKY_RELU=1 env var.

run_no_ttt_best.sh: run3 base + three free lunches:
- MATRIX_LR=0.03 (PR openai#530, verified -0.005+ BPB)
- LeakyReLU(0.5)^2 (zero params, -0.003 BPB)
- QAT=1 (run5 proved negative quant gap)

Drops sigmoid gates and decoder 2x LR (run6 showed they hurt). Real target is openai#445 at 1.1236 (not openai#505 which doesn't fit 16MB).
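To make the "dead neuron" remark about LeakyReLU(0.5)^2 concrete, here is a minimal scalar sketch. It assumes the notation means a leaky ReLU with negative slope 0.5 followed by squaring; the function names are hypothetical and this is not taken from the repository's code.

```python
def relu_sq(x: float) -> float:
    """relu(x)^2: zero output for every negative input."""
    return max(x, 0.0) ** 2


def relu_sq_grad(x: float) -> float:
    """Derivative of relu(x)^2: zero for all x <= 0."""
    return 2.0 * x if x > 0.0 else 0.0


def leaky_relu_sq(x: float, negative_slope: float = 0.5) -> float:
    """Assumed form of LeakyReLU(0.5)^2: scale negatives, then square."""
    y = x if x >= 0.0 else negative_slope * x
    return y * y


def leaky_relu_sq_grad(x: float, negative_slope: float = 0.5) -> float:
    """Derivative of the assumed form: nonzero for negative inputs."""
    if x >= 0.0:
        return 2.0 * x
    return 2.0 * negative_slope * negative_slope * x


x = -2.0
relu_out, relu_grad = relu_sq(x), relu_sq_grad(x)
leaky_out, leaky_grad = leaky_relu_sq(x), leaky_relu_sq_grad(x)
```

Under this reading, a unit whose pre-activation is negative still receives a gradient, whereas with relu^2 both the output and the gradient are zero there. The activation itself adds no parameters, which matches the "zero extra params" note.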

j420 added 2 commits March 23, 2026 18:36

Create README.md

f0876e3

Add files via upload

68932d5

NotADevIAmaMeatPopsicle mentioned this pull request Mar 23, 2026

Record: pcloadloveletter v6 — Novel Codebook+Huffman Compression + AdamW TTT (val_bpb=1.0487) #532

Closed

bigbag mentioned this pull request Mar 23, 2026

Non-record: 1.1354 BPB — 10L TTT 22ep AdamW Cosine + LeakyReLU(0.5)² + TrigramHash #562

Open

j420 wants to merge 2 commits into openai:main from j420:main
