Record: Order-Adaptive Entropy Gating + XSA-All (val_bpb=0.9370) #774

Open
travispchen wants to merge 1 commit into openai:main from travispchen:order-adaptive-entropy-gating
@travispchen

N-gram7 BPB: 0.9370 (±0.0003) across seeds 1337/42/2025
Sliding BPB: 1.1222 (±0.0003)
Artifact: ~15.9 MB (within 16MB cap)
Training: 600s on 8xH100

Key innovation: order-adaptive entropy gating assigns a different entropy threshold to each n-gram order. High-order matches (7-gram) are trusted at moderate model confidence; low-order matches (2-gram) are trusted only when the model is very uncertain.

Built on PR #753 (Podracing II) with XSA extended to all 11 layers and entropy_center=3.0.
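
A minimal sketch of the gating idea described above, not the code in this PR. Only the per-order threshold concept and `entropy_center=3.0` come from the description; the linear threshold schedule, the `spread` and `scale` parameters, the sigmoid gate, the choice of nats, and all function names are assumptions made for illustration.

```python
import math


def entropy(probs):
    """Shannon entropy (in nats, an assumption) of a probability distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def order_center(order, max_order=7, min_order=2, base_center=3.0, spread=2.0):
    """Hypothetical per-order entropy threshold.

    The highest order gets base_center (3.0, taken from the PR text);
    lower orders get progressively higher thresholds, so they are only
    trusted when the model is more uncertain. The linear schedule and
    spread value are illustrative assumptions.
    """
    frac = (max_order - order) / (max_order - min_order)
    return base_center + spread * frac


def gate_weight(model_entropy, order, scale=1.0):
    """Weight in (0, 1) given to the n-gram prediction (assumed sigmoid gate)."""
    return 1.0 / (1.0 + math.exp(-(model_entropy - order_center(order)) / scale))


def mix(p_model, p_ngram, model_entropy, order):
    """Blend model and n-gram probabilities for one token using the gate."""
    w = gate_weight(model_entropy, order)
    return (1.0 - w) * p_model + w * p_ngram


if __name__ == "__main__":
    h = entropy([0.25, 0.25, 0.25, 0.25])  # uniform over 4 tokens
    for n in (2, 4, 7):
        print(n, round(order_center(n), 2), round(gate_weight(h, n), 4))
```

Under these assumptions, at a fixed model entropy a 7-gram match always receives a larger weight than a 2-gram match, which is the behavior the description asks for; the actual thresholds would need calibration.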

…ed mean)

Co-Authored-By: Travis Chen <travispchen@gmail.com>
@newjordan

newjordan commented Mar 26, 2026

We need to share notes! I mean... we just did =) but calibrations. I haven't calibrated it yet.
