Record: Complementary Training + Backoff N-gram Mixer — 0.4377 BPB #1

Open
quietsmile wants to merge 1 commit into main from
submission/complementary-backoff-ngram-mixer
Conversation

@quietsmile
Owner

Summary

Key Techniques

  1. Complementary Training (COMPLEMENT_ALPHA=0.5): bigram-weighted loss reweighting
  2. BackoffNgramMixer: orders 2-10, entropy-adaptive alpha mixing
  3. Legal score-first AdamW TTT: 4 epochs, lr=5e-4, freeze first 2 blocks
  4. Stride=128: negligible BPB impact, halves eval time
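
Technique 1 can be sketched as follows. This is a hypothetical reconstruction, not the PR's actual code: the interpolation formula, the function names, and the assumption that a bigram cache supplies a per-token probability are all mine.

```python
# Hypothetical sketch of bigram-weighted complementary loss reweighting.
# Tokens the bigram cache predicts well get downweighted, pushing the
# neural model to specialize on tokens the cache cannot predict.
COMPLEMENT_ALPHA = 0.5

def complement_weights(bigram_probs, alpha=COMPLEMENT_ALPHA):
    # weight = (1 - alpha) + alpha * (1 - p_bigram)
    # p_bigram = 0 (cache clueless)  -> weight 1.0
    # p_bigram = 1 (cache perfect)   -> weight 1 - alpha
    return [(1 - alpha) + alpha * (1 - p) for p in bigram_probs]

def reweighted_loss(token_losses, bigram_probs, alpha=COMPLEMENT_ALPHA):
    # Weighted mean of per-token cross-entropy losses.
    w = complement_weights(bigram_probs, alpha)
    return sum(wi * li for wi, li in zip(w, token_losses)) / sum(w)
```

With `alpha=0.5` the weights span [0.5, 1.0], so even perfectly cached tokens keep half weight rather than being dropped from the loss entirely.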

Acknowledgment

Based on PR openai#803 by @pentxayc. Core innovation of complementary training is their contribution.

Results

| Seed | Steps | val_bpb | eval_time |
|------|-------|---------|-----------|
| 1337 | 7,003 | 0.4377  | 450s      |
| 42   | 7,011 | 0.4380  | 450s      |

Reproduction of PR openai#803's complementary training approach on 8x L20Z (H100).
Two-seed validation: 0.4377 (seed=1337), 0.4380 (seed=42).

Key: bigram-weighted loss reweighting (COMPLEMENT_ALPHA=0.5) trains the
neural model to specialize on tokens that n-gram caches can't predict,
combined with the BackoffNgramMixer (orders 2-10) and legal score-first
AdamW TTT.
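
The entropy-adaptive mixing mentioned above might look roughly like the sketch below. This is an illustrative guess at the idea, not the submission's implementation: the function names, the linear entropy-to-alpha mapping, and the single-order mixing (the real mixer backs off across orders 2-10) are assumptions.

```python
import math

def entropy(dist):
    # Shannon entropy (nats) of a probability distribution.
    return -sum(p * math.log(p) for p in dist if p > 0)

def entropy_adaptive_alpha(ngram_dist, max_alpha=0.5):
    # Confident (low-entropy) n-gram predictions get more mixing weight;
    # a uniform (maximum-entropy) prediction gets none.
    h_max = math.log(len(ngram_dist))
    return max_alpha * (1 - entropy(ngram_dist) / h_max) if h_max > 0 else max_alpha

def mix(neural_dist, ngram_dist):
    # Convex combination of the two next-token distributions.
    a = entropy_adaptive_alpha(ngram_dist)
    return [(1 - a) * pn + a * pg for pn, pg in zip(neural_dist, ngram_dist)]
```

Because the mix is a convex combination of two valid distributions, the output still sums to 1 and can be scored directly for BPB.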

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>