Record: Polar Express NS + MIN_LR + GatedAttn + Alpha LoRA — val_bpb 1.07006 (3-seed mean) #1792

Open

renqianluo wants to merge 1 commit into openai:main from renqianluo:record/polar-minlr-1.07006

Conversation

@renqianluo

Summary

Stacks three independently-validated improvements from other authors onto our PR #1768:

  1. Polar Express NS coefficients (ported from PR #1344, Record: SP4096 + Polar Express + MuonEq-R + Depth Recurrence — 1.0923 BPB, 3-seed) — 5 per-iteration minimax-optimal (a, b, c) Newton-Schulz tuples instead of the single fixed tuple (3.4445, -4.775, 2.0315) applied 5 times. Higher-quality polar factor at unchanged MUON_BACKEND_STEPS=5; see the iteration sketch after this list.
  2. MIN_LR=0.10 warmdown floor (from @nprime06's PR #1787, Record: PR #1736 + Polar Express NS + MIN_LR + Sparse Attn Gate + Fused CE + PR #1767 TTT — val_bpb 1.06335) — the warmdown no longer decays all the way to zero, so the final ~25% of training keeps delivering useful gradient updates; see the schedule sketch after this list.
  3. Tight budget polish (also from @nprime06's PR #1787): GPTQ_RESERVE_SECONDS=0.5 (down from 4.0) and VAL_LOSS_EVERY=0 together reclaim ~15s for extra training steps.
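
Below is a minimal sketch of how per-iteration coefficients slot into a Muon-style quintic Newton-Schulz polar routine. The function name, the `BASELINE_COEFFS` / `POLAR_EXPRESS_COEFFS` names, and the overall structure are illustrative assumptions; the actual Polar Express tuples are defined in PR #1344 and are not reproduced here.

```python
import torch

# Baseline: one fixed tuple repeated for all MUON_BACKEND_STEPS=5 iterations.
BASELINE_COEFFS = [(3.4445, -4.775, 2.0315)] * 5

# Polar Express instead supplies 5 distinct minimax-optimal (a, b, c) tuples,
# one per iteration (values live in PR #1344; placeholder name only):
# POLAR_EXPRESS_COEFFS = [(a1, b1, c1), ..., (a5, b5, c5)]

def newton_schulz_polar(G: torch.Tensor, coeffs=BASELINE_COEFFS) -> torch.Tensor:
    """Approximate the polar factor of G with quintic Newton-Schulz steps.

    Each step applies X <- a*X + b*(X X^T) X + c*(X X^T)^2 X using that
    step's (a, b, c). Swapping in per-iteration tuples changes only the
    coefficients; the step count stays at len(coeffs) == 5.
    """
    assert G.ndim >= 2
    X = G.bfloat16()
    transposed = X.size(-2) > X.size(-1)
    if transposed:                      # iterate on the short side
        X = X.mT
    # Frobenius-normalize so the spectral norm is <= 1 before iterating.
    X = X / (X.norm(dim=(-2, -1), keepdim=True) + 1e-7)
    for a, b, c in coeffs:
        A = X @ X.mT
        X = a * X + (b * A + c * (A @ A)) @ X
    if transposed:
        X = X.mT
    return X
```

The claim being ported is that a per-step minimax schedule yields a closer orthogonal approximation within the same 5 matmul-heavy iterations, which is why MUON_BACKEND_STEPS is left untouched.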
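
And a similarly hedged sketch of what the MIN_LR floor does during warmdown, assuming a hold-then-linear-decay schedule over the last ~25% of steps (the `lr_multiplier` name, the `warmdown_frac` parameter, and the exact schedule shape are assumptions; the real schedule is whatever PR #1787 implements):

```python
def lr_multiplier(step: int, num_steps: int,
                  warmdown_frac: float = 0.25, min_lr: float = 0.10) -> float:
    """Illustrative LR multiplier: hold at 1.0, then warm down linearly.

    Without a floor the multiplier reaches ~0 on the last step, so the
    tail of training barely moves the weights. With MIN_LR=0.10 the decay
    is clamped, and late steps still deliver useful gradient updates.
    """
    warmdown_steps = int(warmdown_frac * num_steps)
    if step < num_steps - warmdown_steps:
        return 1.0                                   # constant phase
    remaining = num_steps - step
    return max(remaining / warmdown_steps, min_lr)   # clamp at the MIN_LR floor
```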

Trajectory

| Seed | PR #1767 | PR #1768 | This PR |
|------|----------|----------|---------|
| 1337 | 1.07189 | 1.07146 | 1.07027 |
| 42 | 1.07248 | 1.07014 | 1.06964 |
| 314 | 1.07189 | 1.07082 | 1.07026 |
| Mean | 1.07209 | 1.07081 | 1.07006 |

Every seed improves monotonically across each change.

Compliance

Train 599.6s (all 3), eval 474–481s, artifact 15.98MB. Issue #1017 conditions 1–4 verified.

Attribution

Polar Express NS coefficients: PR #1344. MIN_LR warmdown floor and tight budget polish: @nprime06's PR #1787. Base recipe: our PR #1768.