Add LR0.85 prefix2750 legal TTT record by ZanePeycke · Pull Request #2047 · openai/parameter-golf

ZanePeycke · 2026-04-30T23:30:52Z

Record submission: AutoZany LR0.85 + prefix2750 legal phased TTT

This PR adds one new 10min/16MB record folder:

records/track_10min_16mb/2026-04-30_AutoZany_LR085_Prefix2750_LegalTTT_1.05908/

Result

3-seed mean val_bpb: 1.05907559
Population std: 0.00041335
Mean val_loss: 2.31764997
Hardware: 8x H100 SXM
Track: 10min_16mb

Seed	Stop step	Train ms	Pre-quant BPB	Quant no-TTT BPB	Final BPB	Eval ms	Artifact bytes
42	4889	596017	1.06193713	1.07022525	1.05849788	473829	15,976,870
0	4888	596020	1.06282344	1.07113858	1.05928718	464754	15,980,787
1234	4906	596115	1.06264173	1.07122159	1.05944171	468273	15,984,508

What changed

This is a conservative final-day variant on the public PR #1953 / PR #1945 lineage. It keeps the same legal score-first phased TTT path and changes the final TTT evaluation neighborhood:

TTT_LOCAL_LR_MULT=0.85
PHASED_TTT_PREFIX_DOCS=2750
PHASED_TTT_NUM_PHASES=3
EVAL_SEQ_LEN=2560
TTT_EVAL_SEQ_LEN=2560
TTT_MASK=no_qv
TTT_Q_LORA=0
TTT_V_LORA=0
QK_GAIN_INIT=5.25
ASYM_LOGIT_RESCALE=1
AWQ_LITE_ENABLED=1
COMPRESSOR=pergroup

The submitted train_gpt.py is the PR #1953 stack source used for the verified runs. The final BPBs above come from TTT_EVAL_ONLY=1 re-evaluations of the saved artifacts with PHASED_TTT_PREFIX_DOCS=2750.

Compliance checklist

Files included

README.md: method summary, results table, compliance notes, reproduction command, lineage.
submission.json: structured metadata and per-seed results.
train_gpt.py: executable training/eval script.
train_seed42.log, train_seed0.log, train_seed1234.log: full train + quantization logs.
ttt_prefix2750_seed42.log, ttt_prefix2750_seed0.log, ttt_prefix2750_seed1234.log: final TTT_EVAL_ONLY=1 prefix2750 eval logs.

Reproduction

Run the script once per seed with the config shown in the record README. To reproduce the final reported score from a saved artifact, rerun with:

TTT_EVAL_ONLY=1 \
PHASED_TTT_PREFIX_DOCS=2750 \
TTT_LOCAL_LR_MULT=0.85 \
TTT_MASK=no_qv \
TTT_Q_LORA=0 \
TTT_V_LORA=0 \
TTT_EVAL_SEQ_LEN=2560 \
TTT_CHUNK_SIZE=48 \
COMPRESSOR=pergroup \
torchrun --standalone --nproc_per_node=8 train_gpt.py

Lineage

Built on the public PR #1953 / PR #1945 / PR #1855 lineage: AWQ-lite, Asymmetric Logit Rescale, CaseOps tokenizer, SparseAttnGate, SmearGate, LQER, QK gain, and legal score-first phased TTT. This PR contributes the final-day TTT_LOCAL_LR_MULT=0.85 + PHASED_TTT_PREFIX_DOCS=2750 legal eval selection and 3-seed verification under the hard time and artifact limits.

Add AutoZany final-day legal TTT record

561ab04

ZanePeycke changed the title ~~Add AutoZany LR0.85 prefix2750 legal TTT record~~ Add LR0.85 prefix2750 legal TTT record May 1, 2026

cocohearts mentioned this pull request May 2, 2026

Update leaderboard with May 1 audited rows #2146

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add LR0.85 prefix2750 legal TTT record#2047

Add LR0.85 prefix2750 legal TTT record#2047
ZanePeycke wants to merge 1 commit intoopenai:mainfrom
ZanePeycke:codex/final-day-lrmult085-prefix2750

ZanePeycke commented Apr 30, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

ZanePeycke commented Apr 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Record submission: AutoZany LR0.85 + prefix2750 legal phased TTT

Result

What changed

Compliance checklist

Files included

Reproduction

Lineage

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

ZanePeycke commented Apr 30, 2026 •

edited

Loading