Skip to content

Record: SP4096 + Compressibility Regularization — val_bpb 1.11349 (6-seed mean)#1508

Closed
jpfeiffe wants to merge 2 commits intoopenai:mainfrom
jpfeiffe:submit/sp4096-wd2-brotli
Closed

Record: SP4096 + Compressibility Regularization — val_bpb 1.11349 (6-seed mean)#1508
jpfeiffe wants to merge 2 commits intoopenai:mainfrom
jpfeiffe:submit/sp4096-wd2-brotli

Conversation

@jpfeiffe
Copy link
Copy Markdown

@jpfeiffe jpfeiffe commented Apr 9, 2026

6-seed mean SW val_bpb: 1.11349
#1019 3-seed mean: 1.11473
Delta: -0.00125
Welch's t-test (one-sided): t=-4.19, df≈6.6, p=0.00289

Three changes from #1019:

  1. SP4096 tokenizer (sproos/parameter-golf-tokenizers,
    docs_sha256 verified identical to willdepueoai/parameter-golf)
  2. WARMDOWN_WD_MULT=2.0 (ramped weight decay during warmdown)
  3. brotli-11 selected as artifact compressor when smaller
    than lzma (true for all 6 seeds)

All artifacts under 16MB. Pruning: 0% across all 6 seeds.

See README for full mechanism, statistical details, and
reproduction. bpb_verification.md and data_lineage.md cover
the manual BPB recomputation and data lineage proof.

Files:
records/track_10min_16mb/2026-04-08_SP4096_WD2_brotli_jpfeiffe/

jpfeiffe and others added 2 commits April 9, 2026 18:05
…seed mean)

SP4096 tokenizer + WARMDOWN_WD_MULT=2.0 + Brotli-11 compression
on the openai#1019 GPTQ+XSA base. 6-seed mean 1.11349 vs openai#1019's 1.11474.
Welch t=-4.19, df=6.6, p=0.00289 (one-sided).

No TTT, no eval-time compute. WD compression frees ~1.5MB which
absorbs SP4096's +1.1MB embedding cost. Pruning 0% on all 6 seeds.

Co-Authored-By: Claude Opus 4.6 (1M context) <[email protected]>
- Removed all references to other PRs (openai#365, openai#1089, etc.)
- Removed "novel" language
- Brotli: "selected when smaller than LZMA-9" (matches code)
- WD ramp: accurately describes group["weight_decay"] mechanism
- Added Verification section pointing to support files
- Streamlined Changes section

Co-Authored-By: Claude Opus 4.6 (1M context) <[email protected]>
@jpfeiffe jpfeiffe closed this Apr 10, 2026
@jpfeiffe
Copy link
Copy Markdown
Author

Other work has been merged in since filed.

Copy link
Copy Markdown

@akhoyannh-a11y akhoyannh-a11y left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@@ -0,0 +1,28 @@
# BPB Verification
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
# BPB Verification
# BPB Verification[]()

@@ -0,0 +1,34 @@
# Data Lineage Verification
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants