Non-record: Cross-Base Regularizer Transferability - methodological study (20+ cells, 10 figures) #2011
Open
BharathSShankar wants to merge 1 commit into openai:main from
…logical study

Cross-base regularizer transferability study: 7 novel regularizers (QAHSP, ES, AOS, HSU, WBC, WOP, PCS) measured on two distinct training bases (SP10240+SimCTG vs PR openai#1965 phased TTT lineage).

Findings:
1. Cross-base sign change for QAHSP / ES (3.80 mBPB swing).
2. Quant cost is reg-independent on Base A (14.3-14.9 mBPB tax).
3. PreQuantTTT x ES compounds; PreQuantTTT x QAHSP doesn't.
4. Three mechanistic checks (SVD spectrum, depth trajectory, CKA) show regs leave a sub-quant-noise fingerprint upstream.

This is a research/analysis contribution, not a record submission. No model artifact; ships README + 15 figures + 6 analysis JSON tables + reproduction scripts.
Summary
Methodological (non-record) contribution. Studies how seven novel
regularizers (QAHSP, ES, AOS, HSU, WBC, WOP, PCS) transfer across two
distinct training bases for the 10-min / 16 MB track: SP10240+SimCTG and
the PR openai#1965 phased TTT lineage.
The headline finding is that regularizer benefit does not transfer
across bases: QAHSP, for example, helps Base A by ≈0.1 mBPB but
moves Base B by ≈ +3.7 mBPB (i.e. it hurts). ES shows a similar but
smaller swing. We also measure a real-data reg × quant interaction
matrix across 7 quantization schemes (int4/6/8 sym/asym per-tensor/per-row,
AWQ-lite, GPTQ-lite) and decompose the eval pipeline into pre-quant
grade → post-quant → sliding-window → TTT stages.
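To make the sign-change arithmetic concrete, here is a minimal sketch of how a cross-base swing could be computed from two per-base deltas. The function name `cross_base_swing` and the convention that a delta is "regularized minus baseline" (so negative means the regularizer helps) are assumptions for illustration, not taken from the reproduction scripts. The inputs are the approximate QAHSP figures quoted above.

```python
def cross_base_swing(delta_a_mbpb: float, delta_b_mbpb: float) -> dict:
    """Compare one regularizer's effect on two training bases.

    Each delta is assumed to be (regularized - baseline) in mBPB,
    so a negative delta means the regularizer helps on that base.
    """
    swing = delta_b_mbpb - delta_a_mbpb
    # A sign change means the regularizer helps on one base and hurts on the other.
    sign_change = delta_a_mbpb * delta_b_mbpb < 0
    return {"swing_mbpb": swing, "sign_change": sign_change}


# Approximate QAHSP figures from the summary: helps Base A by ~0.1 mBPB,
# hurts Base B by ~3.7 mBPB.
qahsp = cross_base_swing(delta_a_mbpb=-0.1, delta_b_mbpb=3.7)
print(qahsp)
```

With these inputs the swing comes out at roughly 3.8 mBPB with a sign change, which matches the swing reported for QAHSP.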
This is a research/analysis contribution intended to inform reviewers
and future submitters about regularizer-base interaction. No model
artifact is shipped.
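The pipeline-stage decomposition mentioned in the summary can be pictured as a waterfall: the change in val_bpb between each pair of consecutive eval stages. The sketch below is one hedged way to compute it. The stage keys, the `stage_attribution` helper, and the input numbers are all hypothetical placeholders, not values or names from the shipped JSON tables, and it assumes 1 mBPB = 0.001 val_bpb.

```python
# Hypothetical stage keys mirroring the stages named in the summary.
STAGES = ["pre_quant", "post_quant", "sliding_window", "ttt"]


def stage_attribution(val_bpb_by_stage: dict) -> dict:
    """Return the change in mBPB contributed by each stage transition."""
    steps = {}
    for prev, cur in zip(STAGES, STAGES[1:]):
        delta_bpb = val_bpb_by_stage[cur] - val_bpb_by_stage[prev]
        steps[f"{prev}->{cur}"] = delta_bpb * 1000.0  # bpb -> mBPB (assumed unit)
    return steps


# Made-up placeholder values for illustration only; NOT results from the study.
example = {"pre_quant": 1.100, "post_quant": 1.114,
           "sliding_window": 1.095, "ttt": 1.090}
steps = stage_attribution(example)
total_mbpb = (example["ttt"] - example["pre_quant"]) * 1000.0

for name, delta in steps.items():
    print(f"{name}: {delta:+.1f} mBPB")
print(f"total: {total_mbpb:+.1f} mBPB")
```

Because the per-transition deltas telescope, they sum to the end-to-end change, which makes this decomposition convenient for a waterfall figure.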
Contents
- README.md: 17-section writeup with hypotheses, methodology, cross-base swings, pipeline-stage attribution, and statistical caveats.
- figures/: 10 figures (cross-base signs, pipeline waterfall, real pre-vs-post quant, PQT × reg compounding, lambda-budget pair-vs-single, 3D PCA per-reg, canonical metrics, coord and L2-norm distributions, reg × quant matrix heatmap).
- *.json: analysis tables (real_canonical_metrics, real_reg_quant_matrix, pipeline_attribution, eval_pipeline_breakdown).
- run_reg_quant_matrix.py, build_real_data_figures.py, build_synergy_figures.py: reproduction scripts.
What we DO claim
- … base; "transferability" is not a free assumption.
- … 7 regs is trained with: quant cost is reg-independent within ~14-15 mBPB.
- … suggesting direction-shaping vs codebook-shaping regs interact differently with TTT.
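For readers less familiar with the schemes in the reg × quant matrix, the following is a toy sketch of one of them, symmetric per-row integer quantization, at 4, 6, and 8 bits. It is not the code in run_reg_quant_matrix.py: the helper names and the weight values are invented for illustration, and it measures only weight reconstruction error, whereas the quant cost reported here is a val_bpb difference.

```python
def quantize_per_row_sym(weights, bits):
    """Symmetric per-row quantization: one scale per row, zero-point fixed at 0."""
    qmax = 2 ** (bits - 1) - 1  # 7 for int4, 31 for int6, 127 for int8
    q_rows, scales = [], []
    for row in weights:
        max_abs = max(abs(w) for w in row)
        scale = max_abs / qmax if max_abs > 0 else 1.0
        # Round to the nearest integer level and clamp to [-qmax, qmax].
        q_rows.append([max(-qmax, min(qmax, round(w / scale))) for w in row])
        scales.append(scale)
    return q_rows, scales


def dequantize(q_rows, scales):
    """Map integer levels back to floats using each row's scale."""
    return [[q * s for q in row] for row, s in zip(q_rows, scales)]


def max_abs_error(a, b):
    """Largest elementwise absolute difference between two matrices."""
    return max(abs(x - y) for ra, rb in zip(a, b) for x, y in zip(ra, rb))


# Toy weights, not taken from any model in the study.
weights = [[0.50, -0.25, 0.10], [2.00, -1.00, 0.05]]
errors, bounds = {}, {}
for bits in (4, 6, 8):
    q, scales = quantize_per_row_sym(weights, bits)
    errors[bits] = max_abs_error(weights, dequantize(q, scales))
    bounds[bits] = max(scales) / 2  # rounding error is at most half a step
    print(f"int{bits}: max abs error {errors[bits]:.5f} (bound {bounds[bits]:.5f})")
```

The per-row scale keeps a large-magnitude row from inflating the step size of the others, which is the usual motivation for per-row over per-tensor scaling.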
What we do NOT claim
- … seed=42 only). The headline cross-base sign change is robust enough to call out, but mBPB-magnitude claims are point estimates.
- … embedding geometry analysis is suggestive, not predictive of post-quant val_bpb at this scale.
- … cells listed in §6.
Test plan
- … do NOT claim).
- … load-bearing finding, not the per-cell mBPB numbers.
- … track_non_record_16mb/ as a methodological contribution, not as a record entry.
🤖 Generated with Claude Code