WIP: FreqGPTQ + GatedDeltaNet + Adaptive Quantization#1743

Open
OleStan wants to merge 1 commit into openai:main from OleStan:freqgptq-gateddeltanet

Conversation


OleStan commented Apr 19, 2026

Summary

  • Built on PR #1698: GatedDeltaNet (FLA) + Legal Score-First TTT, val_bpb 1.00995 (3-seed mean)
  • FreqGPTQ: frequency-weighted Hessian calibration — top-100 tokens get 2x weight in GPTQ
  • PassthroughQuant: int8 for control tensors instead of fp16 (~40KB savings)
  • Sandwich quantization: int8 for final block to protect LM head signal
  • Adaptive embedding precision: int8 for top-100 frequent tokens, intN for rest
  • Configurable Int5/6 GPTQ with synced Late QAT clip range
  • LZMA self-extracting wrapper: ~73KB savings for model budget
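
The FreqGPTQ idea (weighting the GPTQ calibration Hessian by token frequency) could look roughly like the sketch below. This is not the PR's code: the function name, the normalization by total weight, and the NumPy formulation are illustrative assumptions.

```python
import numpy as np

def build_freq_weighted_hessian(activations, token_ids, top_tokens, boost=2.0):
    """Accumulate a frequency-weighted GPTQ Hessian H = sum_i w_i * x_i x_i^T,
    where activations produced by the most frequent tokens get extra weight.

    activations: (n_samples, d) calibration activations, one row per token position
    token_ids:   (n_samples,) token id that produced each activation row
    top_tokens:  set of the most frequent token ids (top-100 in this PR)
    boost:       weight multiplier for top tokens (2x in this PR)
    """
    weights = np.where(np.isin(token_ids, list(top_tokens)), boost, 1.0)
    # Weighted outer-product accumulation: H = X^T diag(w) X,
    # normalized by the total weight (an assumption; plain GPTQ divides by n).
    H = (activations * weights[:, None]).T @ activations
    return H / weights.sum()
```

The rest of the GPTQ pipeline (Cholesky of the damped inverse Hessian, column-wise rounding with error feedback) would consume this `H` unchanged; only the calibration statistics shift toward frequent tokens.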

Status

WIP — code complete, pending GPU validation. Will update with BPB results and 3-seed logs once compute is available.

Test plan

Built on PR openai#1698 (GatedDeltaNet + Legal TTT). Adds:
- FreqGPTQ: frequency-weighted Hessian calibration for GPTQ
- PassthroughQuant: int8 for control tensors (saves ~40KB)
- Sandwich quantization: int8 for final block
- Adaptive embedding precision: int8 top-100 / intN rest
- Configurable Int5/6 GPTQ with synced QAT
- LZMA wrapper saves ~73KB
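
A minimal sketch of the adaptive embedding precision scheme (int8 rows for the top-100 frequent tokens, a lower bit-width for the rest), assuming simple symmetric per-row quantization. The function names and the int5 default are hypothetical, not taken from the PR:

```python
import numpy as np

def quantize_embeddings(emb, top_token_ids, low_bits=5):
    """Per-row symmetric quantization of an embedding matrix.
    Rows for frequent tokens use 8 bits; all other rows use `low_bits`.
    Returns integer codes (stored as int8) and one scale per row."""
    vocab, dim = emb.shape
    codes = np.zeros((vocab, dim), dtype=np.int8)
    scales = np.ones(vocab)
    for i in range(vocab):
        bits = 8 if i in top_token_ids else low_bits
        qmax = 2 ** (bits - 1) - 1
        row_max = np.abs(emb[i]).max()
        scale = row_max / qmax if row_max > 0 else 1.0
        codes[i] = np.clip(np.round(emb[i] / scale), -qmax - 1, qmax)
        scales[i] = scale
    return codes, scales

def dequantize_embeddings(codes, scales):
    # Reconstruct float embeddings; error per entry is at most scale/2.
    return codes.astype(np.float32) * scales[:, None]
```

In a serialized checkpoint the low-bit rows would additionally be bit-packed to realize the size savings; the sketch above only shows the precision split.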

Pending GPU validation for BPB results.
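
The LZMA self-extracting wrapper can be illustrated with the following sketch: the compressed model bytes are embedded in a small Python stub that decompresses them at load time. `pack_self_extracting` is a hypothetical name; the PR's actual wrapper layout may differ.

```python
import lzma

def pack_self_extracting(payload: bytes, out_path: str) -> int:
    """Write a self-extracting Python stub containing `payload`
    compressed with LZMA. Executing the stub rebuilds `model_bytes`.
    Returns the compressed size in bytes."""
    blob = lzma.compress(payload, preset=9 | lzma.PRESET_EXTREME)
    stub = (
        "import lzma\n"
        # Embed the compressed blob as a bytes literal.
        "DATA = " + repr(blob) + "\n"
        "model_bytes = lzma.decompress(DATA)\n"
    )
    with open(out_path, "w") as f:
        f.write(stub)
    return len(blob)
```

The savings come from LZMA's strong compression of the quantized weight stream; the stub itself adds only a few dozen bytes of overhead on top of the blob.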
OleStan force-pushed the freqgptq-gateddeltanet branch from 80cad2a to 1cda344 on April 19, 2026 at 14:42
