Skip to content

Ablation: WiderGate32, RoPE dims, activation slopes, hparam stack (8xH100)#1970

Open
bsisduck wants to merge 1 commit into
openai:mainfrom
bsisduck:ablation/wider-gate-rope-activation-hparams
Open

Ablation: WiderGate32, RoPE dims, activation slopes, hparam stack (8xH100)#1970
bsisduck wants to merge 1 commit into
openai:mainfrom
bsisduck:ablation/wider-gate-rope-activation-hparams

Commits