Record: 11L Parallel Muon + N-gram Backoff Cache — val_bpb 0.2841 (3-seed mean)#865
Open
aryanbhosale wants to merge 1 commit into
Open
Record: 11L Parallel Muon + N-gram Backoff Cache — val_bpb 0.2841 (3-seed mean)#865aryanbhosale wants to merge 1 commit into
aryanbhosale wants to merge 1 commit into