Skip to content

Non-record: Mac mini M4 16GB, no H100s, still golfing (val_bpb=1.5672)#643

Open
frido22 wants to merge 5 commits intoopenai:mainfrom
frido22:non-record-macmini-m4-16gb-late-projection-ema
Open

Non-record: Mac mini M4 16GB, no H100s, still golfing (val_bpb=1.5672)#643
frido22 wants to merge 5 commits intoopenai:mainfrom
frido22:non-record-macmini-m4-16gb-late-projection-ema

Conversation

@frido22
Copy link
Copy Markdown

@frido22 frido22 commented Mar 24, 2026

Summary

  • Non-record Apple Silicon / MLX submission
  • Hardware: Mac mini M4 16GB
  • Verified best published run from frido22/low_vram_institute
  • Run ID: 2026_03_24_run_0033
  • Run title: Late Projection EMA Finalizer
  • Final exact post-quant score: val_bpb = 1.56720003
  • Final exact post-quant loss: val_loss = 3.53539760
  • Total artifact size: 15,962,372 bytes
  • Int8+zlib model size: 15,888,695 bytes

Notes

This is not an official 8xH100 record submission. It is intentionally submitted under records/track_non_record_16mb as a hardware-specific Apple Silicon non-record entry.

The submission folder was created from the published run package only:

  • output/runs/2026_03_24_run_0033

Verification was done against the published repo state, including:

  • state/ledger.jsonl
  • output/reports/history.csv
  • output/runs/2026_03_24_run_0033/submission.json
  • output/runs/2026_03_24_run_0033/artifact_size.json
  • output/runs/2026_03_24_run_0033/run.log

@MatoTeziTanka
Copy link
Copy Markdown

MatoTeziTanka commented Apr 11, 2026

Community Review — Non-record: Mac mini M4 16GB, no H100s, still golfing (val_bpb=1.5672)

Compliance: NEEDS AUTHOR ACTION — train_gpt.py fails to import on CT2038 (Python 3.10 / torch 2.10.0+cpu)

What I found: The CPU smoke test on CT2038 (proteus-engine, 128 GB RAM, Triton 3.6.0, flash_attn stub, cutlass_evt_fusion stub) failed at the import step with:

ModuleNotFoundError: No module named 'mlx'

A few of the common patterns I've seen for this class of error in the 2026-04-11 sweep:

Recommendation: Could you run python3 -c "import py_compile; py_compile.compile('train_gpt.py')" on your records-folder train_gpt.py under Python 3.10 specifically? The eval image is Python 3.10 per Issue #17 / the README, so any parse error on 3.10 blocks the submission at import time before any of the scored-eval logic runs.

Once the parse/import issue is fixed, I'll re-run the compliance audit through the normal pipeline. No other flags identified yet because the audit halts at the import step.


Reviewed by @MatoTeziTankaThe Agora. CPU smoke test (CT2038 proteus-engine, 2026-04-11): IMPORT_FAIL — ModuleNotFoundError: No module named 'mlx'. Classification via classify_prs.py AST-based classifier; full compliance audit deferred until the import issue is resolved. Auto-drafted from a template and spot-checked before posting.

@frido22
Copy link
Copy Markdown
Author

frido22 commented Apr 21, 2026

Superseded by #1762, which carries the updated Mac mini M4 non-record submission, the improved 1.51996743 result, and the records-folder import hardening for the earlier missing-mlx CPU smoke issue.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants