Pull requests: ml-explore/mlx-lm

Pull requests list

[Cohere] Add cohere2_moe model support
#1340 opened Jun 2, 2026 by Terrencezzj
Add Mellum (Mellum 2) model support
#1339 opened Jun 2, 2026 by jedisct1
Fix LFM2.5 pythonic tool parser auto-detection
#1333 opened May 31, 2026 by ChristianWeyer
Fix seeded stochastic completions in the server
#1331 opened May 30, 2026 by lyonsno Contributor
Add Step 3.7 Flash
#1325 opened May 29, 2026 by kernelpool Contributor
Guard MLA caches from KV quantization
#1323 opened May 28, 2026 by xxxkkw
Add BatchQuantizedKVCache
#1322 opened May 28, 2026 by xxxkkw
Keep model parameters for fuse GGUF export
#1321 opened May 28, 2026 by abnormal749
Fix Qwen3.5 MTP sanitize norm shift
#1320 opened May 28, 2026 by xxxkkw
Add optional fused SwitchGLU gate-up projection
#1319 opened May 28, 2026 by xxxkkw
Add opt-in recurrent profiling instrumentation
#1318 opened May 28, 2026 by xxxkkw
Add MiniCPM5 XML tool-call parser
#1317 opened May 28, 2026 by scrappylabsai
2 tasks
Add olmo hybrid
#1315 opened May 27, 2026 by cmurray1105
Add hybrid prefix cache restore coverage
#1314 opened May 27, 2026 by xxxkkw
Expose tunable Metal memory limits in the trainer
#1312 opened May 26, 2026 by kru2710shna