Pull requests: ml-explore/mlx-lm
feat(gpt-oss): support unsloth BF16 layout + shared MoE LoRA adapters
#1338
opened Jun 1, 2026 by
electron-rare
Add deepseek_dsml tool parser for DeepSeek-V4's native DSML format
#1337
opened May 31, 2026 by
snagnever
Fix json_tools: match "<tool_call" prefix to survive ">" token merge
#1336
opened May 31, 2026 by
snagnever
Support for Poolside LagunaXS open source coding model in nvfp4
#1334
opened May 31, 2026 by
tzachicohen
Fix seeded stochastic completions in the server
#1331
opened May 30, 2026 by
lyonsno
Contributor
Fix detokenizer for byte-level tokenizers with an SPM-style decoder (fixes #1041)
#1329
opened May 30, 2026 by
robertlangdonn
Fix Qwen3-Coder tool parser and harden server against mid-stream client disconnects
#1328
opened May 30, 2026 by
i1rr
4 tasks done
Fix mlx_lm.server 404 on short prompts (clamp negative start in think-token search)
#1327
opened May 29, 2026 by
devYRPauli
Consolidate MLA kv_b_proj sanitize/shard into shared helpers
#1324
opened May 29, 2026 by
scyyh11
Warn when speculative decoding may hurt throughput for MoE models
#1313
opened May 26, 2026 by
Shylin26
Fix Qwen3_5 mixed-bit per-tensor quantization keys not remapped through sanitize
#1311
opened May 25, 2026 by
ivaniguarans