-
Notifications
You must be signed in to change notification settings - Fork 188
Pull requests: waybarrios/vllm-mlx
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix(text-model-from-vlm): realize private lazy arrays before the model leaves the build thread
#614
opened Jun 12, 2026 by
ursk
Contributor
Loading…
fix(ssd-cache): preserve original bfloat16 dtype across quantized spill
#612
opened Jun 12, 2026 by
CBribiescas
Contributor
Loading…
Keep assistant tool_calls and tool messages on the MLLM chat path
#611
opened Jun 12, 2026 by
waybarrios
Owner
Loading…
fix(engine): stop MLLM text route at the model's full config EOS set
#610
opened Jun 11, 2026 by
ursk
Contributor
Loading…
Guard --mllm against continuous batching (silent empty output)
#601
opened Jun 6, 2026 by
eejd
Contributor
Loading…
fix(qwen3-xml): parse bare <function=> without <tool_call> wrapper
#597
opened Jun 6, 2026 by
CBribiescas
Contributor
Loading…
feat(mllm): auto-extract audio from video_url on omni models
#591
opened Jun 3, 2026 by
txdadlab
Loading…
fix(gpt-oss): route harmony prompts through openai-harmony (refs #568)
#581
opened May 25, 2026 by
CBribiescas
Contributor
Loading…
fix(llama-tool-parser): recognize Llama 3.1+ / 3.3 tool-call formats
#580
opened May 25, 2026 by
CBribiescas
Contributor
Loading…
fix(gpt-oss): plumb harmony tool calls all the way through to response
#562
opened May 22, 2026 by
CBribiescas
Contributor
Loading…
3 tasks
Keep VLM TextModel generation on owner thread
#543
opened May 17, 2026 by
Thump604
Collaborator
Loading…
perf: O(1) tool lookup in ToolExecutor via lazily-cached name index
optimization
#449
opened Apr 26, 2026 by
clickbrain
Contributor
Loading…
ProTip!
Follow long discussions with comments:>50.