Skip to content

Pull requests: waybarrios/vllm-mlx

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

fix(engine): stop MLLM text route at the model's full config EOS set
#610 opened Jun 11, 2026 by ursk Contributor Loading…
Guard --mllm against continuous batching (silent empty output)
#601 opened Jun 6, 2026 by eejd Contributor Loading…
fix(step): add Step3p5 parser support
#598 opened Jun 6, 2026 by Thump604 Collaborator Loading…
fix(qwen3-xml): parse bare <function=> without <tool_call> wrapper
#597 opened Jun 6, 2026 by CBribiescas Contributor Loading…
Add SimpleEngine prefix trie cache
#574 opened May 24, 2026 by Thump604 Collaborator Loading…
fix(gpt-oss): plumb harmony tool calls all the way through to response
#562 opened May 22, 2026 by CBribiescas Contributor Loading…
3 tasks
Reduce EngineCore idle polling
#552 opened May 20, 2026 by Thump604 Collaborator Loading…
Keep MLLM media stream on owner thread
#551 opened May 19, 2026 by Thump604 Collaborator Loading…
Keep VLM TextModel generation on owner thread
#543 opened May 17, 2026 by Thump604 Collaborator Loading…
fix: Qwen tool streaming recovery
#497 opened May 4, 2026 by kylejeske Loading…
ProTip! Follow long discussions with comments:>50.