Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Doc] Slight improvement to M2 and beyond documentation Improvements or additions to documentation
#27554 opened Oct 27, 2025 by jeejeelee Loading…
5 tasks
covt_e4m3_bf16
#27553 opened Oct 27, 2025 by wangyxbh Loading…
5 tasks
[Misc] Clean up utils documentation Improvements or additions to documentation frontend kv-connector ready ONLY add when PR is ready to merge/full CI is needed
#27552 opened Oct 27, 2025 by DarkLight1337 Loading…
5 tasks
[Model] Deprecate merge_by_field_config=False multi-modality Related to multi-modality (#4194) ready ONLY add when PR is ready to merge/full CI is needed
#27551 opened Oct 27, 2025 by DarkLight1337 Loading…
5 tasks
[CI/Build] Test torchrun with 8 cards ci/build documentation Improvements or additions to documentation ready ONLY add when PR is ready to merge/full CI is needed
#27548 opened Oct 27, 2025 by 22quinn Loading…
3 of 5 tasks
[Docs] add Shanghai Meetup - 2025/10 documentation Improvements or additions to documentation
#27545 opened Oct 27, 2025 by kebe7jun Loading…
5 tasks
Enhance benchmark_moe.py compatibility issues across vLLM versions performance Performance-related issues
#27541 opened Oct 27, 2025 by massif-01 Loading…
fixing mm placeholder replacement issue with gemma3 ready ONLY add when PR is ready to merge/full CI is needed
#27538 opened Oct 26, 2025 by tingtingtang1992 Loading…
5 tasks
[Kernel] add cuda kernel of causal_conv1d for qwen3-next qwen Related to Qwen models
#27534 opened Oct 26, 2025 by ZJY0516 Draft
5 tasks
[perf] Optimize Qwen2-VL Startup Performance with LRU Cache qwen Related to Qwen models
#27533 opened Oct 26, 2025 by skyloevil Draft
[Attention] Use sparse prefill kernel for fp8 kv-cache in DeepSeek-v3.2 deepseek Related to DeepSeek models v1
#27532 opened Oct 26, 2025 by LucasWilkinson Loading…
[CI/Build] Bump transformers version ci/build ready ONLY add when PR is ready to merge/full CI is needed
#27528 opened Oct 26, 2025 by DarkLight1337 Loading…
5 tasks
[Bugfix][CPU] Fallback oneDNN linear to torch linear to fix half gemm support on legecy platforms ci/build ready ONLY add when PR is ready to merge/full CI is needed
#27526 opened Oct 26, 2025 by bigPYJ1151 Loading…
3 of 5 tasks
[Multimodal][XPU]Add vision attn backend for xpu platform qwen Related to Qwen models
#27525 opened Oct 26, 2025 by yma11 Draft
[Fix] Change default MXFP4 backend for SM90 to Marlin
#27523 opened Oct 26, 2025 by mmangkad Loading…
5 tasks
[model] Add support for openPangu_Ultra_MoE documentation Improvements or additions to documentation new-model Requests to new models speculative-decoding v1
#27521 opened Oct 26, 2025 by yt0428 Loading…
5 tasks
feat: [DRAFT Ignore for now] Add Omnivinci model + subfolder HF config/tokenizer support new-model Requests to new models
#27520 opened Oct 26, 2025 by 0xrushi Loading…
3 of 5 tasks
[bugfix] fix wrong dcp_local_seq_lens calc v1
#27518 opened Oct 26, 2025 by pisceskkk Loading…
Fix issue #27486 double bos token
#27515 opened Oct 25, 2025 by baonudesifeizhai Loading…
5 tasks
[PERF] Decouple projections from GDN custom op qwen Related to Qwen models
#27512 opened Oct 25, 2025 by vadiklyutiy Loading…
Add standalone multimodal encoder benchmark frontend performance Performance-related issues
#27511 opened Oct 25, 2025 by alhridoy Loading…
add cpu device support for nixl_connector kv-connector
#27510 opened Oct 25, 2025 by ZhengHongming888 Loading…
5 tasks
ProTip! Type g i on any issue or pull request to go back to the issue listing page.