Pull requests: vllm-project/vllm
[Doc] Slight improvement to M2 and beyond
documentation
Improvements or additions to documentation
#27554
opened Oct 27, 2025 by
jeejeelee
5 tasks
[Misc] Clean up utils
documentation
Improvements or additions to documentation
frontend
kv-connector
ready
ONLY add when PR is ready to merge/full CI is needed
#27552
opened Oct 27, 2025 by
DarkLight1337
5 tasks
[Model] Deprecate merge_by_field_config=False
ready
ONLY add when PR is ready to merge/full CI is needed
multi-modality
Related to multi-modality (#4194)
#27551
opened Oct 27, 2025 by
DarkLight1337
5 tasks
[CI/Build] Test torchrun with 8 cards
ci/build
documentation
Improvements or additions to documentation
ready
ONLY add when PR is ready to merge/full CI is needed
#27548
opened Oct 27, 2025 by
22quinn
3 of 5 tasks
[BUG] Fix hybrid kvcache kernel page size issue
v1
#27547
opened Oct 27, 2025 by
vadiklyutiy
[Docs] add Shanghai Meetup - 2025/10
documentation
Improvements or additions to documentation
#27545
opened Oct 27, 2025 by
kebe7jun
5 tasks
Enhance benchmark_moe.py compatibility issues across vLLM versions
performance
Performance-related issues
#27541
opened Oct 27, 2025 by
massif-01
fixing mm placeholder replacement issue with gemma3
ready
ONLY add when PR is ready to merge/full CI is needed
#27538
opened Oct 26, 2025 by
tingtingtang1992
5 tasks
[Attention] Use sparse prefill kernel for fp8 kv-cache in DeepSeek-v3.2
deepseek
Related to DeepSeek models
v1
#27532
opened Oct 26, 2025 by
LucasWilkinson
[CI/Build] Bump transformers version
ci/build
ready
ONLY add when PR is ready to merge/full CI is needed
#27528
opened Oct 26, 2025 by
DarkLight1337
5 tasks
[Bugfix][CPU] Fallback oneDNN linear to torch linear to fix half gemm support on legacy platforms
ci/build
ready
ONLY add when PR is ready to merge/full CI is needed
#27526
opened Oct 26, 2025 by
bigPYJ1151
3 of 5 tasks
[Fix] Change default MXFP4 backend for SM90 to Marlin
#27523
opened Oct 26, 2025 by
mmangkad
5 tasks
[model] Add support for openPangu_Ultra_MoE
documentation
Improvements or additions to documentation
new-model
Requests to new models
speculative-decoding
v1
#27521
opened Oct 26, 2025 by
yt0428
5 tasks
feat: [DRAFT Ignore for now] Add Omnivinci model + subfolder HF config/tokenizer support
new-model
Requests to new models
#27520
opened Oct 26, 2025 by
0xrushi
3 of 5 tasks
[bugfix] modify api server for multi-modal inputs
ci/build
frontend
#27516
opened Oct 25, 2025 by
WorldExplored
[PERF] Decouple projections from GDN custom op
qwen
Related to Qwen models
#27512
opened Oct 25, 2025 by
vadiklyutiy
Add standalone multimodal encoder benchmark
frontend
performance
Performance-related issues
#27511
opened Oct 25, 2025 by
alhridoy
add cpu device support for nixl_connector
kv-connector
#27510
opened Oct 25, 2025 by
ZhengHongming888
5 tasks