Pull requests: vllm-project/vllm-ascend
Pull requests list
Modify the position of npu sync ops in layerwise connector for better TTFT performance
#4478
opened Nov 26, 2025 by
anysources
unregister mooncake layerwise connector
documentation
Improvements or additions to documentation
#4475
opened Nov 26, 2025 by
liziyu179
mix-placement
module:core
module:ops
module:quantization
#4470
opened Nov 26, 2025 by
Mercykid-bash
[wip]recompute scheduler adapt main
merge-conflicts
#4460
opened Nov 26, 2025 by
Shirley125
[Bugfix] Fix bug with establishing the flashcomm2 and pp communication domains.
module:tests
#4458
opened Nov 26, 2025 by
zzhx1
[bugfix] fix ray start failure: local_world_size cannot be less than visible device count error
#4457
opened Nov 26, 2025 by
leo-pony
[Refactor] Remove redundant attention operator branches.
#4455
opened Nov 26, 2025 by
weijinqian0
[Doc] Add single NPU tutorial for Qwen2.5-Omni-7B
documentation
Improvements or additions to documentation
#4446
opened Nov 26, 2025 by
Semmer2
[Performance] Improve the inference performance of Eagle3.
#4442
opened Nov 25, 2025 by
liumain1122
[Performance] Improve the inference performance of Eagle3.
#4441
opened Nov 25, 2025 by
liumain1122
support pcp & dcp for 0.11.0 vllm-ascend
merge-conflicts
module:core
#4439
opened Nov 25, 2025 by
zhenwenqi2024
Draft
[Feature][main]reconstruction kvpool connector to ascend connector
documentation
Improvements or additions to documentation
module:tests
#4438
opened Nov 25, 2025 by
fems14
upgrade torch npu version
documentation
Improvements or additions to documentation
module:tests
ready
ready for review
ready-for-test
start test by label for PR
#4433
opened Nov 25, 2025 by
wangxiyuan