fix qwen3vl ci error #4476
Conversation
👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:
If CI fails, you can run the linting and testing checks locally according to Contributing and Testing.
Code Review
The pull request addresses a CI error related to Qwen3VL by modifying the AscendQwen3_VisionTransformer class to handle grid_thw input more flexibly. While the changes introduce support for both torch.Tensor and list[list[int]] input types, there is an opportunity to improve efficiency and maintainability by streamlining the data type conversions and removing the unnecessary numpy dependency.
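For context, here is a minimal standalone sketch of the type handling the review refers to: accept grid_thw as either a torch.Tensor or a list[list[int]] and derive both a plain Python list and a device tensor without a numpy round trip. This is illustrative only; normalize_grid_thw and the example grid values are invented and are not code from the PR.

from typing import Union

import torch


def normalize_grid_thw(
        grid_thw: Union[torch.Tensor, list],
        device: torch.device) -> tuple[list, torch.Tensor]:
    # Keep the plain Python list for callers that need it, and build the
    # device tensor directly instead of going through numpy.
    if isinstance(grid_thw, list):
        grid_thw_list = grid_thw
        grid_thw_tensor = torch.tensor(grid_thw,
                                       device=device,
                                       dtype=torch.int32)
    else:
        grid_thw_list = grid_thw.tolist()
        grid_thw_tensor = grid_thw.to(device=device, dtype=torch.int32)
    return grid_thw_list, grid_thw_tensor


# Example: a single image whose patch grid is t=1, h=32, w=32 (made-up values).
thw_list, thw_tensor = normalize_grid_thw([[1, 32, 32]],
                                          device=torch.device("cpu"))

The relevant hunks from the PR diff are quoted below.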
from functools import partial
from typing import Callable, Optional

import numpy as np

if isinstance(grid_thw, list):
    grid_thw_list = grid_thw
    grid_thw = np.array(grid_thw, dtype=np.int32)
else:
    grid_thw_list = grid_thw.tolist()
    grid_thw = grid_thw.numpy()

pos_embeds = self.fast_pos_embed_interpolate(grid_thw_list)
hidden_states = hidden_states + pos_embeds
rotary_pos_emb = self.rot_pos_emb(grid_thw)
rotary_pos_emb = self.rot_pos_emb(grid_thw_list)
grid_thw_tensor = torch.tensor(grid_thw,
                               device=self.device,
                               dtype=torch.int32)
The current implementation of the forward method involves redundant data type conversions for grid_thw. Specifically, if grid_thw is a torch.Tensor, it's converted to a list, then to a numpy array, and finally back to a torch.Tensor. If grid_thw is a list, it's converted to a numpy array and then to a torch.Tensor. These intermediate conversions to numpy are inefficient and unnecessary. The grid_thw_list (a Python list) and grid_thw_tensor (a PyTorch tensor) can be prepared directly from the input grid_thw without involving numpy, improving performance and code clarity.
if isinstance(grid_thw, list):
    grid_thw_list = grid_thw
    grid_thw_tensor = torch.tensor(grid_thw,
                                   device=self.device,
                                   dtype=torch.int32)
else:
    grid_thw_list = grid_thw.tolist()
    grid_thw_tensor = grid_thw.to(device=self.device, dtype=torch.int32)
pos_embeds = self.fast_pos_embed_interpolate(grid_thw_list)
hidden_states = hidden_states + pos_embeds
rotary_pos_emb = self.rot_pos_emb(grid_thw_list)

def rot_pos_emb(self, grid_thw: list[list[int]]):
    pos_ids = []
    max_grid_size = max(max(h, w) for _, h, w in grid_thw)
    for t, h, w in grid_thw:
        hpos_ids = torch.arange(h).unsqueeze(1).expand(-1, w)
        hpos_ids = hpos_ids.reshape(
            h // self.spatial_merge_size,
            self.spatial_merge_size,
            w // self.spatial_merge_size,
            self.spatial_merge_size,
        )
        hpos_ids = hpos_ids.permute(0, 2, 1, 3)
        hpos_ids = hpos_ids.flatten()

        wpos_ids = torch.arange(w).unsqueeze(0).expand(h, -1)
        wpos_ids = wpos_ids.reshape(
            h // self.spatial_merge_size,
            self.spatial_merge_size,
            w // self.spatial_merge_size,
            self.spatial_merge_size,
        )
        wpos_ids = wpos_ids.permute(0, 2, 1, 3)
        wpos_ids = wpos_ids.flatten()
        pos_ids.append(
            torch.stack([hpos_ids, wpos_ids], dim=-1).repeat(t, 1))
    pos_ids = torch.cat(pos_ids, dim=0)
    rotary_pos_emb_full = self.rotary_pos_emb(max_grid_size)
    rotary_pos_emb = rotary_pos_emb_full[pos_ids].flatten(1)
    return rotary_pos_emb
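Illustrative sketch only, not part of the PR: the loop above produces one (h, w) index pair per patch, reordered so that patches within each spatial_merge_size x spatial_merge_size block end up adjacent. The values below (t=1, h=4, w=4, merge=2) are invented to show the resulting shape.

import torch

t, h, w, merge = 1, 4, 4, 2  # invented example values

hpos = torch.arange(h).unsqueeze(1).expand(-1, w)
hpos = hpos.reshape(h // merge, merge, w // merge, merge)
hpos = hpos.permute(0, 2, 1, 3).flatten()

wpos = torch.arange(w).unsqueeze(0).expand(h, -1)
wpos = wpos.reshape(h // merge, merge, w // merge, merge)
wpos = wpos.permute(0, 2, 1, 3).flatten()

pos_ids = torch.stack([hpos, wpos], dim=-1).repeat(t, 1)
print(pos_ids.shape)  # torch.Size([16, 2]): one (h, w) pair per patch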
Yes, I have deleted that code.
Please update the PR message and rebase your code to avoid the prefix-caching CI failure.
This pull request has conflicts; please resolve them before we can evaluate the pull request.
Signed-off-by: 李少鹏 <[email protected]>
What this PR does / why we need it?
Does this PR introduce any user-facing change?
How was this patch tested?