feat: support Qwen3-VL model on npu device. #295

wly-115 · 2025-10-29T10:09:59Z

feat:
1.support Qwen3-VL.
2.Qwen3-VL-MOE is currently not supported.

yq33victor · 2025-11-03T13:11:32Z

xllm/core/layers/npu/npu_qwen3_moe_decoder_layer_impl.cpp

    const ParallelArgs& parallel_args) {
  param.hasSharedExpert = (args.n_shared_experts() > 0);
-  param.hasSharedExpertGate = true;
+  param.hasSharedExpertGate = false;


other qwen models will be effected by this modification.

yq33victor · 2025-11-03T13:12:43Z

xllm/core/layers/npu/npu_qwen3_moe_decoder_layer_impl.cpp

  const int local_index = expert_index % num_experts_per_partition_;
  const bool is_sharded = shard_map.count(index);

-  std::lock_guard<std::mutex> lock(experts_mutex_);


yq33victor · 2025-11-03T13:13:46Z

xllm/models/llm/llm_model_base.h


  void load_model(std::unique_ptr<ModelLoader> loader,
-                  std::string prefix = "" /*llm model weight prefix*/) {
+                  std::string prefix = "model." /*llm model weight prefix*/) {


nit: all models has the prefix model. ?

It seems that the llm_head does not have this prefix.

lm_head_->load_state_dict( state_dict->get_dict_with_prefix(prefix + "lm_head."));

yq33victor · 2025-11-03T13:22:56Z

xllm/models/llm/qwen3_moe.h

+    return embed_tokens_(input_ids, 0);
+#elif defined(USE_MLU)
+    return embed_tokens_(input_ids);
+#endif


nit: Defensive programming is needed here.

#else LOG(FATAL) << "...."

wly-115 requested review from liutongxuan and yq33victor October 29, 2025 10:10

wly-115 force-pushed the qwen3-vl branch from d8a42e4 to 2e729bf Compare October 29, 2025 10:11

wly-115 requested a review from xiao-yu-chen November 2, 2025 06:43

wly-115 force-pushed the qwen3-vl branch 5 times, most recently from 39f3227 to d08f719 Compare November 3, 2025 12:59

yq33victor reviewed Nov 3, 2025

View reviewed changes

feat: support Qwen3-VL.

04b7bd9

wly-115 force-pushed the qwen3-vl branch from d08f719 to 04b7bd9 Compare November 3, 2025 13:57

liutongxuan changed the title ~~feat: support Qwen3-VL.~~ feat: support Qwen3-VL model on npu device. Nov 4, 2025

liutongxuan approved these changes Nov 4, 2025

View reviewed changes

yq33victor approved these changes Nov 4, 2025

View reviewed changes

liutongxuan merged commit 3028d49 into jd-opensource:main Nov 4, 2025
5 of 6 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: support Qwen3-VL model on npu device. #295

feat: support Qwen3-VL model on npu device. #295

Uh oh!

wly-115 commented Oct 29, 2025

Uh oh!

yq33victor Nov 3, 2025

Uh oh!

yq33victor Nov 3, 2025

Uh oh!

yq33victor Nov 3, 2025

Uh oh!

yq33victor Nov 3, 2025

Uh oh!

wly-115 Nov 4, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

feat: support Qwen3-VL model on npu device. #295

feat: support Qwen3-VL model on npu device. #295

Uh oh!

Conversation

wly-115 commented Oct 29, 2025

Uh oh!

yq33victor Nov 3, 2025

Choose a reason for hiding this comment

Uh oh!

yq33victor Nov 3, 2025

Choose a reason for hiding this comment

Uh oh!

yq33victor Nov 3, 2025

Choose a reason for hiding this comment

Uh oh!

yq33victor Nov 3, 2025

Choose a reason for hiding this comment

Uh oh!

wly-115 Nov 4, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants