Hello! I attempted to reproduce the code with the qwen2.5-vl-3b-instruct model, changing the hidden-layer dimension to 2048; the loss curve is shown below. Evaluation was run with the lora_stage234_merged checkpoint, but the outputs do not contain any of the expected special tokens such as think, answer, or sam_pad, as shown in the figure. My modifications were: running the computation on an Ascend NPU, and disabling flash_attn and the GPU-related parameters. However, this did not resolve the issue of the answer output not following the preset format. Could you please advise on the possible causes?