Skip to content

fix(data): clarify _nothink suffix warning for reasoning-only models#10613

Merged
hiyouga merged 1 commit into
hiyouga:mainfrom
GSCSD1:GSCSD1/data/fix-misleading-nothink-warning
Jun 30, 2026
Merged

fix(data): clarify _nothink suffix warning for reasoning-only models#10613
hiyouga merged 1 commit into
hiyouga:mainfrom
GSCSD1:GSCSD1/data/fix-misleading-nothink-warning

Conversation

@GSCSD1

@GSCSD1 GSCSD1 commented Jun 30, 2026

Copy link
Copy Markdown
Contributor

Fixes #10566


get_template_and_fix_tokenizer 中的警告无条件建议添加 _nothink 后缀,对于仅发布推理模型(无 Instruct 版本)的模型系列(如 Qwen3.6)会产生误导。

修改警告文本,明确说明:

  • 只有基础模型不是推理模型时需要加后缀
  • 纯推理模型系列无需加后缀

This contribution was prepared with AI assistance.

The previous warning unconditionally advised adding `_nothink` suffix,
which is misleading for model families like Qwen3.6 that only ship
reasoning variants and have no Instruct counterpart.

Closes hiyouga#10566

Co-Authored-By: Claude <noreply@anthropic.com>

@gemini-code-assist gemini-code-assist Bot left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request updates the warning message in src/llamafactory/data/template.py to clarify when the _nothink suffix should be used with reasoning templates. The reviewer suggested refining the terminology in the warning message—specifically replacing "base model" with "model" and clarifying the parenthetical explanation—to avoid confusing users regarding base versus instruct models.

Important

The consumer version of Gemini Code Assist on GitHub is being sunset. Starting June 18, 2026, new organization installations will be blocked, and all code review activity will officially cease on July 17, 2026.
For more details on the timeline and next steps, please review the Help Documentation.

Comment on lines +658 to 662
"You are using reasoning template. "
"If the base model is NOT a reasoning model (i.e., it has a separate Instruct variant), "
"please add `_nothink` suffix to disable thinking. "
"For reasoning-only model families (e.g., Qwen3.6), the suffix is not needed. "
"e.g., qwen3_vl_nothink"

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

在 LLM 社区中,“base model”(基座模型)通常特指未经指令微调(SFT)的预训练模型,而“Instruct variant”则是指对话/指令模型。在这里使用“base model”可能会让用户产生混淆(例如,用户可能会认为自己使用的是 Instruct 模型,因此不属于 base model)。

此外,“it has a separate Instruct variant” 的表述也有些令人困惑,因为如果用户正在使用非推理的 Instruct 模型,该模型本身就是 Instruct 版本,而不是“拥有”一个独立的 Instruct 版本。

建议将 “base model” 改为 “model”,并将括号内的说明调整为更准确的表述,例如 (e.g., a standard Instruct variant)

Suggested change
"You are using reasoning template. "
"If the base model is NOT a reasoning model (i.e., it has a separate Instruct variant), "
"please add `_nothink` suffix to disable thinking. "
"For reasoning-only model families (e.g., Qwen3.6), the suffix is not needed. "
"e.g., qwen3_vl_nothink"
"You are using reasoning template. "
"If the model is NOT a reasoning model (e.g., a standard Instruct variant), "
"please add '_nothink' suffix to disable thinking. "
"For reasoning-only model families (e.g., Qwen3.6), the suffix is not needed. "
"e.g., qwen3_vl_nothink"

@hiyouga hiyouga merged commit a48af5c into hiyouga:main Jun 30, 2026
15 of 16 checks passed
@hiyouga hiyouga added the solved This problem has been already solved label Jun 30, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

solved This problem has been already solved

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Doc] Misleading _nothink suffix warning for reasoning-only model families (e.g. Qwen3.6)

2 participants