MM-Verify: Enhancing Multimodal Reasoning with Chain-of-Thought Verification

[Huggingface Dataset] [MM-Verifier] [Paper]

Test-time scaling lets a model generate more tokens at inference time and is an effective way to improve accuracy. Designing an effective verifier is key to significantly improving reasoning performance, yet the multimodal (MM) domain still lacks a strong MM verifier. In this paper, we introduce MM-Verifier and MM-Reasoner to enhance multimodal reasoning through longer inference and more robust verification. First, we propose a two-step MM verification data synthesis method that combines a simulation-based tree search with verification and uses rejection sampling to generate high-quality Chain-of-Thought (CoT) data. This data is then used to fine-tune the verification model, MM-Verifier. Second, we present a more efficient method for synthesizing MMCoT data, bridging the gap between text-based and multimodal reasoning. The synthesized data is used to fine-tune MM-Reasoner.
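
To make the role of a verifier in test-time scaling concrete, here is a minimal sketch of verifier-weighted answer selection over several sampled solutions. It is an illustration only, not the code in this repository: `select_answer` and the `verifier_score` callable are hypothetical names, and the scoring function stands in for whatever score a trained verifier would return.

```python
from __future__ import annotations

from collections import defaultdict
from typing import Callable, Sequence


def select_answer(
    candidates: Sequence[tuple[str, str]],
    verifier_score: Callable[[str], float],
) -> str:
    """Pick a final answer from sampled (reasoning, answer) pairs.

    Scores of candidates that share the same final answer are summed, so this
    is verifier-weighted voting. When every candidate has a distinct answer,
    it reduces to plain best-of-N selection.
    """
    if not candidates:
        raise ValueError("need at least one candidate")
    totals: dict[str, float] = defaultdict(float)
    for reasoning, answer in candidates:
        # Hypothetical verifier call: higher score means "more likely correct".
        totals[answer] += verifier_score(reasoning)
    return max(totals, key=totals.get)


if __name__ == "__main__":
    toy_scores = {"steps A": 0.9, "steps B": 0.95, "steps C": 0.4}
    sampled = [("steps A", "12"), ("steps B", "15"), ("steps C", "12")]
    print(select_answer(sampled, toy_scores.get))
```

Sampling more candidates gives the verifier more to choose from, which is why the quality of the verifier bounds how much the extra inference compute can help.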

💥 News 💥

[2025.05] 💥 The paper was accepted to the ACL 2025 main conference!
[2025.02.23] 💥 We released the MM-Verifier model. [MM-Verifier]
[2025.02.23] 💥 We released the training datasets for MM-Verifier and MM-Reasoner. [Huggingface Dataset]

MM-Verify

Search Algorithm
/search/eval.sh

The search algorithm implementation is based on the excellent ResT-MCTS project. Thanks to its authors!

Sample each question $n$ times
/data_syn/sample_qwen2vl.py
Annotate the verification data
/data_syn/orm_to_sft.py
Clean the data
/data_syn/clean_ormData_mm_sample.py
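
The steps above (sample, annotate, clean) can be sketched roughly as follows. This is a hedged illustration of outcome-based labeling, not the logic of the scripts listed above: the function names, the record fields (`question`, `response`, `label`), and the string-match comparison are all assumptions made for the example.

```python
from __future__ import annotations

import re


def normalize(answer: str) -> str:
    """Strip all whitespace and lowercase so trivially different answers match."""
    return re.sub(r"\s+", "", answer).lower()


def build_verifier_records(
    question: str,
    gold_answer: str,
    samples: list[tuple[str, str]],
) -> list[dict[str, str]]:
    """Turn n sampled (reasoning, predicted_answer) pairs into labeled records.

    Hypothetical record format; a real pipeline may store different fields.
    """
    records: list[dict[str, str]] = []
    seen: set[str] = set()
    for reasoning, predicted in samples:
        # Cleaning: drop empty responses and exact duplicates.
        if not reasoning.strip() or reasoning in seen:
            continue
        seen.add(reasoning)
        # Annotation: label by agreement with the reference answer.
        is_correct = normalize(predicted) == normalize(gold_answer)
        records.append(
            {
                "question": question,
                "response": reasoning,
                "label": "correct" if is_correct else "incorrect",
            }
        )
    return records
```

Keeping both correct and incorrect samples matters here, since a verifier needs negative examples to learn what a flawed solution looks like.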

MM-Reasoning

Distill data with QwQ
/data_syn/test4_mavis_vllm_slz.py
Clean the distilled data
/data_syn/clean_qwqData_mm.py
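
As a rough picture of what cleaning distilled data can involve, the sketch below keeps only responses whose final answer matches the reference (a simple form of rejection sampling). It is not the content of `clean_qwqData_mm.py`: the `response` and `answer` field names and the assumption that final answers appear inside `\boxed{...}` are hypothetical choices for illustration.

```python
from __future__ import annotations

import re

# Matches \boxed{...} with no nested braces; nested cases such as
# \boxed{\frac{1}{2}} would need a real brace-matching parser.
BOXED = re.compile(r"\\boxed\{([^{}]*)\}")


def extract_boxed(text: str) -> str | None:
    """Return the content of the last \\boxed{...} in the text, if any."""
    matches = BOXED.findall(text)
    return matches[-1].strip() if matches else None


def keep_distilled(samples: list[dict[str, str]]) -> list[dict[str, str]]:
    """Keep samples whose extracted final answer equals the reference answer."""
    kept = []
    for sample in samples:
        predicted = extract_boxed(sample["response"])
        if predicted is not None and predicted == sample["answer"].strip():
            kept.append(sample)
    return kept
```

Responses with no extractable answer are dropped rather than guessed at, which trades some data volume for cleaner supervision.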
