arxiv:2410.10563
Dongfu Jiang
DongfuJiang
AI & ML interests
Large language models, modality reasoning, and their evaluation
Recent Activity
updated a Space 1 day ago: TIGER-Lab/GenAI-Arena
liked a dataset 1 day ago: tomg-group-umd/pixelprose
liked a model 2 days ago: microsoft/phi-4
Papers
10
Models
38
DongfuJiang/Qwen2-VL-VAE-7B-Instruct • Image-Text-to-Text • 405
DongfuJiang/Qwen2-VL-VAE-7B-Instruct-mochi-vae • Text2Text Generation • 76
DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_and_end_2_12_pt • Text Generation • 19
DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_and_end_2_12_sft • Text Generation • 13
DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_and_end_2_6_pt • Text Generation • 17
DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_and_end_2_6_sft • Text Generation • 16
DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_pt • Text Generation • 9
DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_sft
DongfuJiang/prm_gsm_2k_with_full_sol_mix_ref_remove_all_correct_hf • Text Generation • 21 • 1
DongfuJiang/prm_qwen25_math_gsm_2k_with_full_sol_mix_ref_redistribution_hf • Text Generation • 210
Datasets
12
DongfuJiang/PRM_SFT • 4.01M • 30
DongfuJiang/zeroeval • 13.5k • 34
DongfuJiang/PRM_eval • 9.54k • 30
DongfuJiang/eval • 6k • 31
DongfuJiang/PRM_prepared • 39.9k • 32
DongfuJiang/PRM_train • 32.7k • 30
DongfuJiang/MATH-500 • 500 • 183
DongfuJiang/simpo_v2_ultrafeedback • 59.9k • 28
DongfuJiang/VAPO • 72.5k • 31
DongfuJiang/PairRM-data • 586k • 30