Dongfu Jiang's picture

Dongfu Jiang

DongfuJiang

·

https://jdf-prog.github.io/

AI & ML interests

Large Language Model, Modality Reasoning and their evaluation

Recent Activity

updated a Space 1 day ago

TIGER-Lab/GenAI-Arena

liked a dataset 1 day ago

tomg-group-umd/pixelprose

liked a model 2 days ago

microsoft/phi-4

View all activity

Organizations

Papers 10

arxiv:2410.10563

arxiv:2406.15252

arxiv:2406.11069

arxiv:2406.04485

models 38

DongfuJiang/Qwen2-VL-VAE-7B-Instruct

Image-Text-to-Text • Updated 25 days ago • 405

DongfuJiang/Qwen2-VL-VAE-7B-Instruct-mochi-vae

Text2Text Generation • Updated 25 days ago • 76

DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_and_end_2_12_pt

Text Generation • Updated Dec 9, 2024 • 19

DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_and_end_2_12_sft

Text Generation • Updated Dec 9, 2024 • 13

DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_and_end_2_6_pt

Text Generation • Updated Dec 9, 2024 • 17

DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_and_end_2_6_sft

Text Generation • Updated Dec 9, 2024 • 16

DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_pt

Text Generation • Updated Dec 7, 2024 • 9

DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_sft

Updated Dec 7, 2024

DongfuJiang/prm_gsm_2k_with_full_sol_mix_ref_remove_all_correct_hf

Text Generation • Updated Dec 1, 2024 • 21 • 1

DongfuJiang/prm_qwen25_math_gsm_2k_with_full_sol_mix_ref_redistribution_hf

Text Generation • Updated Dec 1, 2024 • 210

datasets 12

DongfuJiang/PRM_SFT

Viewer • Updated Dec 1, 2024 • 4.01M • 30

DongfuJiang/zeroeval

Viewer • Updated Nov 27, 2024 • 13.5k • 34

DongfuJiang/PRM_eval

Viewer • Updated Nov 27, 2024 • 9.54k • 30

DongfuJiang/eval

Viewer • Updated Nov 27, 2024 • 6k • 31

DongfuJiang/PRM_prepared

Viewer • Updated Nov 26, 2024 • 39.9k • 32

DongfuJiang/PRM_train

Viewer • Updated Nov 25, 2024 • 32.7k • 30

DongfuJiang/MATH-500

Viewer • Updated Nov 6, 2024 • 500 • 183

DongfuJiang/simpo_v2_ultrafeedback

Viewer • Updated Aug 2, 2024 • 59.9k • 28

DongfuJiang/VAPO

Viewer • Updated Jul 31, 2024 • 72.5k • 31

DongfuJiang/PairRM-data

Viewer • Updated Jul 30, 2024 • 586k • 30