arxiv:2410.10563
Dongfu Jiang
DongfuJiang
AI & ML interests
Large Language Model, Modality Reasoning and their evaluation
Recent Activity
updated
a Space
about 13 hours ago
TIGER-Lab/GenAI-Arena
liked
a dataset
3 days ago
tomg-group-umd/pixelprose
liked
a model
3 days ago
microsoft/phi-4
Organizations
Papers
10
models
38
DongfuJiang/Qwen2-VL-VAE-7B-Instruct
Image-Text-to-Text
•
Updated
•
405
DongfuJiang/Qwen2-VL-VAE-7B-Instruct-mochi-vae
Text2Text Generation
•
Updated
•
76
DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_and_end_2_12_pt
Text Generation
•
Updated
•
19
DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_and_end_2_12_sft
Text Generation
•
Updated
•
13
DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_and_end_2_6_pt
Text Generation
•
Updated
•
17
DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_and_end_2_6_sft
Text Generation
•
Updated
•
14
DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_pt
Text Generation
•
Updated
•
9
DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_sft
Updated
DongfuJiang/prm_gsm_2k_with_full_sol_mix_ref_remove_all_correct_hf
Text Generation
•
Updated
•
21
•
1
DongfuJiang/prm_qwen25_math_gsm_2k_with_full_sol_mix_ref_redistribution_hf
Text Generation
•
Updated
•
191
datasets
12
DongfuJiang/PRM_SFT
Viewer
•
Updated
•
4.01M
•
31
DongfuJiang/zeroeval
Viewer
•
Updated
•
13.5k
•
34
DongfuJiang/PRM_eval
Viewer
•
Updated
•
9.54k
•
31
DongfuJiang/eval
Viewer
•
Updated
•
6k
•
32
DongfuJiang/PRM_prepared
Viewer
•
Updated
•
39.9k
•
33
DongfuJiang/PRM_train
Viewer
•
Updated
•
32.7k
•
31
DongfuJiang/MATH-500
Viewer
•
Updated
•
500
•
184
DongfuJiang/simpo_v2_ultrafeedback
Viewer
•
Updated
•
59.9k
•
27
DongfuJiang/VAPO
Viewer
•
Updated
•
72.5k
•
30
DongfuJiang/PairRM-data
Viewer
•
Updated
•
586k
•
29