Haiquan Zhao's picture

2 4

Haiquan Zhao

haidequanbu

·

https://haidequanbu.github.io

haidequanbu

AI & ML interests

Natural Language Processing, LLM safety

Recent Activity

upvoted a paper 11 days ago

Lean and Mean: Decoupled Value Policy Optimization with Global Value Guidance

upvoted a collection about 2 months ago

liked a Space 4 months ago

Qwen/Qwen2-VL

View all activity

Organizations

None yet

haidequanbu's activity

upvoted a paper 11 days ago

Lean and Mean: Decoupled Value Policy Optimization with Global Value Guidance

Paper • 2502.16944 • Published 14 days ago • 10

upvoted a collection about 2 months ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated 12 days ago • 554

liked a Space 4 months ago

Qwen2-VL-72B

Engage in multi-modal conversations with images and videos

authored a paper 4 months ago

Reflection-Bench: probing AI intelligence with reflection

Paper • 2410.16270 • Published Oct 21, 2024 • 6

liked a model 4 months ago

haidequanbu/ESC-RANK

Updated Jul 17, 2024 • 9 • 3

updated a model 8 months ago

haidequanbu/ESC-RANK

Updated Jul 17, 2024 • 9 • 3

liked a model 9 months ago

haidequanbu/ESC-Role

Text Generation • Updated Jun 21, 2024 • 19 • 4

updated a model 9 months ago

haidequanbu/ESC-Role

Text Generation • Updated Jun 21, 2024 • 19 • 4

liked a dataset about 1 year ago

Anthropic/hh-rlhf

Viewer • Updated May 26, 2023 • 169k • 12.6k • 1.29k