Jianbo Wu's picture

Jianbo Wu

jwu323

AI & ML interests

None yet

Recent Activity

Organizations

whoisking's profile picture SimpleBerry Research Lab's profile picture

jwu323's activity

replied to their post 3 days ago
reacted to di-zhang-fdu's post with ๐Ÿ‘€ about 1 month ago
view post
Post
2596
LLaMA-O1-PRM and LLaMA-O1-Reinforcement will release in this weekend.
We have implemented a novel Reinforcement finetune(RFT) pipeline that taught models learning reasoning and reward labeling without human annotation.
ยท
reacted to di-zhang-fdu's post with ๐Ÿš€ about 1 month ago
reacted to di-zhang-fdu's post with ๐Ÿš€ about 1 month ago
view post
Post
3056
  • 3 replies
ยท
reacted to di-zhang-fdu's post with ๐Ÿš€ about 2 months ago
view post
Post
1351
LLaMA-O1 Base and SFT model will be uploaded to HF today.
RLHF pipeline already ready, still waiting for data sampling.
  • 1 reply
ยท
reacted to their post with ๐Ÿš€ about 2 months ago
view post
Post
1356
We are excited to announce a new internal project, Rome, focused on advancing LLM reasoning. The code and accompanying paper will be released soon. Stay tuned!
ยท