Jianbo Wu

jwu323

jwu323's activity

replied to their post 3 days ago

Code available now: https://github.com/SimpleBerry/Rome

reacted to di-zhang-fdu's post with 👀 about 1 month ago

Post

2596

LLaMA-O1-PRM and LLaMA-O1-Reinforcement will be released this weekend.
We have implemented a novel reinforcement fine-tuning (RFT) pipeline that teaches models reasoning and reward labeling without human annotation.

3 replies

reacted to di-zhang-fdu's post with 🚀 about 1 month ago

Post

1664

ChemVLM has been accepted to AAAI 2025!
Seeing and Understanding: Bridging Vision with Chemical Knowledge Via ChemVLM (2408.07246)
Try chatting with it 🤗.
AI4Chem/ChemVLM-26B-1-2

upvoted a paper about 1 month ago

Seeing and Understanding: Bridging Vision with Chemical Knowledge Via ChemVLM

Paper • 2408.07246 • Published Aug 14, 2024 • 22

updated a model about 1 month ago

SimpleBerry/LLaMA-O1-Supervised-1129-GGML

Updated Dec 4, 2024 • 2

updated a Space about 1 month ago

Running

🐨

LlaMA-O1 Supervised 1129 GGUF

updated a model about 1 month ago

SimpleBerry/LLaMA-O1-Supervised-1129-Q2_K-GGUF

Updated Dec 4, 2024 • 7 • 1

liked 2 models about 1 month ago

SimpleBerry/LLaMA-O1-Supervised-1129-Q2_K-GGUF

Updated Dec 4, 2024 • 7 • 1

SimpleBerry/LLaMA-O1-Supervised-1129-GGML

Updated Dec 4, 2024 • 2

liked a Space about 1 month ago

Running

🐨

LlaMA-O1 Supervised 1129 GGUF

liked 2 datasets about 1 month ago

SimpleBerry/OpenLongCoT-SFT

Viewer • Updated Dec 2, 2024 • 332k • 67 • 15

SimpleBerry/OpenLongCoT-Pretrain-1202

Viewer • Updated Dec 2, 2024 • 135k • 48 • 2

liked 2 models about 1 month ago

SimpleBerry/LLaMA-O1-Base-1127

Text Generation • Updated Dec 3, 2024 • 54 • 17

SimpleBerry/LLaMA-O1-Supervised-1129

Text Generation • Updated Dec 3, 2024 • 465 • 18

reacted to di-zhang-fdu's post with 🚀 about 1 month ago

Post

3056

The first version of LLaMA-O1 has now been uploaded to HF! Here we come!
Supervised:
SimpleBerry/LLaMA-O1-Supervised-1129
Base (Pretrain):
SimpleBerry/LLaMA-O1-Base-1127
Supervised Finetune Dataset:
SimpleBerry/OpenLongCoT-SFT
Pretraining Dataset:
SimpleBerry/OpenLongCoT-Pretrain-1202
RLHF is on the way! View our GitHub Repo:
https://github.com/SimpleBerry/LLaMA-O1
Our ongoing related research:
Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B (2406.07394)
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning (2410.02884)
Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning (2411.18203)
@AdinaY @akhaliq @jwu323
------
GGUF: https://huggingface.co/Lyte/LLaMA-O1-Supervised-1129-Q4_K_M-GGUF
Online demo (CPU-only): SimpleBerry/LLaMA-O1-Supervised-1129-Demo

3 replies

reacted to di-zhang-fdu's post with 🚀 about 2 months ago

Post

1351

The LLaMA-O1 Base and SFT models will be uploaded to HF today.
The RLHF pipeline is ready; we are still waiting on data sampling.

1 reply

authored a paper about 2 months ago

Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning

Paper • 2411.18203 • Published Nov 27, 2024 • 32

upvoted a paper about 2 months ago

Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning

Paper • 2411.18203 • Published Nov 27, 2024 • 32

reacted to their post with 🚀 about 2 months ago

Post

1356

We are excited to announce a new internal project, Rome, focused on advancing LLM reasoning. The code and accompanying paper will be released soon. Stay tuned!

3 replies