Open R1

Enterprise

community

https://github.com/huggingface/open-r1

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

guipenedo updated a dataset about 19 hours ago

open-r1/OpenThoughts-114k-math

lewtun updated a collection about 21 hours ago

🧠 Reasoning datasets

loubnabnl updated a collection 2 days ago

🧠 Reasoning datasets

View all activity

open-r1's activity

guipenedo

updated a dataset about 19 hours ago

open-r1/OpenThoughts-114k-math

Viewer • Updated about 19 hours ago • 89.1k • 13 • 10

lewtun

updated a collection about 21 hours ago

🧠 Reasoning datasets

Collection

Datasets with reasoning traces for math and code released by the community • 5 items • Updated about 19 hours ago • 15

cfahlgren1

posted an update 2 days ago

Post

1636

If you haven't seen yet, we just released Inference Providers 🔀

> 4 new serverless inference providers on the Hub 🤯
> Use your HF API key or personal key with all providers 🔑
> Chat with Deepseek R1, V3, and more on HF Hub 🐋
> We support Sambanova, TogetherAI, Replicate, and Fal.ai 💪

Best of all, we don't charge any markup on top of the provider 🫰 Have you tried it out yet? HF Pro accounts get $2 of free usage for the provider inference.

loubnabnl

updated a collection 2 days ago

🧠 Reasoning datasets

Collection

Datasets with reasoning traces for math and code released by the community • 5 items • Updated about 19 hours ago • 15

edbeeching

in open-r1/README 3 days ago

Reproducing Deepseek's numbers for MATH-500

#3 opened 3 days ago by

edbeeching

eliebak

in open-r1/README 3 days ago

Recommend a dataset in the scientific domain made by us: EricLu/SCP-116K

#2 opened 3 days ago by

EricLu

LLM Benchmarks and Data Leakage

#1 opened 3 days ago by

dvamvour

eliebak

updated a Space 3 days ago

Running

📈

README

loubnabnl

updated a Space 3 days ago

Running

📈

README

loubnabnl

updated a collection 3 days ago

🧠 Reasoning datasets

Collection

Datasets with reasoning traces for math and code released by the community • 5 items • Updated about 19 hours ago • 15

eliebak

published a Space 3 days ago

Running

📈

README

burtenshaw

posted an update 4 days ago

Post

2310

Manic few days in open source AI, with game changing development all over the place. Here's a round up of the resources:

- The science team at @huggingface reproduced and open source the seek r1. https://github.com/huggingface/open-r1
- @qwen released a series of models with 1 million token context! https://qwenlm.github.io/blog/qwen2.5-1m/
- SmolVLM got even smaller with completely new variants at 256m and 500m https://huggingface.co/blog/smolervlm

There's so much you could do with these developments. Especially combining them together into agentic applications or fine-tuning them on your use case.

1 reply

lewtun

posted an update 6 days ago

Post

9484

We are reproducing the full DeepSeek R1 data and training pipeline so everybody can use their recipe. Instead of doing it in secret we can do it together in the open!

🧪 Step 1: replicate the R1-Distill models by distilling a high-quality reasoning corpus from DeepSeek-R1.

🧠 Step 2: replicate the pure RL pipeline that DeepSeek used to create R1-Zero. This will involve curating new, large-scale datasets for math, reasoning, and code.

🔥 Step 3: show we can go from base model -> SFT -> RL via multi-stage training.

Follow along: https://github.com/huggingface/open-r1

5 replies

burtenshaw

posted an update 7 days ago

Post

683

Hey 👋

I'm helping out on some community research to learn about the AI community. If you want to join in the conversation, head over here where I started a community discussion on the most influential model since BERT.

OSAIResearchCommunity/README#2

burtenshaw

posted an update 7 days ago

Post

1446

📣 Teachers and Students! Here's a handy quiz app if you're preparing your own study material.

TLDR, It's a quiz that uses a dataset to make questions and save answers

Here's how it works:

- make a dataset of multiple choice questions
- duplicate the space add set the dataset repo
- log in and do the quiz
- submit the questions to create a new dataset

I made this to get ready for the agents course, but I hope it's useful for you projects too!

quiz app burtenshaw/dataset_quiz

dataset with questions burtenshaw/exam_questions

agents course we're working on https://huggingface.co/agents-course

burtenshaw

posted an update 7 days ago

Post

2091

AI was built on side projects!

andito

posted an update 8 days ago

Post

1451

𝗜𝗻𝘁𝗿𝗼𝗱𝘂𝗰𝗶𝗻𝗴 𝘁𝗵𝗲 𝘄𝗼𝗿𝗹𝗱'𝘀 𝘀𝗺𝗮𝗹𝗹𝗲𝘀𝘁 𝘃𝗶𝘀𝗶𝗼𝗻 𝗹𝗮𝗻𝗴𝘂𝗮𝗴𝗲 𝗺𝗼𝗱𝗲𝗹!

We’re thrilled to share 𝗦𝗺𝗼𝗹𝗩𝗟𝗠 (256M & 500M)—the smallest Visual Language Models ever built. Think: running on <1GB of GPU memory—you can fine-tune it on your laptop and run it on your toaster!

Why It’s Game-Changing:
- 𝗢𝘂𝘁𝗽𝗲𝗿𝗳𝗼𝗿𝗺𝘀 𝗟𝗮𝗿𝗴𝗲𝗿 𝗠𝗼𝗱𝗲𝗹𝘀: Even the 256M model surpasses our SOTA 80B-parameter model from just 17 months ago. Over 300x reduction!
𝗠𝗶𝗴𝗵𝘁𝘆 𝗘𝗳𝗳𝗶𝗰𝗶𝗲𝗻𝗰𝘆: The 256M version delivers 80% of our 2.2B model’s performance, and the 500M version hits 90%
𝗟𝗶𝗴𝗵𝘁𝗻𝗶𝗻𝗴-𝗙𝗮𝘀𝘁 𝗦𝗲𝗮𝗿𝗰𝗵: SmolVLM integrates with ColiPali for state-of-the-art retrieval speeds—on par with models 10x bigger. That means cheaper, faster indexing and real-world impact.

What’s New Under the Hood:
- 𝗡𝗲𝘄 𝗩𝗶𝘀𝗶𝗼𝗻 𝗘𝗻𝗰𝗼𝗱𝗲𝗿: Smaller overall size (400M -> 93M), but with higher resolution.
- 𝗛𝗶𝗴𝗵𝗲𝗿 𝗣𝗶𝘅𝗲𝗹𝘀/𝗧𝗼𝗸𝗲𝗻: 4096 vs. 1820—more efficient image processing.
- 𝗦𝗺𝗮𝗿𝘁 𝗧𝗼𝗸𝗲𝗻𝗶𝘇𝗮𝘁𝗶𝗼𝗻: Faster training and a performance boost.

Check our blog: https://huggingface.co/blog/smolervlm
The models: HuggingFaceTB/smolvlm-256m-and-500m-6791fafc5bb0ab8acc960fb0
The demo: HuggingFaceTB/SmolVLM-256M-Demo

1 reply

burtenshaw

posted an update 9 days ago

Post

3488

🚧 Work in Progress! 🚧

👷‍♀️ We're working hard on getting the official agents course ready for the 50,000 students that have signed up.

If you want to contribute to the discussion, I started these community posts. Looking forward to hearing from you:

- smolagents unit in the agents course - agents-course/README#7
- LlamaIndex Unit in the agents course - agents-course/README#6
- LangChain and LangGraph unit in the agents course - agents-course/README#5
- Real world use cases in the agents course - agents-course/README#8

AI & ML interests

Recent Activity

Team members 26

open-r1's activity

Reproducing Deepseek's numbers for MATH-500

Recommend a dataset in the scientific domain made by us: EricLu/SCP-116K

LLM Benchmarks and Data Leakage

README

README

README