Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
Together AI
Novita
Cerebras
Hyperbolic
Fireworks
fal
SambaNova
Replicate
Nebius AI Studio
HF Inference API
Misc
Reset Misc
GRPO
Inference Endpoints
text-generation-inference
AutoTrain Compatible
Merge
4-bit precision
custom_code
Misc with no match
Eval Results
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
66
Full-text search
Edit filters
Sort: Trending
Active filters:
GRPO
Clear all
mradermacher/Text2Graph-R1-Qwen2.5-0.5b-i1-GGUF
Updated
Feb 9
•
120
alpha-ai/Deep-Reason-SMALL-V0-GGUF
Updated
15 days ago
•
283
•
1
alpha-ai/Deep-Reason-SMALL-V0
Text Generation
•
Updated
15 days ago
•
37
•
2
alpha-ai/qwen2.5-reason-thought-lite-GGUF
Updated
15 days ago
•
301
alpha-ai/qwen2.5-reason-thought-lite
Text Generation
•
Updated
15 days ago
•
34
alpha-ai/llama-3.2-3B-Reason-Reflect-Lite
Text Generation
•
Updated
15 days ago
•
52
Daemontatox/Cogito-R1
Text Generation
•
Updated
22 days ago
•
347
•
5
mradermacher/Cogito-R1-GGUF
Updated
28 days ago
•
1.11k
accuracy-maker/Llama-3.2-1B-GRPO-gsm8k
Text Generation
•
Updated
29 days ago
•
16
mradermacher/Cogito-R1-i1-GGUF
Updated
28 days ago
•
1.85k
alpha-ai/Reason-With-Choice-3B-GGUF
Updated
15 days ago
•
605
alpha-ai/Reason-With-Choice-3B
Text Generation
•
Updated
15 days ago
•
66
mradermacher/Reason-With-Choice-3B-GGUF
Updated
24 days ago
•
363
Daemontatox/PathFinderAI-S1
Text Generation
•
Updated
22 days ago
•
257
mradermacher/SmolLM2_135M_Grpo_Checkpoint-GGUF
Updated
22 days ago
•
283
mradermacher/SmolLM2_135M_Grpo_Gsm8k-GGUF
Updated
22 days ago
•
261
mradermacher/SmolLM2_135M_Grpo_Gsm8k-i1-GGUF
Updated
22 days ago
•
484
mradermacher/PathFinderAI-S1-GGUF
Updated
21 days ago
•
700
TimeLordRaps/PathFinderAI-S1-Q4_K_M-GGUF
Text Generation
•
Updated
22 days ago
•
38
mradermacher/SmolLM2_135M_Grpo_Checkpoint-i1-GGUF
Updated
22 days ago
•
462
mradermacher/PathFinderAI-S1-i1-GGUF
Updated
21 days ago
•
1.03k
Rivaidan/Captain-Eris_Violet-GRPO-v0.420-Q8_0-GGUF
Updated
17 days ago
•
32
nharshavardhana/SmolGRPO-135M
Text Generation
•
Updated
9 days ago
•
10
TheMelonGod/Captain-Eris_Violet-GRPO-v0.420-exl2
Text Generation
•
Updated
3 days ago
•
96
Lingyue1/SmolGRPO-135M
Text Generation
•
Updated
8 days ago
•
3
t2190/SmolGRPO-135M
Text Generation
•
Updated
7 days ago
•
23
t2190/GRPO_1
Text Generation
•
Updated
about 13 hours ago
•
11
kaweizhenpi/SmolGRPO-135M
Text Generation
•
Updated
6 days ago
•
8
Shumatsurontek/SmolGRPO-135M
Text Generation
•
Updated
4 days ago
alperenyildiz/SmolGRPO-135M
Text Generation
•
Updated
2 days ago
•
19
Previous
1
2
3
Next