Hugging Face
Models
1,239
Sort: Trending
Active filters: grpo
Augerau/qwen2.5-grpo-gsm8k • Text Generation • Updated about 7 hours ago • 4
mradermacher/Qwen-2.5-Math-7B-Max-v3-GGUF • Updated about 14 hours ago • 244
ksanjeeb/Guru-R0 • Updated about 14 hours ago • 62
Typemaster32/deepseek-v1.5b-finetuned • Updated about 11 hours ago • 34
krinetic1234/Llama-3B-Open-R1-GRPO • Text Generation • Updated about 8 hours ago
niklasm222/qwen2.5-3b-grpo-1.7k-gsm8k-prolog-v1 • Text Generation • Updated about 4 hours ago
Augerau/qwen2.5-grpo-gsm8k-rtx3060 • Text Generation • Updated about 1 hour ago
quinnhe/llama3.1_8b_grpo • Text Generation • Updated about 3 hours ago
sravanthib/multinode-try • Updated 2 minutes ago