Augerau/qwen2.5-grpo-gsm8k-rtx3060
Transformers
Safetensors
GGUF
English
qwen2
text-generation-inference
unsloth
trl
grpo
Inference Endpoints
conversational
License: apache-2.0
main
Commit History
(Trained with Unsloth)
9af2c9f
verified
Augerau
committed on
1 day ago
(Trained with Unsloth)
5e92633
verified
Augerau
committed on
1 day ago
Trained with Unsloth
94a3f62
verified
Augerau
committed on
1 day ago
Trained with Unsloth
369597b
verified
Augerau
committed on
1 day ago
Upload tokenizer
c8355ad
verified
Augerau
committed on
1 day ago
Upload README.md with huggingface_hub
5324580
verified
Augerau
committed on
1 day ago
initial commit
94d6847
verified
Augerau
committed on
1 day ago