Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Creekside
/
Qwen-3B-gsm8k-GRPO
like
1
Transformers
Safetensors
GGUF
English
qwen2
text-generation-inference
unsloth
trl
grpo
Inference Endpoints
conversational
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Qwen-3B-gsm8k-GRPO
Commit History
(Trained with Unsloth)
68a43ce
verified
Creekside
commited on
about 22 hours ago
(Trained with Unsloth)
e5ec123
verified
Creekside
commited on
about 22 hours ago
(Trained with Unsloth)
c0ebb68
verified
Creekside
commited on
about 22 hours ago
(Trained with Unsloth)
81382bb
verified
Creekside
commited on
about 22 hours ago
Trained with Unsloth
397ad9e
verified
Creekside
commited on
about 22 hours ago
Upload tokenizer
8a89dd6
verified
Creekside
commited on
about 22 hours ago
Upload README.md with huggingface_hub
b7de59a
verified
Creekside
commited on
about 22 hours ago
initial commit
cc62a8a
verified
Creekside
commited on
about 22 hours ago