Cran-May/CohenQu-DeepSeek-R1-Distill-Qwen-1.5B-GRPO-duplicate-fixed-6140715-Q5_K_M-GGUF
Transformers
GGUF
hf-cmu-collab/DeepScaleR-1.5B-Preview_on-policy_GRPO
Generated from Trainer
trl
grpo
llama-cpp
gguf-my-repo
Inference Endpoints
imatrix
conversational
main
README.md
Commit History
Upload README.md with huggingface_hub
1bda063
verified
Cran-May
committed
9 days ago