Cran-May/CohenQu-DeepSeek-R1-Distill-Qwen-1.5B-GRPO-duplicate-fixed-6140715-Q5_K_M-GGUF
Transformers
GGUF
hf-cmu-collab/DeepScaleR-1.5B-Preview_on-policy_GRPO
Generated from Trainer
trl
grpo
llama-cpp
gguf-my-repo
Inference Endpoints
imatrix
conversational
1 contributor
History: 2 commits
Latest commit ee27f80 (verified) by Cran-May, 9 days ago: Upload cohenqu-deepseek-r1-distill-qwen-1.5b-grpo-duplicate-fixed-6140715-q5_k_m-imat.gguf with huggingface_hub
.gitattributes, 1.64 kB, last commit 9 days ago: Upload cohenqu-deepseek-r1-distill-qwen-1.5b-grpo-duplicate-fixed-6140715-q5_k_m-imat.gguf with huggingface_hub
cohenqu-deepseek-r1-distill-qwen-1.5b-grpo-duplicate-fixed-6140715-q5_k_m-imat.gguf, 1.29 GB (LFS), last commit 9 days ago: Upload cohenqu-deepseek-r1-distill-qwen-1.5b-grpo-duplicate-fixed-6140715-q5_k_m-imat.gguf with huggingface_hub