Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
CohenQu
/
DeepSeek-R1-Distill-Qwen-7B-GRPO
like
4
Text Generation
Transformers
Safetensors
hf-cmu-collab/DeepScaleR-1.5B-Preview_on-policy_GRPO
qwen2
Generated from Trainer
trl
grpo
conversational
text-generation-inference
Inference Endpoints
arxiv:
2402.03300
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
DeepSeek-R1-Distill-Qwen-7B-GRPO
/
model.safetensors
Commit History
Training in progress, step 24
1668fba
verified
CohenQu
commited on
2 days ago
Training in progress, step 23
2794326
verified
CohenQu
commited on
2 days ago
Training in progress, step 22
318e23d
verified
CohenQu
commited on
2 days ago
Training in progress, step 21
642ca3c
verified
CohenQu
commited on
2 days ago
Training in progress, step 20
a2aa375
verified
CohenQu
commited on
2 days ago
Training in progress, step 24
1c16e29
verified
CohenQu
commited on
2 days ago
Training in progress, step 19
e21d0c3
verified
CohenQu
commited on
2 days ago
Training in progress, step 23
771ae2f
verified
CohenQu
commited on
2 days ago
Training in progress, step 22
e1ca79a
verified
CohenQu
commited on
2 days ago
Training in progress, step 18
20bc8a1
verified
CohenQu
commited on
2 days ago
Training in progress, step 21
8b876cf
verified
CohenQu
commited on
2 days ago
Training in progress, step 17
0f8817a
verified
CohenQu
commited on
2 days ago
Training in progress, step 20
91db2cf
verified
CohenQu
commited on
2 days ago
Training in progress, step 16
b4ba85f
verified
CohenQu
commited on
2 days ago
Training in progress, step 19
29255ac
verified
CohenQu
commited on
2 days ago
Training in progress, step 15
3461150
verified
CohenQu
commited on
2 days ago
Training in progress, step 18
bb9e530
verified
CohenQu
commited on
2 days ago
Training in progress, step 14
6388f46
verified
CohenQu
commited on
2 days ago
Training in progress, step 17
476acf8
verified
CohenQu
commited on
2 days ago
Training in progress, step 16
665841c
verified
CohenQu
commited on
2 days ago
Training in progress, step 13
405c929
verified
CohenQu
commited on
2 days ago
Training in progress, step 15
44013c0
verified
CohenQu
commited on
2 days ago
Training in progress, step 12
a294d24
verified
CohenQu
commited on
2 days ago
Training in progress, step 14
c4aa3e7
verified
CohenQu
commited on
2 days ago
Training in progress, step 11
cb393aa
verified
CohenQu
commited on
2 days ago
Training in progress, step 13
738cec4
verified
CohenQu
commited on
2 days ago
Training in progress, step 10
a5fe840
verified
CohenQu
commited on
2 days ago
Training in progress, step 12
65471be
verified
CohenQu
commited on
2 days ago
Training in progress, step 11
bcaee7b
verified
CohenQu
commited on
2 days ago
Training in progress, step 9
eb09bb6
verified
CohenQu
commited on
2 days ago
Training in progress, step 10
92c3da1
verified
CohenQu
commited on
2 days ago
Training in progress, step 8
b6c01bc
verified
CohenQu
commited on
2 days ago
Training in progress, step 9
0b5108b
verified
CohenQu
commited on
2 days ago
Training in progress, step 8
50bd66f
verified
CohenQu
commited on
2 days ago
Training in progress, step 7
3d4efcd
verified
CohenQu
commited on
2 days ago
Training in progress, step 7
5c0530b
verified
CohenQu
commited on
2 days ago
Training in progress, step 6
41c5db2
verified
CohenQu
commited on
2 days ago
Training in progress, step 6
2a5872c
verified
CohenQu
commited on
2 days ago
Training in progress, step 5
95b76ae
verified
CohenQu
commited on
2 days ago
Training in progress, step 5
2c40f2d
verified
CohenQu
commited on
2 days ago
Training in progress, step 4
f8bc4cf
verified
CohenQu
commited on
2 days ago
Training in progress, step 4
a8d4834
verified
CohenQu
commited on
2 days ago
Training in progress, step 3
d7aff0a
verified
CohenQu
commited on
2 days ago
Training in progress, step 3
a6b4e49
verified
CohenQu
commited on
2 days ago
Training in progress, step 2
263edf5
verified
CohenQu
commited on
2 days ago
Training in progress, step 2
60c8c20
verified
CohenQu
commited on
2 days ago
Training in progress, step 1
c3f0f82
verified
CohenQu
commited on
2 days ago
Training in progress, step 4
9b700dc
verified
CohenQu
commited on
4 days ago
Training in progress, step 2
fca5cb7
verified
CohenQu
commited on
4 days ago
Training in progress, step 3
93fc97b
verified
CohenQu
commited on
4 days ago
Previous
1
2
3
...
7
Next