CohenQu/DeepSeek-R1-Distill-Qwen-7B-GRPO
Text Generation
Transformers
Safetensors
hf-cmu-collab/DeepScaleR-1.5B-Preview_on-policy_GRPO
qwen2
Generated from Trainer
trl
grpo
conversational
text-generation-inference
Inference Endpoints
arxiv:2402.03300
main
DeepSeek-R1-Distill-Qwen-7B-GRPO
Commit History
Training in progress, step 8 | 1100c23 | verified | CohenQu committed 7 days ago
Training in progress, step 8 | b8a6fb5 | verified | CohenQu committed 7 days ago
Training in progress, step 7 | fe766d9 | verified | CohenQu committed 7 days ago
Training in progress, step 7 | e1bfaaf | verified | CohenQu committed 7 days ago
Training in progress, step 6 | 9305a89 | verified | CohenQu committed 7 days ago
Training in progress, step 6 | 074aa8b | verified | CohenQu committed 7 days ago
Training in progress, step 5 | d89ea43 | verified | CohenQu committed 7 days ago
Training in progress, step 5 | 5129479 | verified | CohenQu committed 7 days ago
Training in progress, step 4 | 29243b5 | verified | CohenQu committed 7 days ago
Training in progress, step 4 | 909c2f0 | verified | CohenQu committed 7 days ago
Training in progress, step 3 | 3465162 | verified | CohenQu committed 7 days ago
Training in progress, step 3 | d28888c | verified | CohenQu committed 7 days ago
Training in progress, step 2 | c9c30e4 | verified | CohenQu committed 7 days ago
Training in progress, step 2 | e726f3b | verified | CohenQu committed 7 days ago
Training in progress, step 1 | 3c0f37f | verified | CohenQu committed 7 days ago
Training in progress, step 1 | 345adaf | verified | CohenQu committed 7 days ago
End of training | 94264dd | verified | CohenQu committed 7 days ago
Model save | e8bbf5e | verified | CohenQu committed 7 days ago
Training in progress, step 24 | e50968a | verified | CohenQu committed 7 days ago
End of training | 6b00451 | verified | CohenQu committed 7 days ago
Model save | 5a70591 | verified | CohenQu committed 7 days ago
Training in progress, step 24 | 2ed1214 | verified | CohenQu committed 7 days ago
Training in progress, step 23 | fb4c3ca | verified | CohenQu committed 7 days ago
Training in progress, step 23 | ee575dc | verified | CohenQu committed 7 days ago
Training in progress, step 22 | d80e95d | verified | CohenQu committed 7 days ago
Training in progress, step 22 | ca7c62a | verified | CohenQu committed 7 days ago
Training in progress, step 21 | 48275f2 | verified | CohenQu committed 7 days ago
Training in progress, step 21 | 79822bf | verified | CohenQu committed 7 days ago
Training in progress, step 20 | ce72b73 | verified | CohenQu committed 7 days ago
Training in progress, step 20 | ac7c647 | verified | CohenQu committed 7 days ago
Training in progress, step 19 | 481c1b7 | verified | CohenQu committed 7 days ago
Training in progress, step 19 | c2f6ab5 | verified | CohenQu committed 7 days ago
Training in progress, step 18 | efc1a67 | verified | CohenQu committed 7 days ago
Training in progress, step 18 | 17364b0 | verified | CohenQu committed 7 days ago
Training in progress, step 17 | 7fc728e | verified | CohenQu committed 7 days ago
Training in progress, step 17 | 9320f94 | verified | CohenQu committed 7 days ago
Training in progress, step 16 | 0df0958 | verified | CohenQu committed 7 days ago
Training in progress, step 16 | f397ee3 | verified | CohenQu committed 7 days ago
Training in progress, step 15 | db70d00 | verified | CohenQu committed 7 days ago
Training in progress, step 15 | c96c7b7 | verified | CohenQu committed 7 days ago
Training in progress, step 14 | 0494053 | verified | CohenQu committed 7 days ago
Training in progress, step 14 | dae6fe7 | verified | CohenQu committed 7 days ago
Training in progress, step 13 | c9244b8 | verified | CohenQu committed 7 days ago
Training in progress, step 13 | da3563a | verified | CohenQu committed 7 days ago
Training in progress, step 12 | 81f2a59 | verified | CohenQu committed 7 days ago
Training in progress, step 12 | 77a4d0a | verified | CohenQu committed 7 days ago
Training in progress, step 11 | 4275b3a | verified | CohenQu committed 7 days ago
Training in progress, step 11 | 78d3c6d | verified | CohenQu committed 7 days ago
Training in progress, step 10 | 9f1c97e | verified | CohenQu committed 7 days ago
Training in progress, step 10 | 5254b21 | verified | CohenQu committed 7 days ago