Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
CohenQu
/
DeepSeek-R1-Distill-Qwen-7B-GRPO
like
4
Text Generation
Transformers
Safetensors
hf-cmu-collab/DeepScaleR-1.5B-Preview_on-policy_GRPO
qwen2
Generated from Trainer
trl
grpo
conversational
text-generation-inference
Inference Endpoints
arxiv:
2402.03300
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
e1a577a
DeepSeek-R1-Distill-Qwen-7B-GRPO
Commit History
Training in progress, step 22
e1a577a
verified
CohenQu
commited on
6 days ago
Training in progress, step 21
d599433
verified
CohenQu
commited on
6 days ago
Training in progress, step 21
00f2050
verified
CohenQu
commited on
6 days ago
Training in progress, step 20
497fc30
verified
CohenQu
commited on
6 days ago
Training in progress, step 20
7612506
verified
CohenQu
commited on
6 days ago
Training in progress, step 19
1b0b64c
verified
CohenQu
commited on
6 days ago
Training in progress, step 19
3c6712a
verified
CohenQu
commited on
6 days ago
Training in progress, step 18
fb839fb
verified
CohenQu
commited on
6 days ago
Training in progress, step 18
14e3c7f
verified
CohenQu
commited on
6 days ago
Training in progress, step 17
8ee496b
verified
CohenQu
commited on
6 days ago
Training in progress, step 17
89f6f1e
verified
CohenQu
commited on
6 days ago
Training in progress, step 16
42c0088
verified
CohenQu
commited on
6 days ago
Training in progress, step 16
3b28218
verified
CohenQu
commited on
6 days ago
Training in progress, step 15
75ecfae
verified
CohenQu
commited on
6 days ago
Training in progress, step 15
0ebf463
verified
CohenQu
commited on
6 days ago
Training in progress, step 14
6bcba43
verified
CohenQu
commited on
6 days ago
Training in progress, step 14
36ee2cb
verified
CohenQu
commited on
6 days ago
Training in progress, step 13
a0ea3df
verified
CohenQu
commited on
6 days ago
Training in progress, step 13
da87e27
verified
CohenQu
commited on
6 days ago
Training in progress, step 12
3071415
verified
CohenQu
commited on
7 days ago
Training in progress, step 12
3b44ff9
verified
CohenQu
commited on
7 days ago
Training in progress, step 11
96c1e0b
verified
CohenQu
commited on
7 days ago
Training in progress, step 11
a135dfa
verified
CohenQu
commited on
7 days ago
Training in progress, step 10
9f7b20c
verified
CohenQu
commited on
7 days ago
Training in progress, step 10
ff88265
verified
CohenQu
commited on
7 days ago
Training in progress, step 9
a1c2da7
verified
CohenQu
commited on
7 days ago
Training in progress, step 9
f017544
verified
CohenQu
commited on
7 days ago
Training in progress, step 8
1100c23
verified
CohenQu
commited on
7 days ago
Training in progress, step 8
b8a6fb5
verified
CohenQu
commited on
7 days ago
Training in progress, step 7
fe766d9
verified
CohenQu
commited on
7 days ago
Training in progress, step 7
e1bfaaf
verified
CohenQu
commited on
7 days ago
Training in progress, step 6
9305a89
verified
CohenQu
commited on
7 days ago
Training in progress, step 6
074aa8b
verified
CohenQu
commited on
7 days ago
Training in progress, step 5
d89ea43
verified
CohenQu
commited on
7 days ago
Training in progress, step 5
5129479
verified
CohenQu
commited on
7 days ago
Training in progress, step 4
29243b5
verified
CohenQu
commited on
7 days ago
Training in progress, step 4
909c2f0
verified
CohenQu
commited on
7 days ago
Training in progress, step 3
3465162
verified
CohenQu
commited on
7 days ago
Training in progress, step 3
d28888c
verified
CohenQu
commited on
7 days ago
Training in progress, step 2
c9c30e4
verified
CohenQu
commited on
7 days ago
Training in progress, step 2
e726f3b
verified
CohenQu
commited on
7 days ago
Training in progress, step 1
3c0f37f
verified
CohenQu
commited on
7 days ago
Training in progress, step 1
345adaf
verified
CohenQu
commited on
7 days ago
End of training
94264dd
verified
CohenQu
commited on
7 days ago
Model save
e8bbf5e
verified
CohenQu
commited on
7 days ago
Training in progress, step 24
e50968a
verified
CohenQu
commited on
7 days ago
End of training
6b00451
verified
CohenQu
commited on
7 days ago
Model save
5a70591
verified
CohenQu
commited on
7 days ago
Training in progress, step 24
2ed1214
verified
CohenQu
commited on
7 days ago
Training in progress, step 23
fb4c3ca
verified
CohenQu
commited on
7 days ago
Previous
1
2
3
...
5
Next