Hugging Face
CohenQu/DeepSeek-R1-Distill-Qwen-7B-GRPO
Text Generation
Transformers
Safetensors
hf-cmu-collab/DeepScaleR-1.5B-Preview_on-policy_GRPO
qwen2
Generated from Trainer
trl
grpo
conversational
text-generation-inference
Inference Endpoints
arxiv:2402.03300
main
Commit History
Training in progress, step 5
a308364
verified
CohenQu
committed on
4 days ago
Training in progress, step 4
ae0f601
verified
CohenQu
committed on
4 days ago
Training in progress, step 4
7faec54
verified
CohenQu
committed on
4 days ago
Training in progress, step 3
b0de973
verified
CohenQu
committed on
4 days ago
Training in progress, step 3
1a83797
verified
CohenQu
committed on
4 days ago
Training in progress, step 2
6ed5bb8
verified
CohenQu
committed on
4 days ago
Training in progress, step 2
656d92d
verified
CohenQu
committed on
4 days ago
Training in progress, step 1
8deeecf
verified
CohenQu
committed on
4 days ago
Training in progress, step 1
4fa2e62
verified
CohenQu
committed on
4 days ago
End of training
5a56387
verified
CohenQu
committed on
4 days ago
Model save
c60a930
verified
CohenQu
committed on
4 days ago
Training in progress, step 24
b5ef6c0
verified
CohenQu
committed on
4 days ago
End of training
4b87311
verified
CohenQu
committed on
4 days ago
Model save
09f06ab
verified
CohenQu
committed on
4 days ago
Training in progress, step 24
6a00f74
verified
CohenQu
committed on
4 days ago
Training in progress, step 23
4466297
verified
CohenQu
committed on
4 days ago
Training in progress, step 23
2aee565
verified
CohenQu
committed on
4 days ago
Training in progress, step 22
6e3cd40
verified
CohenQu
committed on
4 days ago
Training in progress, step 22
5b25783
verified
CohenQu
committed on
4 days ago
Training in progress, step 21
e1b5832
verified
CohenQu
committed on
4 days ago
Training in progress, step 21
0216f96
verified
CohenQu
committed on
4 days ago
Training in progress, step 20
fc04894
verified
CohenQu
committed on
4 days ago
Training in progress, step 20
78a05e0
verified
CohenQu
committed on
4 days ago
Training in progress, step 19
78aa712
verified
CohenQu
committed on
4 days ago
Training in progress, step 19
1d1dd67
verified
CohenQu
committed on
4 days ago
Training in progress, step 18
087063c
verified
CohenQu
committed on
4 days ago
Training in progress, step 18
b2e0d34
verified
CohenQu
committed on
4 days ago
Training in progress, step 17
60cc1cc
verified
CohenQu
committed on
4 days ago
Training in progress, step 17
573b223
verified
CohenQu
committed on
4 days ago
Training in progress, step 16
e20fc58
verified
CohenQu
committed on
4 days ago
Training in progress, step 16
cdd09e0
verified
CohenQu
committed on
4 days ago
Training in progress, step 15
92bda98
verified
CohenQu
committed on
4 days ago
Training in progress, step 15
f52c859
verified
CohenQu
committed on
4 days ago
Training in progress, step 14
3eafadb
verified
CohenQu
committed on
4 days ago
Training in progress, step 14
607a109
verified
CohenQu
committed on
4 days ago
Training in progress, step 13
47365d0
verified
CohenQu
committed on
4 days ago
Training in progress, step 13
04dbe9f
verified
CohenQu
committed on
4 days ago
Training in progress, step 12
e7d9a32
verified
CohenQu
committed on
4 days ago
Training in progress, step 12
078704e
verified
CohenQu
committed on
4 days ago
Training in progress, step 11
b962ebd
verified
CohenQu
committed on
4 days ago
Training in progress, step 11
75e1dd5
verified
CohenQu
committed on
4 days ago
Training in progress, step 10
ba7407d
verified
CohenQu
committed on
4 days ago
Training in progress, step 10
4034555
verified
CohenQu
committed on
4 days ago
Training in progress, step 9
a4e1137
verified
CohenQu
committed on
4 days ago
Training in progress, step 9
a65e385
verified
CohenQu
committed on
4 days ago
Training in progress, step 8
4ef168a
verified
CohenQu
committed on
4 days ago
Training in progress, step 8
9c11460
verified
CohenQu
committed on
4 days ago
Training in progress, step 7
91e4250
verified
CohenQu
committed on
4 days ago
Training in progress, step 7
ed9585a
verified
CohenQu
committed on
4 days ago
Training in progress, step 6
225b549
verified
CohenQu
committed on
4 days ago