CohenQu/DeepSeek-R1-Distill-Qwen-7B-GRPO
Text Generation
Transformers
Safetensors
hf-cmu-collab/DeepScaleR-1.5B-Preview_on-policy_GRPO
qwen2
Generated from Trainer
trl
grpo
conversational
text-generation-inference
Inference Endpoints
arxiv:2402.03300
Commit History (branch: main)
Training in progress, step 17 (88632ae, verified), CohenQu committed 9 days ago
Training in progress, step 16 (9052815, verified), CohenQu committed 9 days ago
Training in progress, step 15 (76337b7, verified), CohenQu committed 9 days ago
Training in progress, step 14 (3cf3755, verified), CohenQu committed 9 days ago
Training in progress, step 13 (9acac60, verified), CohenQu committed 9 days ago
Training in progress, step 12 (2dd44e3, verified), CohenQu committed 9 days ago
Training in progress, step 11 (5900317, verified), CohenQu committed 9 days ago
Training in progress, step 9 (cbeff76, verified), CohenQu committed 9 days ago
Training in progress, step 8 (473a5b2, verified), CohenQu committed 9 days ago
Training in progress, step 7 (e774e36, verified), CohenQu committed 9 days ago
Training in progress, step 6 (0ae729a, verified), CohenQu committed 9 days ago
Training in progress, step 5 (5da45db, verified), CohenQu committed 9 days ago
Training in progress, step 4 (91f02da, verified), CohenQu committed 9 days ago
Training in progress, step 3 (4f28678, verified), CohenQu committed 9 days ago
Training in progress, step 2 (fff582c, verified), CohenQu committed 9 days ago
Training in progress, step 1 (b7cd301, verified), CohenQu committed 9 days ago
End of training (6140715, verified), CohenQu committed 10 days ago
Model save (05405a0, verified), CohenQu committed 10 days ago
Training in progress, step 48 (22f4bfe, verified), CohenQu committed 10 days ago
End of training (21cbb92, verified), CohenQu committed 10 days ago
Model save (2d4483d, verified), CohenQu committed 10 days ago
Training in progress, step 48 (f298c20, verified), CohenQu committed 10 days ago
Training in progress, step 44 (5a012aa, verified), CohenQu committed 10 days ago
Training in progress, step 44 (30ea9cf, verified), CohenQu committed 10 days ago
Training in progress, step 40 (48c8f98, verified), CohenQu committed 10 days ago
Training in progress, step 40 (ebf4a35, verified), CohenQu committed 10 days ago
Training in progress, step 36 (9558c9f, verified), CohenQu committed 10 days ago
Training in progress, step 36 (5de7f5d, verified), CohenQu committed 10 days ago
Training in progress, step 32 (de0d0c9, verified), CohenQu committed 10 days ago
Training in progress, step 32 (cfeb2d8, verified), CohenQu committed 10 days ago
Training in progress, step 28 (32ae4b3, verified), CohenQu committed 10 days ago
Training in progress, step 28 (b37d6af, verified), CohenQu committed 10 days ago
Training in progress, step 24 (3a387e2, verified), CohenQu committed 10 days ago
Training in progress, step 24 (54831ad, verified), CohenQu committed 10 days ago
Training in progress, step 20 (ddfd928, verified), CohenQu committed 10 days ago
Training in progress, step 20 (48ba2bf, verified), CohenQu committed 10 days ago
Training in progress, step 16 (b66c45a, verified), CohenQu committed 10 days ago
Training in progress, step 16 (a33d452, verified), CohenQu committed 10 days ago
Training in progress, step 12 (8f4965c, verified), CohenQu committed 10 days ago
Training in progress, step 12 (31c5df9, verified), CohenQu committed 10 days ago
Training in progress, step 8 (ef7c58e, verified), CohenQu committed 10 days ago
Training in progress, step 8 (6f78024, verified), CohenQu committed 10 days ago
Training in progress, step 4 (342404a, verified), CohenQu committed 10 days ago
Training in progress, step 4 (45b7fce, verified), CohenQu committed 10 days ago
End of training (f456fa7, verified), CohenQu committed 11 days ago
Model save (b8f9ded, verified), CohenQu committed 11 days ago
Training in progress, step 35 (5fbe824, verified), CohenQu committed 11 days ago
Training in progress, step 30 (4a9c570, verified), CohenQu committed 12 days ago
Training in progress, step 25 (6d831ff, verified), CohenQu committed 12 days ago
Training in progress, step 20 (a4719fb, verified), CohenQu committed 12 days ago