DeepSeek-R1-Distill-Qwen-7B-GRPO / model-00002-of-00004.safetensors

Commit History

Training in progress, step 80
eb10412
verified

CohenQu commited on

Training in progress, step 60
0fc30e2
verified

CohenQu commited on

Training in progress, step 40
6dc48a1
verified

CohenQu commited on

Training in progress, step 20
fe77746
verified

CohenQu commited on

Training in progress, step 40
069ea2c
verified

CohenQu commited on

Training in progress, step 20
17e8d9c
verified

CohenQu commited on