Qwen-2.5-7B-Simple-RL / train_results.json
Commit f7bd884 (verified) by shuheikurita: Model save
{
    "total_flos": 0.0,
    "train_loss": 4.3852842645719646e-05,
    "train_runtime": 520.3847,
    "train_samples": 7500,
    "train_samples_per_second": 1.441,
    "train_steps_per_second": 0.01
}
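
The throughput fields can be cross-checked against the runtime. The sketch below is not part of the repository. It assumes that `train_samples_per_second` and `train_steps_per_second` are totals divided by `train_runtime` (in seconds), and the helper name `implied_counts` is hypothetical. Under that assumption the run processed roughly 750 samples, about a tenth of the 7500 listed under `train_samples`. The step count is only approximate because the steps-per-second value is rounded.

```python
import json

# Metrics copied verbatim from train_results.json above.
RAW = """
{
    "total_flos": 0.0,
    "train_loss": 4.3852842645719646e-05,
    "train_runtime": 520.3847,
    "train_samples": 7500,
    "train_samples_per_second": 1.441,
    "train_steps_per_second": 0.01
}
"""


def implied_counts(metrics):
    """Back out totals from the rate fields (hypothetical helper).

    Assumes each rate is a total divided by train_runtime in seconds.
    """
    runtime = metrics["train_runtime"]
    samples_seen = metrics["train_samples_per_second"] * runtime
    steps = metrics["train_steps_per_second"] * runtime
    return {
        "samples_seen": samples_seen,
        "steps": steps,
        # Share of the listed dataset size that the rates account for.
        "fraction_of_dataset": samples_seen / metrics["train_samples"],
    }


metrics = json.loads(RAW)
derived = implied_counts(metrics)
for name, value in derived.items():
    print(f"{name}: {value:.3f}")
```

The gap between the implied sample count and `train_samples` suggests the run stopped well short of one pass over the data, although the file itself does not say why.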