arnaudbergeron/llama8bgsm_mixed_rwds
Safetensors
llama8bgsm_mixed_rwds/16_samples_per_episodes_ppo (branch: main)
2 contributors. History: 1 commit.
Latest commit: arnaudbergeron, "added ppo" (27c0260), 16 days ago
hf_pretrained              added ppo    16 days ago
merged_model_sf_tensors    added ppo    16 days ago