Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
arnaudbergeron
/
llama8bgsm_mixed_rwds
like
0
Safetensors
Model card
Files
Files and versions
Community
main
llama8bgsm_mixed_rwds
2 contributors
History:
7 commits
arnaudbergeron
added ppo
27c0260
15 days ago
12_samples_per_episodes_freeze
st added
16 days ago
16_samples_per_episodes_model
st added
16 days ago
16_samples_per_episodes_ppo
added ppo
15 days ago
.gitattributes
Safe
2.03 kB
st added
16 days ago