sophiex/dpo_pythia1b_hh_rlhf.yml_local_29-04-24_13-31-33_xxxxx
PEFT
Safetensors
Generated from Trainer
README.md
Commit History
Model save
5fe0d51
verified
sophiex committed on Apr 29, 2024