Hugging Face
sophiex/dpo_pythia1b_hh_rlhf.yml_local_29-04-24_13-31-33_xxxxx
PEFT
Safetensors
Generated from Trainer
main
dpo_pythia1b_hh_rlhf.yml_local_29-04-24_13-31-33_xxxxx/adapter_model.safetensors
Commit History
Model save
5fe0d51
verified
sophiex committed on Apr 29, 2024
Training in progress, step 2012
7f88be1
verified
sophiex committed on Apr 29, 2024
Training in progress, step 1509
71f9d97
verified
sophiex committed on Apr 29, 2024
Training in progress, step 1006
6af8543
verified
sophiex committed on Apr 29, 2024
Training in progress, step 503
a07c5a1
verified
sophiex committed on Apr 29, 2024