RLHFlow's Collections
Decision-Tree Reward Models
Updated Feb 5
RLHFlow/Decision-Tree-Reward-Gemma-2-27B • Text Classification • Updated Jan 24 • 94 • 4
RLHFlow/Decision-Tree-Reward-Llama-3.1-8B • Text Classification • Updated Jan 24 • 296 • 5
RLHFlow/LLM-Preferences-HelpSteer2 • Viewer • Updated Feb 5 • 9.13k • 143 • 1