MoBA: Mixture of Block Attention for Long-Context LLMs Paper • 2502.13189 • Published 20 days ago • 14
HelpSteer2: Open-source dataset for training top-performing reward models Paper • 2406.08673 • Published Jun 12, 2024 • 19
Standard-format-preference-dataset Collection We collect the open-source datasets and process them into the standard format. • 14 items • Updated May 8, 2024 • 24