OpenRLHF

community

https://github.com/OpenRLHF

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

Longhui98 authored a paper 3 days ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Longhui98 authored a paper 3 days ago

Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision

Longhui98 authored a paper 3 days ago

Forward-Backward Reasoning in Large Language Models for Mathematical Verification

View all activity

OpenRLHF's activity

Longhui98

authored 4 papers 3 days ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published 6 days ago • 63

Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision

Paper • 2403.09472 • Published Mar 14, 2024 • 1

Forward-Backward Reasoning in Large Language Models for Mathematical Verification

Paper • 2308.07758 • Published Aug 15, 2023 • 4

DeepVecFont-v2: Exploiting Transformers to Synthesize Vector Fonts with Higher Quality

Paper • 2303.14585 • Published Mar 25, 2023

catqaq

updated a dataset about 2 months ago

OpenRLHF/prompt-collection-v0.1-dev-100k

Viewer • Updated Dec 13, 2024 • 102k • 70

chuyi777

updated 3 models about 2 months ago

chuyi777

updated a model 3 months ago

OpenRLHF/Mistral-7b-PRM-Math-Shepherd

Updated Oct 30, 2024 • 4 • 1

chuyi777

in OpenRLHF/Mistral-7b-PRM-Math-Shepherd 3 months ago

怎么下载模型呢？

#1 opened 3 months ago by

Yutong001

chuyi777

updated a model 6 months ago

OpenRLHF/Llama-3-8b-iter-dpo-179k

Text Generation • Updated Jul 28, 2024 • 12

chuyi777

updated a dataset 7 months ago

OpenRLHF/preference_700K

Viewer • Updated Jul 13, 2024 • 700k • 50 • 1

chuyi777

updated a model 7 months ago

OpenRLHF/Llama-3-8b-rlhf-100k

Text Generation • Updated Jun 24, 2024 • 156 • 3

chuyi777

updated 2 datasets 8 months ago

OpenRLHF/prompt-collection-v0.1

Viewer • Updated Jun 14, 2024 • 179k • 2.34k • 6

OpenRLHF/preference_dataset_mixture2_and_safe_pku

Viewer • Updated Jun 14, 2024 • 555k • 881 • 2

chuyi777

updated a model 8 months ago

OpenRLHF/Llama-3-8b-sft-mixture

Text Generation • Updated Jun 14, 2024 • 12.1k • 1

catqaq

authored a paper 8 months ago

OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework

Paper • 2405.11143 • Published May 20, 2024 • 36

atsushi3110

authored 2 papers 10 months ago

Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence

Paper • 2404.05892 • Published Apr 8, 2024 • 33

RWKV: Reinventing RNNs for the Transformer Era

Paper • 2305.13048 • Published May 22, 2023 • 15

ZhangRC

authored a paper 10 months ago

Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence

Paper • 2404.05892 • Published Apr 8, 2024 • 33

AI & ML interests

Recent Activity

Team members 7

OpenRLHF's activity

怎么下载模型呢？