facebook/roberta-hate-speech-dynabench-r4-target • Text Classification • Updated Mar 16, 2023 • 1.53M • 70
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking • Paper • 2501.04519 • Published 19 days ago • 249
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models • Paper • 2501.03262 • Published 24 days ago • 87