O1 - a jzwong Collection

jzwong 's Collections

LLM

SYS

O1

MLLM

O1

updated 3 days ago

LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs

Paper • 2501.06186 • Published Jan 10 • 61
Competitive Programming with Large Reasoning Models

Paper • 2502.06807 • Published 17 days ago • 59
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published 11 days ago • 132
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

Paper • 2502.06781 • Published 11 days ago • 56
LIMO: Less is More for Reasoning

Paper • 2502.03387 • Published 16 days ago • 56
s1: Simple test-time scaling

Paper • 2501.19393 • Published 21 days ago • 105