The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding Paper • 2502.08946 • Published 25 days ago • 182
QuEST: Stable Training of LLMs with 1-Bit Weights and Activations Paper • 2502.05003 • Published Feb 7 • 43
AuraFusion360: Augmented Unseen Region Alignment for Reference-based 360° Unbounded Scene Inpainting Paper • 2502.05176 • Published about 1 month ago • 32
Show-o Turbo: Towards Accelerated Unified Multimodal Understanding and Generation Paper • 2502.05415 • Published about 1 month ago • 22
Retrieval-augmented Large Language Models for Financial Time Series Forecasting Paper • 2502.05878 • Published 29 days ago • 39
BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models Paper • 2502.07346 • Published 27 days ago • 51
NoLiMa: Long-Context Evaluation Beyond Literal Matching Paper • 2502.05167 • Published about 1 month ago • 15
Ignore the KL Penalty! Boosting Exploration on Critical Tokens to Enhance RL Fine-Tuning Paper • 2502.06533 • Published 28 days ago • 18