SuperCorrect: Supervising and Correcting Language Models with Error-Driven Insights Paper β’ 2410.09008 β’ Published Oct 11, 2024 β’ 17
HuggingFaceTB/SmolVLM2-256M-Video-Instruct Image-Text-to-Text β’ Updated 4 days ago β’ 4.16k β’ 38
Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning Paper β’ 2502.14768 β’ Published 18 days ago β’ 44
Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment Paper β’ 2502.16894 β’ Published 14 days ago β’ 26
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition β’ Updated 2 days ago β’ 231k β’ 1.05k
Physics of Language Models: Part 1, Context-Free Grammar Paper β’ 2305.13673 β’ Published May 23, 2023 β’ 7
LoRA: Low-Rank Adaptation of Large Language Models Paper β’ 2106.09685 β’ Published Jun 17, 2021 β’ 35
Physics of Language Models: Part 2.2, How to Learn From Mistakes on Grade-School Math Problems Paper β’ 2408.16293 β’ Published Aug 29, 2024 β’ 26
Physics of Language Models: Part 2.1, Grade-School Math and the Hidden Reasoning Process Paper β’ 2407.20311 β’ Published Jul 29, 2024 β’ 5
Physics of Language Models: Part 3.3, Knowledge Capacity Scaling Laws Paper β’ 2404.05405 β’ Published Apr 8, 2024 β’ 10
Physics of Language Models: Part 3.2, Knowledge Manipulation Paper β’ 2309.14402 β’ Published Sep 25, 2023 β’ 7
Physics of Language Models: Part 3.1, Knowledge Storage and Extraction Paper β’ 2309.14316 β’ Published Sep 25, 2023 β’ 8
DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks Paper β’ 2502.17157 β’ Published 14 days ago β’ 51