Structured 3D Latents for Scalable and Versatile 3D Generation Paper ā¢ 2412.01506 ā¢ Published Dec 2, 2024 ā¢ 51
On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes Paper ā¢ 2306.13649 ā¢ Published Jun 23, 2023 ā¢ 17
Cautious Optimizers: Improving Training with One Line of Code Paper ā¢ 2411.16085 ā¢ Published Nov 25, 2024 ā¢ 15
Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency Paper ā¢ 2409.02634 ā¢ Published Sep 4, 2024 ā¢ 92
Memory-Efficient LLM Training with Online Subspace Descent Paper ā¢ 2408.12857 ā¢ Published Aug 23, 2024 ā¢ 13
view article Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community Apr 15, 2024 ā¢ 171
Longhorn: State Space Models are Amortized Online Learners Paper ā¢ 2407.14207 ā¢ Published Jul 19, 2024 ā¢ 18
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks Paper ā¢ 2311.06242 ā¢ Published Nov 10, 2023 ā¢ 87
The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry Paper ā¢ 2402.04347 ā¢ Published Feb 6, 2024 ā¢ 13
Towards Modular LLMs by Building and Reusing a Library of LoRAs Paper ā¢ 2405.11157 ā¢ Published May 18, 2024 ā¢ 28
SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts Paper ā¢ 2405.07518 ā¢ Published May 13, 2024 ā¢ 24
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone Paper ā¢ 2404.14219 ā¢ Published Apr 22, 2024 ā¢ 254
Efficiently Adapting Pretrained Language Models To New Languages Paper ā¢ 2311.05741 ā¢ Published Nov 9, 2023 ā¢ 11
When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method Paper ā¢ 2402.17193 ā¢ Published Feb 27, 2024 ā¢ 23
Training-Free Long-Context Scaling of Large Language Models Paper ā¢ 2402.17463 ā¢ Published Feb 27, 2024 ā¢ 19
EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions Paper ā¢ 2402.17485 ā¢ Published Feb 27, 2024 ā¢ 190
Mamba: Linear-Time Sequence Modeling with Selective State Spaces Paper ā¢ 2312.00752 ā¢ Published Dec 1, 2023 ā¢ 139
Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding Paper ā¢ 2306.02858 ā¢ Published Jun 5, 2023 ā¢ 19
PIE: Simulating Disease Progression via Progressive Image Editing Paper ā¢ 2309.11745 ā¢ Published Sep 21, 2023 ā¢ 3