YOLOv12: Attention-Centric Real-Time Object Detectors Paper • 2502.12524 • Published 11 days ago • 10
You Do Not Fully Utilize Transformer's Representation Capacity Paper • 2502.09245 • Published 16 days ago • 33
Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models Paper • 2502.13533 • Published 10 days ago • 9
Enhancing Cognition and Explainability of Multimodal Foundation Models with Self-Synthesized Data Paper • 2502.14044 • Published 9 days ago • 7
From RAG to Memory: Non-Parametric Continual Learning for Large Language Models Paper • 2502.14802 • Published 8 days ago • 11
Discovering highly efficient low-weight quantum error-correcting codes with reinforcement learning Paper • 2502.14372 • Published 9 days ago • 35
How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM? Paper • 2502.14502 • Published 9 days ago • 80
TAG: A Decentralized Framework for Multi-Agent Hierarchical Reinforcement Learning Paper • 2502.15425 • Published 8 days ago • 7
Reflective Planning: Vision-Language Models for Multi-Stage Long-Horizon Robotic Manipulation Paper • 2502.16707 • Published 5 days ago • 10
Curie: Toward Rigorous and Automated Scientific Experimentation with AI Agents Paper • 2502.16069 • Published 7 days ago • 16
R18-Novels-galgame Collection Novels; galgame; visual novels; 小说; 剧本; roleplay; sq; ghs; hentai; R18; NSFW; 涩情; 涩涩; 瑟瑟; 色色; 可爱; 美少女 • 61 items • Updated 11 days ago • 36
Running 1.79k 1.79k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
Model Merging Collection Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 232