Submitted by akhaliq 156 OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models · 5 authors 17
Submitted by Myashka 108 The Differences Between Direct Alignment Algorithms are a Blur · 5 authors 1
Submitted by ahmed-masry 31 AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding · 22 authors 2
Submitted by jimi888 25 SafeRAG: Benchmarking Security in Retrieval-Augmented Generation of Large Language Model · 11 authors 3
Submitted by RohitGandikota 22 SliderSpace: Decomposing the Visual Capabilities of Diffusion Models · 6 authors 3
Submitted by huanqia 21 MM-IQ: Benchmarking Human-Like Abstraction and Reasoning in Multimodal Models · 3 authors 2
Submitted by yiren98 18 MakeAnything: Harnessing Diffusion Transformers for Multi-Domain Procedural Sequence Generation · 3 authors 2
Submitted by xinyan233333 16 DeepRAG: Thinking to Retrieval Step by Step for Large Language Models · 9 authors 2
Submitted by dongwonjo 14 FastKV: KV Cache Compression for Fast Long-Context Processing with Token-Selective Propagation · 4 authors 2
Submitted by akhaliq 13 ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning · 7 authors 2
Submitted by PAlbert31 9 RandLoRA: Full-rank parameter-efficient fine-tuning of large models · 6 authors 3
Submitted by akhaliq 9 The Jumping Reasoning Curve? Tracking the Evolution of Reasoning Performance in GPT-[n] and o-[n] Models on Multimodal Puzzles · 4 authors 2
Submitted by hba123 8 Almost Surely Safe Alignment of Large Language Models at Inference-Time · 6 authors 2
Submitted by arjunguha 7 PhD Knowledge Not Required: A Reasoning Challenge for Large Language Models · 8 authors 4
Submitted by akshat57 4 Lifelong Sequential Knowledge Editing without Model Degradation · 6 authors 2
Submitted by Bowen232 3 LongDPO: Unlock Better Long-form Generation Abilities for LLMs via Critique-augmented Stepwise Information · 6 authors 2
Submitted by moein99 3 A Study on the Performance of U-Net Modifications in Retroperitoneal Tumor Segmentation · 8 authors 3
Submitted by vshrivas 2 Language Models Prefer What They Know: Relative Confidence Estimation via Confidence Preferences · 3 authors 2
Submitted by EdwinDdeJong 2 Current Pathology Foundation Models are unrobust to Medical Center Differences · 3 authors 2