Submitted by akhaliq · 26 · MatAnyone: Stable Video Matting with Consistent Memory Propagation · 5 authors · 2
Submitted by Qika · 22 · Self-supervised Quantized Representation for Seamlessly Integrating Knowledge Graphs with Large Language Models · 8 authors · 3
Submitted by nielsr · 10 · DINO-WM: World Models on Pre-trained Visual Features enable Zero-shot Planning · 4 authors · 2
Submitted by akhaliq · 8 · Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming · 43 authors · 5
Submitted by bcywinski · 6 · SAeUron: Interpretable Concept Unlearning in Diffusion Models with Sparse Autoencoders · 2 authors · 2
Submitted by fabian-sp · 5 · The Surprising Agreement Between Convex Optimization Theory and Learning-Rate Scheduling for Large Model Training · 5 authors · 3
Submitted by mirshad7 · 4 · Zero-Shot Novel View and Depth Synthesis with Multi-View Geometric Diffusion · 5 authors · 2
Submitted by odabashi · 3 · Unraveling the Capabilities of Language Models in News Summarization · 2 authors · 3
Submitted by nielsr · 3 · Fast Encoder-Based 3D from Casual Videos via Point Track Processing · 3 authors · 2
Submitted by lwpyh · 2 · INT: Instance-Specific Negative Mining for Task-Generic Promptable Segmentation · 3 authors · 2
Submitted by Dominic789654 · 1 · ChunkKV: Semantic-Preserving KV Cache Compression for Efficient Long-Context LLM Inference · 7 authors · 2