Submitted by poeroz 56 LLaMA-Omni: Seamless Speech Interaction with Large Language Models · 6 authors 5
Submitted by antonioloison 38 GroUSE: A Benchmark to Evaluate Evaluators in Grounded Question Answering · 4 authors 2
Submitted by Agorium 25 INTRA: Interaction Relationship-aware Weakly Supervised Affordance Grounding · 3 authors 2
Submitted by akhaliq 15 Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis · 9 authors 2
Submitted by akhaliq 15 SaRA: High-Efficient Diffusion Model Fine-tuning with Progressive Sparse Low-Rank Adaptation · 6 authors 2
Submitted by archana14 3 LEIA: Latent View-invariant Embeddings for Implicit 3D Articulation · 6 authors 2