Submitted by CodeGoat24 80 Unified Reward Model for Multimodal Understanding and Generation · 5 authors 2
Submitted by Nicolas-BZRD 45 EuroBERT: Scaling Multilingual Encoders for European Languages · 19 authors 5
Submitted by liuxuan320 37 S2S-Arena, Evaluating Speech2Speech Protocols on Instruction Following with Paralinguistic Information · 6 authors 1
Submitted by jinheon 30 Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching · 3 authors 2
Submitted by zhixuan-lin 16 Forgetting Transformer: Softmax Attention with a Forget Gate · 4 authors 1
Submitted by akhaliq 12 R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning · 8 authors 1
Submitted by BianYx 11 VideoPainter: Any-length Video Inpainting and Editing with Plug-and-Play Context Control · 7 authors 1
Submitted by akhaliq 9 R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcing Learning · 3 authors 2
Submitted by wbhu-tc 8 TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models · 4 authors 1
Submitted by yunfanj 8 BEHAVIOR Robot Suite: Streamlining Real-World Whole-Body Manipulation for Everyday Household Activities · 10 authors 1
Submitted by akhaliq 6 TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation · 17 authors 1
Submitted by tobiaslee 5 LONGCODEU: Benchmarking Long-Context Language Models on Long Code Understanding · 11 authors 1
Submitted by EliverQ 5 An Empirical Study on Eliciting and Improving R1-like Reasoning Models · 13 authors 1
Submitted by akhaliq 3 R1-Zero's "Aha Moment" in Visual Reasoning on a 2B Non-SFT Model · 6 authors 1
Submitted by wangkevin02 2 Know You First and Be You Better: Modeling Human-Like User Simulators via Implicit Profiles · 6 authors 2
Submitted by hongyanz 1 EAGLE-3: Scaling up Inference Acceleration of Large Language Models via Training-Time Test · 4 authors 1
Submitted by SkiddieAhn 1 AnyAnomaly: Zero-Shot Customizable Video Anomaly Detection with LVLM · 6 authors 1