SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems Paper • 2401.03945 • Published Jan 8, 2024
SpeechAlign: Aligning Speech Generation to Human Preferences Paper • 2404.05600 • Published Apr 8, 2024
UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model Paper • 2408.02503 • Published Aug 5, 2024
Advancing Fine-Grained Visual Understanding with Multi-Scale Alignment in Multi-Modal Models Paper • 2411.09691 • Published Nov 14, 2024
QCRD: Quality-guided Contrastive Rationale Distillation for Large Language Models Paper • 2405.13014 • Published May 14, 2024