FastRM: An efficient and automatic explainability framework for multimodal generative models Paper β’ 2412.01487 β’ Published Dec 2, 2024 β’ 1
LLaVA-Gemma: Accelerating Multimodal Foundation Models with a Compact Language Model Paper β’ 2404.01331 β’ Published Mar 29, 2024 β’ 25
Getting it Right: Improving Spatial Consistency in Text-to-Image Models Paper β’ 2404.01197 β’ Published Apr 1, 2024 β’ 30
Getting it Right: Improving Spatial Consistency in Text-to-Image Models Paper β’ 2404.01197 β’ Published Apr 1, 2024 β’ 30