Collections

Discover the best community collections!

Collections including paper arxiv:2310.03744
LLaVa-NeXT
LLaVa-NeXT (also known as LLaVa-1.6) improves upon the 1.5 series by incorporating higher image resolutions and more reasoning/OCR datasets.
Top Vision-Language Papers 🖼️💬📝
A curated list of papers on vision-language models, with the most influential ones at the top.
MM-LLMs
Collection by Sep 9, 2024
multilingual vision models
Some papers I read for understanding vision models and also adding multilingual capabilities to them
Multimodal Papers
Collection by Apr 22, 2024
Vision Language Models Papers 🖼️💬📝
Papers about vision-language models, most important ones are on top of the list.