110 60 77

Hugo Laurençon

HugoLaurencon

HugoLaurencon

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Autonomy-of-Experts Models

upvoted a paper 10 days ago

Learnings from Scaling Visual Tokenizers for Reconstruction and Generation

upvoted a paper 13 days ago

Tensor Product Attention Is All You Need

View all activity

Articles

Docmatix - a huge dataset for Document Visual Question Answering

Jul 18, 2024

• 72

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Apr 15, 2024

• 171

Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset

Mar 15, 2024

• 7

Introducing IDEFICS: An Open Reproduction of State-of-the-art Visual Language Model

Aug 22, 2023

• 29

Putting ethical principles at the core of research lifecycle

May 19, 2022

Organizations

HugoLaurencon's activity

upvoted a paper 4 days ago

Autonomy-of-Experts Models

Paper • 2501.13074 • Published 5 days ago • 36

upvoted a paper 10 days ago

Learnings from Scaling Visual Tokenizers for Reconstruction and Generation

Paper • 2501.09755 • Published 11 days ago • 33

upvoted a paper 13 days ago

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published 17 days ago • 75

upvoted a paper 24 days ago

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published 26 days ago • 98

liked a dataset 24 days ago

DAMO-NLP-SG/multimodal_textbook

Updated 16 days ago • 13.4k • 131

New activity in HuggingFaceM4/idefics2-8b 25 days ago

Seems like the user prompt is ignored

#80 opened about 1 month ago by

jlmeunier

New activity in OS-Copilot/OS-Genesis-7B-AC 25 days ago

Permission error to access data

#1 opened 25 days ago by

HugoLaurencon

upvoted 2 papers about 1 month ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 343

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22, 2024 • 124

New activity in HuggingFaceM4/idefics2-8b about 1 month ago

Seems like the user prompt is ignored

#80 opened about 1 month ago by

jlmeunier

upvoted 2 papers about 1 month ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published Dec 13, 2024 • 139

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 106