Alvaro Bartolome's picture

Alvaro Bartolome

alvarobartt

·

https://alvarobartt.me

AI & ML interests

machine learning @huggingface

Recent Activity

reacted to merve's post with 👀 3 days ago

Oof, what a week! 🥵 So many things have happened, let's recap! https://huggingface.co/collections/merve/jan-24-releases-6793d610774073328eac67a9 Multimodal 💬 - We have released SmolVLM -- tiniest VLMs that come in 256M and 500M, with it's retrieval models ColSmol for multimodal RAG 💗 - UI-TARS are new models by ByteDance to unlock agentic GUI control 🤯 in 2B, 7B and 72B - Alibaba DAMO lab released VideoLlama3, new video LMs that come in 2B and 7B - MiniMaxAI released Minimax-VL-01, where decoder is based on MiniMax-Text-01 456B MoE model with long context - Dataset: Yale released a new benchmark called MMVU - Dataset: CAIS released Humanity's Last Exam (HLE) a new challenging MM benchmark LLMs 📖 - DeepSeek-R1 & DeepSeek-R1-Zero: gigantic 660B reasoning models by DeepSeek, and six distilled dense models, on par with o1 with MIT license! 🤯 - Qwen2.5-Math-PRM: new math models by Qwen in 7B and 72B - NVIDIA released AceMath and AceInstruct, new family of models and their datasets (SFT and reward ones too!) Audio 🗣️ - Llasa is a new speech synthesis model based on Llama that comes in 1B,3B, and 8B - TangoFlux is a new audio generation model trained from scratch and aligned with CRPO Image/Video/3D Generation ⏯️ - Flex.1-alpha is a new 8B pre-trained diffusion model by ostris similar to Flux - tencent released Hunyuan3D-2, new 3D asset generation from images

reacted to merve's post with 🤗 3 days ago

Oof, what a week! 🥵 So many things have happened, let's recap! https://huggingface.co/collections/merve/jan-24-releases-6793d610774073328eac67a9 Multimodal 💬 - We have released SmolVLM -- tiniest VLMs that come in 256M and 500M, with it's retrieval models ColSmol for multimodal RAG 💗 - UI-TARS are new models by ByteDance to unlock agentic GUI control 🤯 in 2B, 7B and 72B - Alibaba DAMO lab released VideoLlama3, new video LMs that come in 2B and 7B - MiniMaxAI released Minimax-VL-01, where decoder is based on MiniMax-Text-01 456B MoE model with long context - Dataset: Yale released a new benchmark called MMVU - Dataset: CAIS released Humanity's Last Exam (HLE) a new challenging MM benchmark LLMs 📖 - DeepSeek-R1 & DeepSeek-R1-Zero: gigantic 660B reasoning models by DeepSeek, and six distilled dense models, on par with o1 with MIT license! 🤯 - Qwen2.5-Math-PRM: new math models by Qwen in 7B and 72B - NVIDIA released AceMath and AceInstruct, new family of models and their datasets (SFT and reward ones too!) Audio 🗣️ - Llasa is a new speech synthesis model based on Llama that comes in 1B,3B, and 8B - TangoFlux is a new audio generation model trained from scratch and aligned with CRPO Image/Video/3D Generation ⏯️ - Flex.1-alpha is a new 8B pre-trained diffusion model by ostris similar to Flux - tencent released Hunyuan3D-2, new 3D asset generation from images

replied to merve's post 3 days ago

Oof, what a week! 🥵 So many things have happened, let's recap! https://huggingface.co/collections/merve/jan-24-releases-6793d610774073328eac67a9 Multimodal 💬 - We have released SmolVLM -- tiniest VLMs that come in 256M and 500M, with it's retrieval models ColSmol for multimodal RAG 💗 - UI-TARS are new models by ByteDance to unlock agentic GUI control 🤯 in 2B, 7B and 72B - Alibaba DAMO lab released VideoLlama3, new video LMs that come in 2B and 7B - MiniMaxAI released Minimax-VL-01, where decoder is based on MiniMax-Text-01 456B MoE model with long context - Dataset: Yale released a new benchmark called MMVU - Dataset: CAIS released Humanity's Last Exam (HLE) a new challenging MM benchmark LLMs 📖 - DeepSeek-R1 & DeepSeek-R1-Zero: gigantic 660B reasoning models by DeepSeek, and six distilled dense models, on par with o1 with MIT license! 🤯 - Qwen2.5-Math-PRM: new math models by Qwen in 7B and 72B - NVIDIA released AceMath and AceInstruct, new family of models and their datasets (SFT and reward ones too!) Audio 🗣️ - Llasa is a new speech synthesis model based on Llama that comes in 1B,3B, and 8B - TangoFlux is a new audio generation model trained from scratch and aligned with CRPO Image/Video/3D Generation ⏯️ - Flex.1-alpha is a new 8B pre-trained diffusion model by ostris similar to Flux - tencent released Hunyuan3D-2, new 3D asset generation from images

View all activity

Articles

🤗 Serve any model with Inference Endpoints + Custom Handlers

Introducing HUGS - Scale your AI with Open Models

Deploy Meta Llama 3.1 405B on Google Cloud Vertex AI

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

🧑‍⚖️ "Replacing Judges with Juries" using distilabel

Deploying 🤗 Hub models in Vertex AI

🏷️ Build AI Feedback (AIF) datasets for LLM alignment with ⚗️ distilabel

💨 Introducing Notus: a DPO fine-tune of Zephyr with a focus on high-quality data

🤗 LLM suggestions in Argilla with HuggingFace Inference Endpoints

Organizations

alvarobartt's activity

liked a model 7 days ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 1 day ago • 149k • 3.33k

liked a model 11 days ago

nomic-ai/modernbert-embed-base

Sentence Similarity • Updated 3 days ago • 88k • 181

liked a model 12 days ago

answerdotai/ModernBERT-base

Fill-Mask • Updated 12 days ago • 4.76M • 712

liked 6 models 13 days ago

openbmb/MiniCPM-o-2_6

Any-to-Any • Updated 1 day ago • 80.2k • 846

NovaSky-AI/Sky-T1-32B-Preview

Text Generation • Updated 14 days ago • 12.6k • 515

vikhyatk/moondream2

Image-Text-to-Text • Updated 18 days ago • 157k • 1.01k

deepseek-ai/DeepSeek-V3

Text Generation • Updated 3 days ago • 278k • 2.41k

hexgrad/Kokoro-82M

Text-to-Speech • Updated 3 days ago • 38.9k • 2.46k

openbmb/MiniCPM-V-2_6

Image-Text-to-Text • Updated 12 days ago • 86.1k • 921

liked a dataset 18 days ago

jinaai/negation-dataset

Viewer • Updated Nov 8, 2023 • 10.5k • 133 • 21

liked 2 Spaces 18 days ago

2024 AI Timeline

Infinite Dataset Hub

Search and save datasets generated with a LLM in real time

liked a model 18 days ago

unsloth/phi-4

Text Generation • Updated 14 days ago • 17k • 66

liked a Space 19 days ago

Scaling test-time compute

liked 3 models about 1 month ago

Qwen/Qwen-VL-Chat

Text Generation • Updated Jan 25, 2024 • 22.1k • 350

tencent/HunyuanVideo

Text-to-Video • Updated 6 days ago • 8.01k • 1.52k

casperhansen/llama-3.3-70b-instruct-awq

Text Generation • Updated Dec 6, 2024 • 33.8k • 26

liked 3 models about 2 months ago

google/paligemma2-3b-pt-224

Image-Text-to-Text • Updated Dec 5, 2024 • 44.4k • 133

google/paligemma2-3b-pt-448

Image-Text-to-Text • Updated Dec 5, 2024 • 12.6k • 40

meta-llama/Llama-3.3-70B-Instruct

Text Generation • Updated Dec 21, 2024 • 581k • • 1.77k