Vaibhav Srivastav's picture

Vaibhav Srivastav PRO

reach-vb

·

https://vaibhavs10.github.io

AI & ML interests

TTS + LM performance prediction

Recent Activity

new activity about 5 hours ago

reach-vb/2025-ai-timeline:Update index.html

updated a Space about 5 hours ago

reach-vb/2025-ai-timeline

liked a Space about 5 hours ago

moondream/gaze-demo

View all activity

Articles

Faster Text Generation with Self-Speculative Decoding

Llama can now see and run on your device - welcome Llama 3.2

Google releases Gemma 2 2B, ShieldGemma and Gemma Scope

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

WWDC 24: Running Mistral 7B with Core ML

Welcome Gemma 2 - Google's new open LLM

Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints

CodeGemma - an official Google release for code LLMs

TTS Arena: Benchmarking Text-to-Speech Models in the Wild

AI Watermarking 101: Tools and Techniques

Deploy MusicGen in no time with Inference Endpoints

Jupyter X Hugging Face

Swift Diffusers: Fast Stable Diffusion for Mac

Organizations

reach-vb's activity

upvoted a paper 4 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 5 days ago • 194

upvoted a collection 4 days ago

Sa2VA model zoo

3 items • Updated 5 days ago • 20

upvoted a collection 6 days ago

Cosmos

The collection of Cosmos models • 31 items • Updated 2 days ago • 212

upvoted a paper 9 days ago

2 OLMo 2 Furious

Paper • 2501.00656 • Published 12 days ago • 15

upvoted a collection 9 days ago

OLMo 2

Artifacts for the second set of OLMo models. • 22 items • Updated 6 days ago • 74

upvoted 2 collections 11 days ago

Yi VL

2 items • Updated May 11, 2024 • 2

Falcon2

5 items • Updated 5 days ago • 5

upvoted 5 collections 12 days ago

QVQ

QVQ: Qwen models for visual reasoning • 7 items • Updated 12 days ago • 39

Llama 3.3

This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated Dec 6, 2024 • 106

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Nov 28, 2024 • 459

Chameleon

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR. • 2 items • Updated Jul 9, 2024 • 28

Stable Diffusion 3

Stable Diffusion 3 and related models for text-to-image and image-to-image • 2 items • Updated 4 days ago • 93

upvoted a collection 18 days ago

DeepSeek-V3

3 items • Updated 7 days ago • 112

upvoted 2 collections 20 days ago

NeMo Audio Codecs

A series of Neural Audio Codecs • 5 items • Updated 2 days ago • 10

InternVL2.5-MPO

Enhancing the Reasoning Ability of MLLMs via Mixed Preference Optimization • 16 items • Updated 3 days ago • 24

upvoted an article 21 days ago

Article

FineWeb2-C: Help Build Better Language Models in Your Language

By

•

21 days ago

• 12

upvoted a collection 23 days ago

📐 FineMath

FineMath datasets and ablation models • 14 items • Updated 6 days ago • 17

upvoted 2 collections 25 days ago

Bamba

Collection of Bamba - hybrid Mamba2 model architecture based models trained on open data • 8 items • Updated 25 days ago • 18

Granite 3.1 Language Models

A series of language models with 128K context length trained by IBM licensed under Apache 2.0 license. • 8 items • Updated 26 days ago • 48

upvoted a collection 27 days ago

Falcon3

Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. • 40 items • Updated 5 days ago • 78