Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 1 day ago • 171
Article From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub 29 days ago • 49
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4 • 203
Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference Jan 16 • 71
Qwen2-VL Collection Vision-language model series based on Qwen2 • 16 items • Updated Dec 6, 2024 • 208
Article 🤗 Serve any model with Inference Endpoints + Custom Handlers By alvarobartt • Nov 22, 2024 • 3