Ross Wightman

rwightman

AI & ML interests

Computer vision, transfer learning, semi/self supervised learning, robotics.

Recent Activity

reacted to merve's post with πŸ”₯ 3 days ago
Oof, what a week! πŸ₯΅ So many things have happened, let's recap! https://huggingface.co/collections/merve/jan-24-releases-6793d610774073328eac67a9 Multimodal πŸ’¬ - We have released SmolVLM -- tiniest VLMs that come in 256M and 500M, with it's retrieval models ColSmol for multimodal RAG πŸ’— - UI-TARS are new models by ByteDance to unlock agentic GUI control 🀯 in 2B, 7B and 72B - Alibaba DAMO lab released VideoLlama3, new video LMs that come in 2B and 7B - MiniMaxAI released Minimax-VL-01, where decoder is based on MiniMax-Text-01 456B MoE model with long context - Dataset: Yale released a new benchmark called MMVU - Dataset: CAIS released Humanity's Last Exam (HLE) a new challenging MM benchmark LLMs πŸ“– - DeepSeek-R1 & DeepSeek-R1-Zero: gigantic 660B reasoning models by DeepSeek, and six distilled dense models, on par with o1 with MIT license! 🀯 - Qwen2.5-Math-PRM: new math models by Qwen in 7B and 72B - NVIDIA released AceMath and AceInstruct, new family of models and their datasets (SFT and reward ones too!) Audio πŸ—£οΈ - Llasa is a new speech synthesis model based on Llama that comes in 1B,3B, and 8B - TangoFlux is a new audio generation model trained from scratch and aligned with CRPO Image/Video/3D Generation ⏯️ - Flex.1-alpha is a new 8B pre-trained diffusion model by ostris similar to Flux - tencent released Hunyuan3D-2, new 3D asset generation from images
View all activity

Articles

Organizations

Hugging Face's profile picture PyTorch Image Models's profile picture Spaces-explorers's profile picture Flax Community's profile picture LAION eV's profile picture Pixel Parsing's profile picture

rwightman's activity

New activity in safetensors/convert 3 days ago
New activity in safetensors/convert 6 days ago

Update convert.py

#37 opened 6 days ago by
rwightman
New activity in pixparse/cc3m-wds about 1 month ago
New activity in timm/mobilenetv4_conv_small.e2400_r224_in1k about 1 month ago
New activity in timm/efficientformerv2_s1.snap_dist_in1k about 2 months ago
New activity in timm/efficientformerv2_s2.snap_dist_in1k about 2 months ago
New activity in timm/fastvit_t8.apple_dist_in1k about 2 months ago

Update "first_conv" in config.json

#2 opened about 2 months ago by
Cinq108
New activity in timm/efficientformer_l1.snap_dist_in1k about 2 months ago
New activity in timm/efficientformerv2_l.snap_dist_in1k about 2 months ago
New activity in pixparse/cc12m-wds 2 months ago