Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
239
48
140
Ross Wightman
rwightman
Follow
rishiraj's profile picture
promptam's profile picture
aleau's profile picture
248 followers
Β·
93 following
wightmanr
rwightman
AI & ML interests
Computer vision, transfer learning, semi/self supervised learning, robotics.
Recent Activity
reacted
to
merve
's
post
with π₯
3 days ago
Oof, what a week! π₯΅ So many things have happened, let's recap! https://huggingface.co/collections/merve/jan-24-releases-6793d610774073328eac67a9 Multimodal π¬ - We have released SmolVLM -- tiniest VLMs that come in 256M and 500M, with it's retrieval models ColSmol for multimodal RAG π - UI-TARS are new models by ByteDance to unlock agentic GUI control π€― in 2B, 7B and 72B - Alibaba DAMO lab released VideoLlama3, new video LMs that come in 2B and 7B - MiniMaxAI released Minimax-VL-01, where decoder is based on MiniMax-Text-01 456B MoE model with long context - Dataset: Yale released a new benchmark called MMVU - Dataset: CAIS released Humanity's Last Exam (HLE) a new challenging MM benchmark LLMs π - DeepSeek-R1 & DeepSeek-R1-Zero: gigantic 660B reasoning models by DeepSeek, and six distilled dense models, on par with o1 with MIT license! π€― - Qwen2.5-Math-PRM: new math models by Qwen in 7B and 72B - NVIDIA released AceMath and AceInstruct, new family of models and their datasets (SFT and reward ones too!) Audio π£οΈ - Llasa is a new speech synthesis model based on Llama that comes in 1B,3B, and 8B - TangoFlux is a new audio generation model trained from scratch and aligned with CRPO Image/Video/3D Generation β―οΈ - Flex.1-alpha is a new 8B pre-trained diffusion model by ostris similar to Flux - tencent released Hunyuan3D-2, new 3D asset generation from images
new
activity
3 days ago
safetensors/convert:
Allow running conversion after closing a previous PR.
updated
a model
5 days ago
rwightman/vit_base_patch16_224.augreg2_in21k_ft_in1k_pets
View all activity
Articles
Timm β€οΈ Transformers: Use any timm model with transformers
12 days ago
β’
34
Trick or ResNet Treat
Oct 31, 2024
β’
4
Mamba Out
Oct 18, 2024
β’
8
Tiny Test Models
Oct 2, 2024
β’
6
Searching for better (Full) ImageNet ViT Baselines
Aug 26, 2024
β’
4
MobileNet Baselines
Jul 26, 2024
β’
23
MobileNet-V4 (now in timm)
Jun 17, 2024
β’
40
Organizations
rwightman
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
safetensors/convert
3 days ago
Allow running conversion after closing a previous PR.
8
#21 opened about 1 year ago by
rwightman
New activity in
safetensors/convert
6 days ago
Update convert.py
#37 opened 6 days ago by
rwightman
New activity in
timm/efficientformer_l7.snap_dist_in1k
11 days ago
Adding `safetensors` variant of this model
#1 opened 12 days ago by
SFconvertbot
New activity in
timm/davit_tiny.msft_in1k
11 days ago
Adding `safetensors` variant of this model
#1 opened 12 days ago by
SFconvertbot
New activity in
timm/davit_base.msft_in1k
11 days ago
Adding `safetensors` variant of this model
#2 opened 12 days ago by
SFconvertbot
New activity in
timm/levit_128.fb_dist_in1k
11 days ago
Adding `safetensors` variant of this model
#1 opened 12 days ago by
SFconvertbot
New activity in
timm/davit_small.msft_in1k
11 days ago
Adding `safetensors` variant of this model
#1 opened 12 days ago by
SFconvertbot
New activity in
timm/mobilenetv4_conv_small.e2400_r224_in1k
26 days ago
Can't get attribute 'UniversalInvertedResidual'
2
#7 opened 26 days ago by
sddwadsa
New activity in
pixparse/cc3m-wds
about 1 month ago
Converting Arrow to WebDataset TAR Format for Offline Use
2
#5 opened about 1 month ago by
katie312
New activity in
timm/mobilenetv4_conv_small.e2400_r224_in1k
about 1 month ago
model.eval()οΌresults are wrong?
1
#6 opened about 1 month ago by
liyufeng
New activity in
timm/efficientformerv2_s1.snap_dist_in1k
about 2 months ago
Adding `safetensors` variant of this model
#1 opened about 2 months ago by
SFconvertbot
New activity in
timm/efficientformerv2_s2.snap_dist_in1k
about 2 months ago
Adding `safetensors` variant of this model
#1 opened about 2 months ago by
SFconvertbot
New activity in
timm/fastvit_t8.apple_dist_in1k
about 2 months ago
Update "first_conv" in config.json
#2 opened about 2 months ago by
Cinq108
New activity in
timm/efficientformer_l1.snap_dist_in1k
about 2 months ago
Adding `safetensors` variant of this model
#1 opened about 2 months ago by
SFconvertbot
New activity in
timm/efficientformerv2_l.snap_dist_in1k
about 2 months ago
Adding `safetensors` variant of this model
#1 opened about 2 months ago by
SFconvertbot
New activity in
timm/ViT-B-16-SigLIP-i18n-256
2 months ago
Are the languages that are supported documented anywhere?
2
#1 opened 2 months ago by
Jesse-marqo
New activity in
pixparse/cc12m-wds
2 months ago
Is this where all the data is?
1
#3 opened 2 months ago by
showstarpro
New activity in
laion/CLIP-ViT-B-32-xlm-roberta-base-laion5B-s13B-b90k
2 months ago
Upload pytorch_model.bin
3
#3 opened 3 months ago by
prasadpr20
New activity in
timm/efficientformerv2_s0.snap_dist_in1k
2 months ago
Adding `safetensors` variant of this model
#1 opened 2 months ago by
SFconvertbot
New activity in
laion/CLIP-ViT-L-14-CommonPool.XL-s13B-b90K
3 months ago
Adding `safetensors` variant of this model
#1 opened 3 months ago by
SFconvertbot
Load more