Krishna Kaasyap
KrishnaKaasyap
3 followers · 16 following
krishnakaasyap
krishnakaasyap.bsky.social
AI & ML interests
Test Time Training, Multimodal & Inter-Modality Transfer Learning, Mechanistic Interpretability, Evolutionary Model Merging, Swarm Intelligence of multiple models with different architectures and different algorithms, MuZero approach to general tasks
Recent Activity
liked a model about 1 hour ago
deepseek-ai/Janus-Pro-7B
new activity 7 days ago
deepseek-ai/DeepSeek-R1-Distill-Llama-70B: SFT (Non-RL) distillation is this good on a sub-100B model?
liked a model 7 days ago
deepseek-ai/DeepSeek-R1-Distill-Llama-70B
KrishnaKaasyap's activity
liked a model about 1 hour ago
deepseek-ai/Janus-Pro-7B • Any-to-Any • Updated about 2 hours ago • 311
liked 3 models 7 days ago
deepseek-ai/DeepSeek-R1-Distill-Llama-70B • Text Generation • Updated 1 day ago • 30.2k • 271
deepseek-ai/DeepSeek-R1 • Text Generation • Updated 1 day ago • 149k • 3.32k
deepseek-ai/DeepSeek-R1-Zero • Text Generation • Updated 1 day ago • 6.66k • 504
liked a model 13 days ago
MiniMaxAI/MiniMax-Text-01 • Text Generation • Updated 11 days ago • 4.68k • 481
liked a Space 25 days ago
Running • 779 • 🦀 InstantCoder
liked a dataset 27 days ago
PowerInfer/QWQ-LONGCOT-500K • Viewer • Updated Dec 26, 2024 • 286k • 1.85k • 118
liked a model 27 days ago
PowerInfer/SmallThinker-3B-Preview • Text Generation • Updated 11 days ago • 97.8k • 371
liked 3 models about 1 month ago
Qwen/QVQ-72B-Preview • Image-Text-to-Text • Updated 16 days ago • 183k • 526
deepseek-ai/DeepSeek-V3 • Text Generation • Updated 3 days ago • 278k • 2.41k
deepseek-ai/DeepSeek-V3-Base • Updated 3 days ago • 20.9k • 1.37k
liked a Space about 1 month ago
Running • 870 • 🔍 QwQ-32B-Preview
liked 3 models about 2 months ago
deepseek-ai/DeepSeek-V2.5-1210 • Text Generation • Updated Dec 11, 2024 • 365k • 244
tencent/HunyuanVideo • Text-to-Video • Updated 6 days ago • 8.01k • 1.52k
Qwen/QwQ-32B-Preview • Text Generation • Updated 16 days ago • 178k • 1.59k
liked a model 2 months ago
mistralai/Mistral-Large-Instruct-2411 • Updated Nov 19, 2024 • 1.39M • 194
liked a Space 3 months ago
Running • 1.28k • 🐢 Qwen2.5 Coder Artifacts
liked 2 models 3 months ago
Etched/oasis-500m • Updated Nov 4, 2024 • 170 • 437
ssmits/Qwen2.5-95B-Instruct • Text Generation • Updated Oct 31, 2024 • 58 • 3
liked a model 4 months ago
mlabonne/BigLlama-3.1-681B-Instruct • Text Generation • Updated Aug 4, 2024 • 16 • 11