Zeb K

baobaoh

zebwithb

AI & ML interests

None yet

Recent Activity

upvoted a collection 8 days ago

Qwen2.5-VL

upvoted a paper 8 days ago

Qwen2.5-VL Technical Report

upvoted a paper 8 days ago

Reward Steering with Evolutionary Heuristics for Decoding-time Alignment

Organizations

None yet

baobaoh's activity

upvoted a collection 8 days ago

Qwen2.5-VL

Collection

Vision-language model series based on Qwen2.5 • 8 items • Updated 14 days ago • 391

upvoted 4 papers 8 days ago

TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding

Paper • 2502.19400 • Published 12 days ago • 42

New activity in m-a-p/MERT-v1-95M 9 days ago

MERT-v1-95M not compatible with Transformers >=4.44.0

#4 opened 9 days ago by baobaoh

upvoted a paper 9 days ago

NeoBERT: A Next-Generation BERT

Paper • 2502.19587 • Published 12 days ago • 38

upvoted an article 9 days ago

The Large Language Model Course

Article • Jan 16 • 126

upvoted 8 papers 11 days ago

CritiQ: Mining Data Quality Criteria from Human Preferences

Paper • 2502.19279 • Published 12 days ago • 9

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published about 1 month ago • 122

Distillation Scaling Laws

Paper • 2502.08606 • Published 26 days ago • 46

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published 26 days ago • 182

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published 26 days ago • 143

Large Language Diffusion Models

Paper • 2502.09992 • Published 24 days ago • 100

S*: Test Time Scaling for Code Generation

Paper • 2502.14382 • Published 18 days ago • 59

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published 18 days ago • 177

liked a model 11 days ago

Qwen/QwQ-32B-Preview

Text Generation • Updated Jan 12 • 256k • 1.71k

liked a model 15 days ago

openbmb/MiniCPM-o-2_6

Any-to-Any • Updated 7 days ago • 392k • 1.04k

liked a Space 19 days ago

Music2emo

📊

Towards Unified Music Emotion Recognition across Dimensional

liked a model 20 days ago

amaai-lab/music2emo

Updated 26 days ago • 2