Kashif Rasul

kashif

AI & ML interests

Time Series Forecasting, Denoising Diffusion, Generative Modeling, Reinforcement Learning

Recent Activity

upvoted a paper 4 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

upvoted an article 9 days ago

Process Reinforcement through Implicit Rewards

liked a Space 27 days ago

HuggingFaceH4/blogpost-scaling-test-time-compute

kashif's activity

upvoted a paper 4 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 5 days ago • 194

upvoted an article 9 days ago

Process Reinforcement through Implicit Rewards

Article • 10 days ago • 15

liked a Space 27 days ago

📈 Scaling test-time compute

Running • 463

liked a Space 28 days ago

🥇 Fev Leaderboard

Running

liked a model about 1 month ago

nicolas-dufour/PLONK_YFCC

Updated Dec 12, 2024 • 314 • 12

updated a model about 1 month ago

huggingface/timesfm-tourism-monthly

Updated Dec 9, 2024 • 11 • 1

upvoted a paper about 1 month ago

Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving

Paper • 2407.00079 • Published Jun 24, 2024 • 5

liked a model about 1 month ago

flair/bueble-lm-2b

Text Generation • Updated Dec 6, 2024 • 124 • 20

upvoted a paper about 1 month ago

RRM: Robust Reward Model Training Mitigates Reward Hacking

Paper • 2409.13156 • Published Sep 20, 2024 • 4

liked a model about 1 month ago

TianqiLiuAI/RM-gemma2-2b

Text Generation • Updated Nov 18, 2024 • 18 • 1

updated a dataset about 2 months ago

trl-lib/alpaca-cleaned

Viewer • Updated Nov 28, 2024 • 51.8k • 55

liked a dataset about 2 months ago

ylecun/mnist

Viewer • Updated Aug 8, 2024 • 70k • 25.4k • 148

updated a model about 2 months ago

HuggingFaceTB/SmolVLM-Instruct-DPO

Image-Text-to-Text • Updated Nov 26, 2024 • 392 • 15

liked 2 models about 2 months ago

apple/coreml-mobileclip

Updated Nov 19, 2024 • 255 • 35

apple/aimv2-large-patch14-448

Image Feature Extraction • Updated Nov 28, 2024 • 12.5k • 1

liked a dataset about 2 months ago

Maple728/Time-300B

Preview • Updated Oct 22, 2024 • 2.28k • 16

liked a Space 2 months ago

🥇 GIFT Eval

Running • GIFT-Eval: A Benchmark for General Time Series Forecasting

liked a model 3 months ago

jimmycarter/LibreFLUX

Text-to-Image • Updated Oct 24, 2024 • 160 • 158

upvoted a paper 3 months ago

A Rate-Distortion View of Uncertainty Quantification

Paper • 2406.10775 • Published Jun 16, 2024 • 1

updated a dataset 4 months ago

kashif/chronos-preference

Preview • Updated Sep 26, 2024 • 29

Articles

How NuminaMath Won the 1st AIMO Progress Prize

Preference Optimization for Vision Language Models

🧨 Diffusers welcomes Stable Diffusion 3

Patch Time Series Transformer in Hugging Face

Constitutional AI with Open LLMs

PatchTSMixer in HuggingFace

Preference Tuning LLMs with Direct Preference Optimization Methods

Finetune Stable Diffusion Models with DDPO via TRL

Introducing Würstchen: Fast Diffusion for Image Generation

Fine-tune Llama 2 with DPO

Yes, Transformers are Effective for Time Series Forecasting (+ Autoformer)

StackLLaMA: A hands-on guide to train LLaMA with RLHF

Multivariate Probabilistic Time Series Forecasting with Informer

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

Probabilistic Time Series Forecasting with 🤗 Transformers

The Annotated Diffusion Model
