2 117 60

Raja Biswas

rbiswasfc

AI & ML interests

NLP, Generative AI

Recent Activity

upvoted a paper 1 day ago

InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model

upvoted a paper 1 day ago

Towards Best Practices for Open Datasets for LLM Training

upvoted a paper 1 day ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

View all activity

Articles

Finally, a Replacement for BERT: Introducing ModernBERT

Dec 19, 2024

• 510

Organizations

rbiswasfc's activity

upvoted 3 papers 1 day ago

InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model

Paper • 2501.12368 • Published 6 days ago • 37

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published 13 days ago • 49

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 5 days ago • 216

liked a Space 4 days ago

Running on Zero

🐠

Kvpress

kvpress: LLM KV cache compression made easy

upvoted an article 4 days ago

Article

Mastering Long Contexts in LLMs with KVPress

•

4 days ago

• 51

upvoted a paper 4 days ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 13 days ago • 268

upvoted a paper 5 days ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published 11 days ago • 98

upvoted an article 6 days ago

Article

Yay! Organizations can now publish blog Articles

•

7 days ago

• 30

liked a model 6 days ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 1 day ago • 149k • 3.33k

liked a model 10 days ago

microsoft/phi-4

Text Generation • Updated 19 days ago • 214k • 1.58k

upvoted an article 12 days ago

Article

Visualize and understand GPU memory in PyTorch

Dec 24, 2024

• 171

liked a model 12 days ago

internlm/internlm3-8b-instruct

Text Generation • Updated 11 days ago • 16.8k • 188

upvoted an article 12 days ago

Article

Diving into MiniMax01 405B MoE

•

12 days ago

• 17

upvoted a paper 13 days ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published 14 days ago • 88

upvoted a paper 17 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 19 days ago • 249

liked a model 21 days ago

PowerInfer/SmallThinker-3B-Preview

Text Generation • Updated 11 days ago • 97.8k • 371

upvoted 2 papers 21 days ago

Token-Budget-Aware LLM Reasoning

Paper • 2412.18547 • Published Dec 24, 2024 • 45

YuLan-Mini: An Open Data-efficient Language Model

Paper • 2412.17743 • Published Dec 23, 2024 • 64

upvoted a paper about 1 month ago

RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response

Paper • 2412.14922 • Published Dec 19, 2024 • 85

liked a model about 1 month ago

Qwen/QVQ-72B-Preview

Image-Text-to-Text • Updated 16 days ago • 183k • 526