Sugato Ray

sugatoray

https://linkedin.com/in/sugatoray

AI & ML interests

None yet

Recent Activity

updated a collection about 2 hours ago

upvoted a paper about 2 hours ago

Agentless: Demystifying LLM-based Software Engineering Agents

updated a collection about 3 hours ago

LLM Training Datasets

sugatoray's activity

upvoted a paper about 2 hours ago

Agentless: Demystifying LLM-based Software Engineering Agents

Paper • 2407.01489 • Published Jul 1, 2024 • 54

upvoted a paper 1 day ago

KaLM-Embedding: Superior Training Data Brings A Stronger Embedding Model

Paper • 2501.01028 • Published 11 days ago • 10

upvoted a collection 1 day ago

KaLM-embedding

5 items • Updated 10 days ago • 19

upvoted a collection 2 days ago

Jan 10 Releases 🌨️

38 items • Updated 3 days ago • 10

upvoted 2 papers 4 days ago

Proactive Conversational Agents with Inner Thoughts

Paper • 2501.00383 • Published 13 days ago • 1

Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published 5 days ago • 68

upvoted 3 collections 4 days ago

Docling

1 item • Updated 28 days ago • 2

Deepseek V3 (All Versions)

Deepseek V3 - available in bf16, original, and GGUF formats, with support for 2, 3, 4, 5, 6 and 8-bit quantized versions. • 3 items • Updated about 5 hours ago • 21

Phi-4

Phi-4 small language model. • 2 items • Updated 4 days ago • 34

upvoted a paper 4 days ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published 9 days ago • 72

upvoted 2 collections 5 days ago

Cosmos Tokenizer

A suite of image and video tokenizers • 13 items • Updated 2 days ago • 36

Cosmos

The collection of Cosmos models • 31 items • Updated 2 days ago • 213

upvoted an article 6 days ago

Fine-tune a SmolLM on domain-specific synthetic data from a LLM

Article • 10 days ago • 29

upvoted a paper 7 days ago

GAIA: a benchmark for General AI Assistants

Paper • 2311.12983 • Published Nov 21, 2023 • 187

upvoted a collection 7 days ago

GAIA release

Gather the items of the GAIA release • 4 items • Updated Nov 23, 2023 • 20

upvoted a collection 8 days ago

🤖 Agents

21 items • Updated 13 days ago • 97

upvoted an article 8 days ago

🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark

Article • 10 days ago • 37

upvoted a collection 11 days ago

SwiftKV Models

SwiftKV reduces prefill compute by up to 50% by combining model rewiring and knowledge-preserving self-distillation. • 3 items • Updated Dec 5, 2024 • 3

upvoted a paper 11 days ago

Xmodel-2 Technical Report

Paper • 2412.19638 • Published 17 days ago • 23

upvoted an article 11 days ago

Fine-tune ModernBERT for text classification using synthetic data

Article • 14 days ago • 22