Nauman Mustafa's picture

Nauman Mustafa

naxautify

·

AI & ML interests

None yet

Recent Activity

liked a model 7 days ago

tencent/HunyuanVideo

liked a model 7 days ago

deepseek-ai/DeepSeek-R1

liked a model 12 days ago

MiniMaxAI/MiniMax-VL-01

View all activity

Organizations

naxautify's activity

upvoted a paper 4 months ago

RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval

Paper • 2409.10516 • Published Sep 16, 2024 • 41

upvoted 2 papers 7 months ago

DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning

Paper • 2406.11896 • Published Jun 14, 2024 • 20

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Paper • 2406.11931 • Published Jun 17, 2024 • 60

upvoted a paper 8 months ago

Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation

Paper • 2406.06525 • Published Jun 10, 2024 • 67

upvoted a paper 9 months ago

MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels

Paper • 2405.07526 • Published May 13, 2024 • 19

upvoted 2 papers 10 months ago

Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs

Paper • 2404.05719 • Published Apr 8, 2024 • 83

Octopus v2: On-device language model for super agent

Paper • 2404.01744 • Published Apr 2, 2024 • 57

upvoted 5 papers 11 months ago

SeeClick: Harnessing GUI Grounding for Advanced Visual GUI Agents

Paper • 2401.10935 • Published Jan 17, 2024 • 4

Training-Free Long-Context Scaling of Large Language Models

Paper • 2402.17463 • Published Feb 27, 2024 • 20

Copilot Evaluation Harness: Evaluating LLM-Guided Software Programming

Paper • 2402.14261 • Published Feb 22, 2024 • 10

PALO: A Polyglot Large Multimodal Model for 5B People

Paper • 2402.14818 • Published Feb 22, 2024 • 23

Genie: Generative Interactive Environments

Paper • 2402.15391 • Published Feb 23, 2024 • 70

upvoted 5 papers 12 months ago

Rolling Diffusion Models

Paper • 2402.09470 • Published Feb 12, 2024 • 11

Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning

Paper • 2402.06619 • Published Feb 9, 2024 • 55

SubGen: Token Generation in Sublinear Time and Memory

Paper • 2402.06082 • Published Feb 8, 2024 • 11

Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception

Paper • 2401.16158 • Published Jan 29, 2024 • 19

WebLINX: Real-World Website Navigation with Multi-Turn Dialogue

Paper • 2402.05930 • Published Feb 8, 2024 • 39

upvoted a paper about 1 year ago

Learning to Compress Prompts with Gist Tokens

Paper • 2304.08467 • Published Apr 17, 2023 • 3

upvoted 2 papers over 1 year ago

Generative Image Dynamics

Paper • 2309.07906 • Published Sep 14, 2023 • 53

InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation

Paper • 2309.06380 • Published Sep 12, 2023 • 32