NVIDIA

Enterprise

company

Verified

https://www.nvidia.com/

nvidia

AI & ML interests

None defined yet.

Recent Activity

fitsumreda updated a model about 19 hours ago

nvidia/Cosmos-1.0-Tokenizer-CV8x8x8

fitsumreda updated a model about 19 hours ago

nvidia/Cosmos-1.0-Tokenizer-DV8x16x16

Haoxiang-Wang new activity 1 day ago

nvidia/Cosmos-1.0-Autoregressive-4B:access restriction

View all activity

nvidia's activity

fitsumreda

updated 2 models about 19 hours ago

nvidia/Cosmos-1.0-Tokenizer-CV8x8x8

Updated about 19 hours ago • 1.54k • 11

nvidia/Cosmos-1.0-Tokenizer-DV8x16x16

Updated about 19 hours ago • 597 • 13

Haoxiang-Wang

in nvidia/Cosmos-1.0-Autoregressive-4B 1 day ago

access restriction

#3 opened 1 day ago by

Access restrictions

#2 opened 4 days ago by

zhilinw

in nvidia/Llama-3.1-Nemotron-70B-Reward-HF 2 days ago

Why the hf-format model does not have rm head, since the original format model does have.

#7 opened 11 days ago by

zhiyucheng

updated 3 models 2 days ago

nvidia/Llama-3.1-405B-Instruct-FP8

Updated 2 days ago • 280 • 6

nvidia/Llama-3.1-70B-Instruct-FP8

Updated 2 days ago • 202 • 6

nvidia/Llama-3.1-8B-Instruct-FP8

Updated 2 days ago • 625 • 9

ahatamiz

authored a paper 7 days ago

ViR: Vision Retention Networks

Paper • 2310.19731 • Published Oct 30, 2023 • 1

talor-abr

authored a paper about 1 month ago

Puzzle: Distillation-Based NAS for Inference-Optimized LLMs

Paper • 2411.19146 • Published Nov 28, 2024 • 14

zijiac-nvidia

authored a paper about 2 months ago

Hymba: A Hybrid-head Architecture for Small Language Models

Paper • 2411.13676 • Published Nov 20, 2024 • 40

zihanliu

authored 9 papers 4 months ago

Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study

Paper • 2304.06762 • Published Apr 13, 2023 • 1

Retrieval meets Long Context Large Language Models

Paper • 2310.03025 • Published Oct 4, 2023 • 4

XPersona: Evaluating Multilingual Personalized Chatbot

Paper • 2003.07568 • Published Mar 17, 2020

CrossNER: Evaluating Cross-Domain Named Entity Recognition

Paper • 2012.04373 • Published Dec 8, 2020

Are Multilingual Models Effective in Code-Switching?

Paper • 2103.13309 • Published Mar 24, 2021

ChatQA: Building GPT-4 Level Conversational QA Models

Paper • 2401.10225 • Published Jan 18, 2024 • 34

Multi-Stage Prompting for Knowledgeable Dialogue Generation

Paper • 2203.08745 • Published Mar 16, 2022

Nemotron-4 340B Technical Report

Paper • 2406.11704 • Published Jun 17, 2024

RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs

Paper • 2407.02485 • Published Jul 2, 2024 • 5