Nemotron 3.5 Content Safety: Customizable Multimodal Safety for Global Enterprise AI about 1 month ago • 12
Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents Apr 28 • 62
The First Healthcare Robotics Dataset and Foundational Physical AI Models for Healthcare Robotics Mar 16 • 31
Beyond Semantic Similarity: Introducing NVIDIA NeMo Retriever’s Generalizable Agentic Retrieval Pipeline Mar 13 • 40
Build an Agent That Thinks Like a Data Scientist: How We Hit #1 on DABStep with Reusable Tool Generation Mar 13 • 18
NVIDIA Nemotron 2 Nano 9B Japanese: State-of-the-Art Small Language Model Customized for Japanese Sovereign AI Feb 17 • 3
Small Yet Mighty: Improve Accuracy In Multimodal Search and Visual Document Retrieval with Llama Nemotron RAG Models Jan 6 • 28
The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator Dec 17, 2025 • 50
Nemotron 3 Nano \- A new Standard for Efficient, Open, and Intelligent Agentic Models Dec 15, 2025 • 113
How to Build a Healthcare Robot from Simulation to Deployment with NVIDIA Isaac for Healthcare Oct 28, 2025 • 20
NVIDIA Releases 8 Million Sample Open Dataset and Tooling for OCR, Image Reasoning, Image and Video QA Tasks Oct 28, 2025 • 17
Cosmos Predict 2.5 & Transfer 2.5: Evolving the World Foundation Models for Physical AI Oct 28, 2025 • 21
Nemotron’s Open Secret: Accelerating AI Development with Open Models, Data, and Recipes Oct 22, 2025 • 11
Llama‑Embed‑Nemotron‑8B Text Embedding Model Ranks First on Multilingual MTEB Leaderboard Oct 21, 2025 • 14
Scaling Test-Time Compute to Achieve Gold Medal at IOI 2025 with Open-Weight Models Oct 20, 2025 • 19
📢 NVIDIA Releases Nemotron-CC-Math Pre-Training Dataset: A High-Quality, Web-Scale Math Corpus for Pretraining Large Language Models Aug 18, 2025 • 5
NVIDIA Releases Improved Pretraining Dataset: Preserves High Value Math & Code, and Augments with Multi-Lingual Aug 18, 2025 • 4
NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks Aug 11, 2025 • 76
Llama-NeMoRetriever-ColEmbed: Developer-Focused Guide to NVIDIA's State-of-the-Art Text-Image Retrieval Jul 9, 2025 • 4
Nemotron-Personas: Improve AI Training With the First Synthetic Personas Dataset Aligned to Real-World Distributions Jun 10, 2025 • 25
SparDA: Sparse Decoupled Attention for Efficient Long-Context LLM Inference Paper • 2606.04511 • Published Jun 3 • 3
SparDA: Sparse Decoupled Attention for Efficient Long-Context LLM Inference Paper • 2606.04511 • Published Jun 3 • 3
RocketKV: Accelerating Long-Context LLM Inference via Two-Stage KV Cache Compression Paper • 2502.14051 • Published Aug 13, 2025
Token-Efficient Long Video Understanding for Multimodal LLMs Paper • 2503.04130 • Published Mar 6, 2025 • 97
Guiding a Diffusion Model with a Bad Version of Itself Paper • 2406.02507 • Published Jun 4, 2024 • 17
Progressive Growing of GANs for Improved Quality, Stability, and Variation Paper • 1710.10196 • Published Oct 27, 2017
StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis Paper • 2301.09515 • Published Jan 23, 2023
eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers Paper • 2211.01324 • Published Nov 2, 2022 • 4
Generative Novel View Synthesis with 3D-Aware Diffusion Models Paper • 2304.02602 • Published Apr 5, 2023