Research Labs
All Research Labs
3D Deep Learning
Applied Research
Autonomous Vehicles
Deep Imagination
Publications
AI Playground
New and Featured
AI Art Gallery
NGC Demos
Research Areas
AI & Machine Learning
3D Deep Learning
Computer Vision
Robotics
All Areas
Careers
Academic Collaborations
Government Collaborations
Graduate Fellowship
Internships
Research Openings
Research Scientists
Meet the Team
Licensing
Skip to main content
Artificial Intelligence Computing Leadership from NVIDIA
Login
Research Labs
All Research Labs
3D Deep Learning
Applied Research
Autonomous Vehicles
Deep Imagination
Publications
AI Playground
New and Featured
AI Art Gallery
NGC Demos
Research Areas
AI & Machine Learning
3D Deep Learning
Computer Vision
Robotics
All Areas
Careers
Academic Collaborations
Government Collaborations
Graduate Fellowship
Internships
Research Openings
Research Scientists
Meet the Team
Licensing
Search
Search
Enter the terms you wish to search for.
Publications
Our publications provide insight into some of our leading-edge research.
Filters
Search
Apply
Filters
Filters
Publication Year
2025
(24)
2024
(104)
2023
(159)
2022
(164)
2021
(153)
2020
(135)
2019
(123)
2018
(107)
2017
(85)
2016
(57)
2015
(52)
2014
(23)
2013
(26)
2012
(20)
2011
(29)
2010
(19)
2009
(9)
2008
(14)
2007
(10)
2006
(1)
2005
(3)
2003
(1)
2001
(1)
Facet Publication Year
Research Areas
Artificial Intelligence and Machine Learning
(473)
Computer Graphics
(331)
Computer Vision
(324)
Computer Architecture
(222)
Robotics
(128)
Generative AI
(116)
Circuits and VLSI Design
(107)
High Performance Computing
(101)
Real-Time Rendering
(92)
Algorithms and Numerical Methods
(91)
VR, AR and Display Technology
(69)
Resilience and Safety
(61)
Autonomous Vehicles
(56)
Human Computer Interaction
(52)
Programming Languages, Systems and Tools
(51)
Computational Photography and Imaging
(50)
Speech Processing
(46)
Applied Perception
(37)
Natural Language Processing
(33)
Esports
(29)
Medical
(17)
Hyperscale Graphics
(16)
Networking
(13)
Telecommunications
(13)
Machine Translation
(12)
Quantum Computing
(5)
Physical AI
(3)
Climate Simulation
(2)
Storage and Systems
(1)
Events
CORL
(17)
CVPR
(46)
ECCV
(9)
ICCV
(9)
ICLR
(23)
ICML
(13)
ICRA
(31)
IROS
(13)
ISPD
(7)
NeurIPS
(33)
PLDI
(1)
RSS
(8)
SIGGRAPH
(65)
VSS
(2)
2025
MambaVision: A Hybrid Mamba-Transformer Vision Backbone
Ali Hatamizadeh
,
Jan Kautz
CVPR
Marco: Configurable Graph-Based Task Solving and Multi-AI Agents Framework for Hardware Design
Chia-Tung (Mark) Ho
, Jing Gong,
Yunsheng Bai
,
Chenhui Deng
,
Haoxing (Mark) Ren
,
Brucek Khailany
Fugatto 1 - Foundational Generative Audio Transformer Opus 1
Rafael Valle, Rohan Badlani, Zhifeng Kong, Sang-gil Lee, Arushi Goel, Sungwon Kim, Joao Felipe Santos, Shuqi Dai,
Siddharth Gururani
, Aya AIJa'fari, Alex Liu, Kevin Shih, Wei Ping,
Huck Yang
, Bryan Catanzaro
ICLR
Gated Delta Networks: Improving Mamba2 with Delta Rule
Songlin Yang,
Jan Kautz
,
Ali Hatamizadeh
ICLR
Audio Large Language Models Can Be Descriptive Speech Quality Evaluators
Chen Chen, Yuchen Hu, Siyin Wang, Helin Wang, Zhehuai Chen, Chao Zhang,
Huck Yang
, EngSiong Chng
ICLR
UniWav: Towards Unified Pre-training for Speech Representation Learning and Generation
Alexander H. Liu, Sang-gil Lee,
Huck Yang
, Yuan Gong,
Frank Wang
, James R. Glas, Rafael Valle
ICLR
Toward Understanding Display Size for FPS Esports Aiming
Arjun Madhusudan,
Josef Spjut
, Benjamin Watson, Seth Schneider,
Ben Boudaoud
,
Joohwan Kim
Towards Neural Scaling Laws for Time Series Foundation Models
Qingren Yao,
Huck Yang
, Renhe Jiang, Ming Jin, Shirui Pan
ICLR
Composing Distributed Computations Through Task and Kernel Fusion
Rohan Yadav, Shiv Sundrum, Wonchan Lee,
Michael Garland
,
Michael Bauer
, Alex Aiken, Fredrik Kjolstad
Automatic Tracing in Task-Based Runtime Systems
Rohan Yadav,
Michael Bauer
, David Broman,
Michael Garland
, Alex Aiken, Fredrik Kjolstad
Pushing the Limits? Frame Rate Benefits to Players for up to 500 Hz in First Person Shooter Games
Samin Shahriar Tokey,
Ben Boudaoud
,
Joohwan Kim
,
Josef Spjut
, Mark Claypool
Cosmos-Reason 1: From Physical AI Common Sense to Embodied Decisions
Tsung-Yi Lin
,
Ming-Yu Liu
NVIDIA Isaac GR00T N1: An Open Foundation Model for Humanoid Robots
Yuke Zhu
,
Linxi "Jim" Fan
, NVIDIA GEAR Team
Spatio-Temporal Context Prompting for Zero-Shot Action Detection
Wei-Jhe Huang,
Min-Hung Chen
, Shang-Hong Lai
Semantic Prompt Learning for Weakly-Supervised Semantic Segmentation
Ci-Siang Lin, Chien-Yi Wang,
Frank Wang
,
Min-Hung Chen
CorrFill: Enhancing Faithfulness in Reference-based Inpainting with Correspondence Guidance in Diffusion Models
Kuan-Hung Liu, Cheng-Kun Yang,
Min-Hung Chen
, Yu-Lun Liu, Yen-Yu Lin
eXtended Reality and Artificial Intelligence in Medicine and Rehabilitation
Tomas Krilavičius, Lucio Tommaso De Paolis, Valerio De Luca,
Josef Spjut
Energy-Based Diffusion Language Models for Text Generation
Minkai Xu,
Tomas Geffner
,
Karsten Kreis
,
Weili Nie
,
Yilun Xu
, Jure Leskovec, Stefano Ermon,
Arash Vahdat
ICLR
Truncated Consistency Models
Sangyun Lee,
Yilun Xu
,
Tomas Geffner
, Giulia Fanti,
Karsten Kreis
,
Arash Vahdat
,
Weili Nie
ICLR
Proteina: Scaling Flow-based Protein Structure Generative Models
Tomas Geffner
,
Kieran Didi
, Zuobai Zhang, Danny Reidenbach, Zhonglin Cao, Jason Yim, Mario Geiger, Christian Dallago, Emine Kucukbenli,
Arash Vahdat
,
Karsten Kreis
ICLR
ProtComposer: Compositional Protein Structure Generation with 3D Ellipsoids
Hannes Stark, Bowen Jing,
Tomas Geffner
, Jason Yim, Tommi Jaakkola,
Arash Vahdat
,
Karsten Kreis
ICLR
High-Precision Benchmarks for the Stochastic Rod
Eugene d'Eon
, Anil Prinja
Directed Graph Generation with Heat Kernels
Marc T. Law,
Karsten Kreis
,
Haggai Maron
Cosmos World Foundation Model Platform for Physical AI
Ming-Yu Liu
, Many other contributors at https://d1qx31qr3h6wln.cloudfront.net/publications/NVIDIA%20Cosmos_4.pdf
2024
Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition
Huck Yang
, Taejin Park, Yuan Gong, Yuanchao Li, Zhehuai Chen, yen-ting Lin, Chen Chen, Yuchen Hu, Kunal Dhawan, Piotr Zelasko, Chao Zhang, Yun-Nung Chen, Yu Tsao, Jagadeesh Balam, Boris Ginsburg, Shinji Watanabe, Andreas Stolcke
Aligning Target-Aware Molecule Diffusion Models with Exact Energy Optimization
Siyi Gu, Minkai Xu, Alexander Powers,
Weili Nie
,
Tomas Geffner
,
Karsten Kreis
, Jure Leskovec,
Arash Vahdat
, Stefano Ermon
NeurIPS
Molecule Generation with Fragment Retrieval Augmentation
Seul Lee,
Karsten Kreis
, Srimukh Prasad Veccham, Meng Liu, Danny Reidenbach, Saee Paliwal,
Arash Vahdat
,
Weili Nie
NeurIPS
L4GM: Large 4D Gaussian Reconstruction Model
Jiawei Ren, Kevin Xie, Ashkan Mirzaei, Hanxue Liang, Xiaohui Zeng,
Karsten Kreis
, Ziwei Liu, Antonio Torralba, Sanja Fidler, Seung Wook Kim, Huan Ling
NeurIPS
Warped Diffusion: Solving Video Inverse Problems with Image Diffusion Models
Giannis Daras,
Weili Nie
,
Karsten Kreis
, Alexandros G. Dimakis,
Morteza Mardani
,
Nikola Kovachki
,
Arash Vahdat
NeurIPS
Diffusion-Reward Adversarial Imitation Learning
Chun-Mao Lai, Hsiang-Chun Wang, Ping-Chun Hsieh,
Frank Wang
,
Min-Hung Chen
, Shao-Hua Sun
NeurIPS
Fast Encoder-Based 3D from Casual Videos via Point Track Processing
Yoni Kasten
, Wuyue Lu,
Haggai Maron
NeurIPS
Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models
Yuchen Hu, Chen Chen,
Huck Yang
, Chengwei Qin, Pin-Yu Chen, Eng Siong Chng, Chao Zhang
NeurIPS
Pagination
Current page
1
Page
2
Page
3
Page
4
Page
5
Page
6
Page
7
Page
8
Page
9
…
Next page
Next ›
Last page
Last »