Research Labs
All Research Labs
3D Deep Learning
Applied Research
Autonomous Vehicles
Deep Imagination
Publications
AI Playground
New and Featured
AI Art Gallery
NGC Demos
Research Areas
AI & Machine Learning
3D Deep Learning
Computer Vision
Robotics
All Areas
Careers
Academic Collaborations
Government Collaborations
Graduate Fellowship
Internships
Research Openings
Research Scientists
Meet the Team
Licensing
Skip to main content
Artificial Intelligence Computing Leadership from NVIDIA
Login
Research Labs
All Research Labs
3D Deep Learning
Applied Research
Autonomous Vehicles
Deep Imagination
Publications
AI Playground
New and Featured
AI Art Gallery
NGC Demos
Research Areas
AI & Machine Learning
3D Deep Learning
Computer Vision
Robotics
All Areas
Careers
Academic Collaborations
Government Collaborations
Graduate Fellowship
Internships
Research Openings
Research Scientists
Meet the Team
Licensing
Search
Search
Enter the terms you wish to search for.
Publications
Our publications provide insight into some of our leading-edge research.
Filters
Search
Apply
Filters
Filters
Publication Year
2025
(9)
2024
(94)
2023
(154)
2022
(162)
2021
(151)
2020
(135)
2019
(123)
2018
(107)
2017
(85)
2016
(57)
2015
(52)
2014
(23)
2013
(26)
2012
(20)
2011
(29)
2010
(19)
2009
(9)
2008
(14)
2007
(10)
2006
(1)
2005
(3)
2003
(1)
2001
(1)
Facet Publication Year
Research Areas
Artificial Intelligence and Machine Learning
(457)
Computer Graphics
(329)
Computer Vision
(318)
Computer Architecture
(222)
Robotics
(125)
Circuits and VLSI Design
(104)
Generative AI
(102)
High Performance Computing
(101)
Real-Time Rendering
(92)
Algorithms and Numerical Methods
(91)
VR, AR and Display Technology
(67)
Resilience and Safety
(61)
Human Computer Interaction
(52)
Programming Languages, Systems and Tools
(51)
Computational Photography and Imaging
(50)
Speech Processing
(43)
Autonomous Vehicles
(40)
Applied Perception
(31)
Natural Language Processing
(29)
Esports
(27)
Medical
(17)
Hyperscale Graphics
(16)
Networking
(13)
Telecommunications
(13)
Machine Translation
(12)
Quantum Computing
(5)
Climate Simulation
(2)
Physical AI
(1)
Storage and Systems
(1)
Events
CORL
(17)
CVPR
(41)
ECCV
(9)
ICCV
(9)
ICLR
(15)
ICML
(11)
ICRA
(30)
IROS
(13)
ISPD
(7)
NeurIPS
(29)
PLDI
(1)
RSS
(6)
SIGGRAPH
(65)
VSS
(2)
2025
Gated Delta Networks: Improving Mamba2 with Delta Rule
Songlin Yang,
Jan Kautz
,
Ali Hatamizadeh
ICLR
Composing Distributed Computations Through Task and Kernel Fusion
Rohan Yadav, Shiv Sundrum, Wonchan Lee,
Michael Garland
,
Michael Bauer
, Alex Aiken, Fredrik Kjolstad
Automatic Tracing in Task-Based Runtime Systems
Rohan Yadav,
Michael Bauer
, David Broman,
Michael Garland
, Alex Aiken, Fredrik Kjolstad
Spatio-Temporal Context Prompting for Zero-Shot Action Detection
Wei-Jhe Huang,
Min-Hung Chen
, Shang-Hong Lai
Semantic Prompt Learning for Weakly-Supervised Semantic Segmentation
Ci-Siang Lin, Chien-Yi Wang,
Frank Wang
,
Min-Hung Chen
CorrFill: Enhancing Faithfulness in Reference-based Inpainting with Correspondence Guidance in Diffusion Models
Kuan-Hung Liu, Cheng-Kun Yang,
Min-Hung Chen
, Yu-Lun Liu, Yen-Yu Lin
eXtended Reality and Artificial Intelligence in Medicine and Rehabilitation
Tomas Krilavičius, Lucio Tommaso De Paolis, Valerio De Luca,
Josef Spjut
High-Precision Benchmarks for the Stochastic Rod
Eugene d'Eon
, Anil Prinja
Cosmos World Foundation Model Platform for Physical AI
Ming-Yu Liu
, Many other contributors at https://d1qx31qr3h6wln.cloudfront.net/publications/NVIDIA%20Cosmos_4.pdf
2024
Diffusion-Reward Adversarial Imitation Learning
Chun-Mao Lai, Hsiang-Chun Wang, Ping-Chun Hsieh,
Frank Wang
,
Min-Hung Chen
, Shao-Hua Sun
NeurIPS
Fast Encoder-Based 3D from Casual Videos via Point Track Processing
Yoni Kasten
, Wuyue Lu,
Haggai Maron
NeurIPS
Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models
Yuchen Hu, Chen Chen,
Huck Yang
, Chengwei Qin, Pin-Yu Chen, Eng Siong Chng, Chao Zhang
NeurIPS
MaskedMimic: Unified Physics-Based Character Control Through Masked Motion Inpainting
Chen Tessler
, Kelly Guo, Ofir Nabati,
Gal Chechik
, Jason Peng
SIGGRAPH
Large Étendue 3D Holographic Display with Content-adpative Dynamic Fourier Modulation
Brian Chao, Manu Gopakumar, Suyeon Choi, Liang Shi,
Jonghyun Kim
, Gordon Wetzstein
SIGGRAPH
SpecTrack: Learned Multi-Rotation Tracking via Speckle Imaging
Ziyang Chen, Doğa Doğan,
Josef Spjut
, Kaan Akşit
SIGGRAPH
Honorable Mention
DRC-Coder: Automated DRC Checker Code Generation Using LLM Autonomous Agent
Chen-Chia Chang,
Chia-Tung (Mark) Ho
, Yaguang Li, Yiran Chen,
Haoxing (Mark) Ren
Fugatto 1 - Foundational Generative Audio Transformer Opus 1
Rafael Valle, Rohan Badlani, Zhifeng Kong, Sang-gil Lee, Arushi Goel, Sungwon Kim, Joao Felipe Santos, Shuqi Dai,
Siddharth Gururani
, Aya AIJa'fari, Alex Liu, Kevin Shih, Wei Ping,
Huck Yang
, Bryan Catanzaro
Appearance Modeling of Iridescent Feathers with Diverse Nanostructures
Yunchen Yu,
Andrea Weidlich
, Bruce Walter,
Eugene d'Eon
, Steve Marschner
SIGGRAPH
SIGGRAPH Asia 2024 Best Paper Award
scene_synthesizer: A Python Library for Procedural Scene Generation in Robot Manipulation
Clemens Eppner
,
Adithya Murali
,
Caelan Garrett
,
Rowland O'Flaherty
,
Tucker Hermans
,
Wei Yang
,
Dieter Fox
Bayesian Example Selection Improves In-Context Learning for Speech, Text, and Visual Modalities
Siyin Wang,
Huck Yang
, Ji Wu, Chao Zhang
From Descriptive Richness to Bias: Unveiling the Dark Side of Generative Image Caption Enrichment
Yusuke Hirota,
Ryo Hachiuma
,
Huck Yang
, Yuta Nakashima
FastAdaSP: Multitask-Adapted Efficient Inference for Large Speech Language Model
Yichen Lu, Jiaqi Song,
Huck Yang
, Shinji Watanabe
Differentiable GPU-Parallelized Task and Motion Planning
William Shen,
Caelan Garrett
,
Ankit Goyal
,
Tucker Hermans
,
Fabio Ramos
CORL
SPIRE: Synergistic Planning, Imitation, and Reinforcement Learning for Long-Horizon Manipulation
Zihan Zhou, Animesh Garg,
Dieter Fox
,
Caelan Garrett
,
Ajay Mandlekar
CORL
SkillGen: Automated Demonstration Generation for Efficient Skill Learning and Deployment
Caelan Garrett
,
Ajay Mandlekar
,
Bowen Wen
,
Dieter Fox
CORL
NOD-TAMP: Generalizable Long-Horizon Planning with Neural Object Descriptors
Shuo Cheng,
Caelan Garrett
,
Ajay Mandlekar
,
Danfei Xu
CORL
Reconstructing Translucent Thin Objects from Photos
Xi Deng,
Lifan Wu
, Bruce Walter,
Eugene d'Eon
, Ravi Ramamoorthi, Steve Marschner,
Andrea Weidlich
SIGGRAPH
HAMSTER: Hierarchical Action Models for Open-World Robot Manipulation
Yi Li, Yuquan Deng, Jesse Zhang, Joel Jang, Marius Memmel,
Caelan Garrett
,
Fabio Ramos
,
Dieter Fox
,
Anqi Li
, Abhishek Gupta,
Ankit Goyal
CORL
Open-World Task and Motion Planning via Vision-Language Model Inferred Constraints
Nishanth Kumar,
Fabio Ramos
,
Dieter Fox
,
Caelan Garrett
CORL
Guiding Long-Horizon Task and Motion Planning with Vision Language Models
Zhutian Yang,
Caelan Garrett
,
Dieter Fox
, Tomás Lozano-Pérez, Leslie Pack Kaelbling
CORL
Constructability-driven design of frame structures with state-space search methods
Yijiang Huang,
Caelan Garrett
, Caitlin Mueller
Interventional Data Generation for Robust and Data-Efficient Robot Imitation Learning
Ryan Hoque,
Ajay Mandlekar
,
Caelan Garrett
, Ken Goldberg,
Dieter Fox
IROS
Pagination
Current page
1
Page
2
Page
3
Page
4
Page
5
Page
6
Page
7
Page
8
Page
9
…
Next page
Next ›
Last page
Last »