Research Labs
All Research Labs
3D Deep Learning
Applied Research
Autonomous Vehicles
Deep Imagination
Publications
AI Playground
New and Featured
AI Art Gallery
NGC Demos
Research Areas
AI & Machine Learning
3D Deep Learning
Computer Vision
Robotics
All Areas
Careers
Academic Collaborations
Government Collaborations
Graduate Fellowship
Internships
Research Openings
Research Scientists
Meet the Team
Licensing
Skip to main content
Artificial Intelligence Computing Leadership from NVIDIA
Login
Research Labs
All Research Labs
3D Deep Learning
Applied Research
Autonomous Vehicles
Deep Imagination
Publications
AI Playground
New and Featured
AI Art Gallery
NGC Demos
Research Areas
AI & Machine Learning
3D Deep Learning
Computer Vision
Robotics
All Areas
Careers
Academic Collaborations
Government Collaborations
Graduate Fellowship
Internships
Research Openings
Research Scientists
Meet the Team
Licensing
Search
Search
Enter the terms you wish to search for.
Publications
Our publications provide insight into some of our leading-edge research.
Filters
Search
Apply
Filters
Filters
Publication Year
2024
(4)
2023
(46)
2022
(52)
2021
(30)
2020
(48)
2019
(37)
2018
(43)
2017
(19)
2016
(6)
2015
(7)
2014
(2)
2013
(3)
2012
(3)
2011
(1)
2010
(1)
Facet Publication Year
Research Areas
Computer Vision
(303)
Artificial Intelligence and Machine Learning
(175)
Robotics
(50)
Generative AI
(41)
Computer Graphics
(38)
Computational Photography and Imaging
(19)
Autonomous Vehicles
(13)
Human Computer Interaction
(13)
VR, AR and Display Technology
(13)
Medical
(6)
Real-Time Rendering
(6)
Resilience and Safety
(6)
Hyperscale Graphics
(5)
Applied Perception
(4)
High Performance Computing
(3)
Algorithms and Numerical Methods
(2)
Esports
(2)
Computer Architecture
(1)
Events
CORL
(6)
CVPR
(36)
ECCV
(5)
ICCV
(7)
ICLR
(8)
ICML
(1)
ICRA
(15)
IROS
(6)
NeurIPS
(15)
RSS
(2)
SIGGRAPH
(5)
303 results found
Computer Vision
Clear all
Computer Vision
2023
Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models
Jiarui Xu,
Sifei Liu
,
Arash Vahdat
,
Wonmin Byeon
, Xiaolong Wang,
Shalini De Mello
CVPR
Hightlight top 10%
Zero-shot Pose Transfer for Unrigged Stylized 3D Characters
Jiashun Wang,
Xueting Li
,
Sifei Liu
,
Shalini De Mello
,
Orazio Gallo
, Xiaolong Wang,
Jan Kautz
CVPR
GazeNeRF: 3D-Aware Gaze Redirection with Neural Radiance Fields
Alessandro Ruzzi, Xiangwei Shi, Xi Wang, Gengyan Li,
Shalini De Mello
, Hyung Jin Chang, Xucong Zhang, Otmar Hilliges
CVPR
BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects
Bowen Wen
,
Jonathan Tremblay
,
Valts Blukis
,
Stephen Tyree
,
Thomas Müller
, Alex Evans,
Dieter Fox
,
Jan Kautz
,
Stan Birchfield
CVPR
Neuralangelo: High-Fidelity Neural Surface Reconstruction
Max Zhaoshuo Li
,
Thomas Müller
, Alex Evans, Russell H. Taylor, Mathias Unberath,
Ming-Yu Liu
,
Chen-Hsuan Lin
CVPR
The Best Inventions of 2023, TIME Magazine
Planning for Multi-Object Manipulation with Graph Neural Network Relational Classifiers
Yixuan Huang, Adam Conkey,
Tucker Hermans
ICRA
Magic3D: High-Resolution Text-to-3D Content Creation
Chen-Hsuan Lin
, Jun Gao, Luming Tang,
Towaki Takikawa
, Xiaohui Zeng,
Xun Huang
,
Karsten Kreis
, Sanja Fidler,
Ming-Yu Liu
,
Tsung-Yi Lin
CVPR
Planning with Occluded Traffic Agents using Bi-Level Variational Occlusion Models
Filippos Christianos,
Peter Karkus
,
Boris Ivanovic
, Stefano V. Albrecht,
Marco Pavone
ICRA
Parallel Inversion of Neural Radiance Fields for Robust Pose Estimation
Yunzhi Lin,
Thomas Müller
,
Jonathan Tremblay
,
Bowen Wen
,
Stephen Tyree
, Alex Evans, Patricio A. Vela,
Stan Birchfield
ICRA
FewSOL: A Dataset for Few-Shot Object Learning in Robotic Environments
Jishnu Jaykumar P,
Yu-Wei Chao
, Yu Xiang
ICRA
The Best Defense is a Good Offense: Adversarial Augmentation against Adversarial Attacks
Iuri Frosio
,
Jan Kautz
CVPR
RGB-Only Reconstruction of Tabletop Scenes for Collision-Free Manipulator Control
Zhenggang Tang,
Balakumar Sundaralingam
,
Jonathan Tremblay
,
Bowen Wen
,
Ye Yuan
,
Stephen Tyree
,
Charles Loop
, Alexander Schwing,
Stan Birchfield
ICRA
Subpixel Deblurring of Anti-Aliased Raster Clip Art
Jinfan Yang,
Nicholas Vining
, Shakiba Kheradmand, Nathan Carr, Leonid Sigal, Alla Sheffer
GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation
Chenhongyi Yang, Jiarui Xu,
Shalini De Mello
, Elliot J. Crowley, Xiaolong Wang
ICLR
Notable top 25%
Oral
Robust and Controllable Object-Centric Learning through Energy-based Models
Ruixiang Zhang,
Gerry Che
,
Boris Ivanovic
, Renhao Wang,
Marco Pavone
, Yoshua Bengio, Liam Paull
ICLR
Target-free Text-guided Image Manipulation
Wan-Cyuan Fan, Cheng-Fu Yang, Chiao-An Yang,
Frank Wang
Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis
Wan-Cyuan Fan, Yen-Chun Chen, Dongdong Chen, Yu Cheng, Lu Yuan,
Frank Wang
Self-Supervised Pyramid Representation Learning for Multi-Label Visual Analysis and Beyond
Cheng-Yen Hsieh, Chih-Jung Chang, Fu-En Yang,
Frank Wang
2022
Learning Robust Real-World Dexterous Grasping Policies via Implicit Shape Augmentation
Qiuyu Chen,
Karl Van Wyk
,
Yu-Wei Chao
,
Wei Yang
,
Arsalan Mousavian
, Abhishek Gupta,
Dieter Fox
CORL
Task-Relevant Failure Detection for Trajectory Predictors in Autonomous Vehicles
Alec Farid,
Sushant Veer
,
Boris Ivanovic
,
Karen Leung
,
Marco Pavone
CORL
Robust Trajectory Prediction against Adversarial Attacks
Yulong Cao
,
Danfei Xu
,
Xinshuo Weng
, Z. Morely Mao, Anima Anandkumar,
Chaowei Xiao
,
Marco Pavone
CORL
Selected for Oral Presentation
MegaPose: 6D Pose Estimation of Novel Objects via Render & Compare
Yann Labbe, Lucas Manuelli,
Arsalan Mousavian
,
Stephen Tyree
,
Stan Birchfield
,
Jonathan Tremblay
, et al.
CORL
Motion Policy Networks
Adam Fishman,
Adithya Murali
,
Clemens Eppner
, Bryan Peele, Byron Boots,
Dieter Fox
"This is my unicorn, Fluffy": Personalizing frozen vision-language representations
Niv Cohen,
Rinon Gal
,
Eli Meirom
,
Gal Chechik
,
Yuval Atzmon
ECCV
Paraphrasing Is All You Need for Novel Object Captioning
Cheng-Fu Yang, Yao-Hung Hubert Tsai, Wan-Cyuan Fan, Ruslan Salakhutdinov, Louis-Philippe Morency,
Frank Wang
NeurIPS
Structural Pruning via Latency-Saliency Knapsack
Maying Shen,
Hongxu Danny Yin
,
Pavlo Molchanov
, Lei Mao, Jianna Liu, Jose M. Alvarez
Embodied Scene-aware Human Pose Estimation
Zhengyi Luo, Shun Iwase,
Ye Yuan
, Kris Kitani
NeurIPS
SPoVT: Semantic-Prototype Variational Transformer for Dense Point Cloud Semantic Completion
Sheng-Yu Huang, Hao-Yu Hsu,
Yu-Chiang Frank Wang
NeurIPS
GENIE: Higher-Order Denoising Diffusion Solvers
Tim Dockhorn,
Arash Vahdat
,
Karsten Kreis
NeurIPS
6-DoF Pose Estimation of Household Objects for Robotic Manipulation: An Accessible Dataset and Benchmark
Stephen Tyree
,
Jonathan Tremblay
,
Stan Birchfield
, et al.
IROS
Heterogeneous-Agent Trajectory Forecasting Incorporating Class Uncertainty
Boris Ivanovic
, Kuan-Hui Lee, Pavel Tokmakov, Blake Wulfe, Adrien Gaidon,
Marco Pavone
Text2LIVE: Text-Driven Layered Image and Video Editing
Omer Bar-Tal, Dolev Ofri-Amar, Rafail Fridman,
Yoni Kasten
, Tali Dekel
ECCV
Pagination
First page
« First
Previous page
‹ Previous
Page
1
Current page
2
Page
3
Page
4
Page
5
Page
6
Page
7
Page
8
Page
9
…
Next page
Next ›
Last page
Last »