Research Labs
All Research Labs
3D Deep Learning
Applied Research
Autonomous Vehicles
Deep Imagination
Publications
AI Playground
New and Featured
AI Art Gallery
NGC Demos
Research Areas
AI & Machine Learning
3D Deep Learning
Computer Vision
Robotics
All Areas
Careers
Academic Collaborations
Government Collaborations
Graduate Fellowship
Internships
Research Openings
Research Scientists
Meet the Team
Licensing
Skip to main content
Artificial Intelligence Computing Leadership from NVIDIA
Login
Research Labs
All Research Labs
3D Deep Learning
Applied Research
Autonomous Vehicles
Deep Imagination
Publications
AI Playground
New and Featured
AI Art Gallery
NGC Demos
Research Areas
AI & Machine Learning
3D Deep Learning
Computer Vision
Robotics
All Areas
Careers
Academic Collaborations
Government Collaborations
Graduate Fellowship
Internships
Research Openings
Research Scientists
Meet the Team
Licensing
Search
Search
Enter the terms you wish to search for.
Publications
Our publications provide insight into some of our leading-edge research.
Filters
Search
Apply
Filters
Filters
Publication Year
2025
(4)
2024
(18)
2023
(46)
2022
(52)
2021
(31)
2020
(48)
2019
(37)
2018
(43)
2017
(19)
2016
(6)
2015
(7)
2014
(2)
2013
(3)
2012
(3)
2011
(1)
2010
(1)
Facet Publication Year
Research Areas
Computer Vision
(321)
Artificial Intelligence and Machine Learning
(180)
Robotics
(54)
Generative AI
(47)
Computer Graphics
(42)
Computational Photography and Imaging
(19)
Autonomous Vehicles
(18)
Human Computer Interaction
(13)
VR, AR and Display Technology
(13)
Applied Perception
(11)
Medical
(6)
Real-Time Rendering
(6)
Resilience and Safety
(6)
Hyperscale Graphics
(5)
Natural Language Processing
(5)
High Performance Computing
(3)
Algorithms and Numerical Methods
(2)
Esports
(2)
Computer Architecture
(1)
Speech Processing
(1)
Events
CORL
(6)
CVPR
(41)
ECCV
(7)
ICCV
(7)
ICLR
(8)
ICML
(2)
ICRA
(16)
IROS
(7)
NeurIPS
(16)
RSS
(3)
SIGGRAPH
(7)
321 results found
Computer Vision
Clear all
Computer Vision
2023
Robust and Controllable Object-Centric Learning through Energy-based Models
Ruixiang Zhang,
Gerry Che
,
Boris Ivanovic
, Renhao Wang,
Marco Pavone
, Yoshua Bengio, Liam Paull
ICLR
Target-free Text-guided Image Manipulation
Wan-Cyuan Fan, Cheng-Fu Yang, Chiao-An Yang,
Frank Wang
Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis
Wan-Cyuan Fan, Yen-Chun Chen, Dongdong Chen, Yu Cheng, Lu Yuan,
Frank Wang
Self-Supervised Pyramid Representation Learning for Multi-Label Visual Analysis and Beyond
Cheng-Yen Hsieh, Chih-Jung Chang, Fu-En Yang,
Frank Wang
2022
Learning Robust Real-World Dexterous Grasping Policies via Implicit Shape Augmentation
Qiuyu Chen, Karl Van Wyk,
Yu-Wei Chao
,
Wei Yang
,
Arsalan Mousavian
, Abhishek Gupta,
Dieter Fox
CORL
Task-Relevant Failure Detection for Trajectory Predictors in Autonomous Vehicles
Alec Farid,
Sushant Veer
,
Boris Ivanovic
,
Karen Leung
,
Marco Pavone
CORL
Robust Trajectory Prediction against Adversarial Attacks
Yulong Cao
,
Danfei Xu
,
Xinshuo Weng
, Z. Morely Mao, Anima Anandkumar,
Chaowei Xiao
,
Marco Pavone
CORL
Selected for Oral Presentation
MegaPose: 6D Pose Estimation of Novel Objects via Render & Compare
Yann Labbe, Lucas Manuelli,
Arsalan Mousavian
,
Stephen Tyree
,
Stan Birchfield
,
Jonathan Tremblay
, et al.
CORL
Motion Policy Networks
Adam Fishman,
Adithya Murali
,
Clemens Eppner
, Bryan Peele, Byron Boots,
Dieter Fox
"This is my unicorn, Fluffy": Personalizing frozen vision-language representations
Niv Cohen,
Rinon Gal
,
Eli Meirom
,
Gal Chechik
,
Yuval Atzmon
ECCV
Paraphrasing Is All You Need for Novel Object Captioning
Cheng-Fu Yang, Yao-Hung Hubert Tsai, Wan-Cyuan Fan, Ruslan Salakhutdinov, Louis-Philippe Morency,
Frank Wang
NeurIPS
Structural Pruning via Latency-Saliency Knapsack
Maying Shen,
Hongxu Danny Yin
,
Pavlo Molchanov
, Lei Mao, Jianna Liu, Jose M. Alvarez
Embodied Scene-aware Human Pose Estimation
Zhengyi Luo, Shun Iwase,
Ye Yuan
, Kris Kitani
NeurIPS
SPoVT: Semantic-Prototype Variational Transformer for Dense Point Cloud Semantic Completion
Sheng-Yu Huang, Hao-Yu Hsu,
Yu-Chiang Frank Wang
NeurIPS
GENIE: Higher-Order Denoising Diffusion Solvers
Tim Dockhorn,
Arash Vahdat
,
Karsten Kreis
NeurIPS
6-DoF Pose Estimation of Household Objects for Robotic Manipulation: An Accessible Dataset and Benchmark
Stephen Tyree
,
Jonathan Tremblay
,
Stan Birchfield
, et al.
IROS
Heterogeneous-Agent Trajectory Forecasting Incorporating Class Uncertainty
Boris Ivanovic
, Kuan-Hui Lee, Pavel Tokmakov, Blake Wulfe, Adrien Gaidon,
Marco Pavone
Text2LIVE: Text-Driven Layered Image and Video Editing
Omer Bar-Tal, Dolev Ofri-Amar, Rafail Fridman,
Yoni Kasten
, Tali Dekel
ECCV
AdvDO: Realistic Adversarial Attacks for Trajectory Prediction
Yulong Cao
,
Chaowei Xiao
, Anima Anandkumar,
Danfei Xu
,
Marco Pavone
ECCV
LANA: Latency Aware Network Acceleration
Pavlo Molchanov
, Jimmy Hall,
Hongxu Danny Yin
,
Jan Kautz
, Nicolo Fusi,
Arash Vahdat
ECCV
Audio-Visual Segmentation
Jinxin Zhou, Yiran Zhong,
Stan Birchfield
, et al.
Shape, Light, and Material Decomposition from Images using Monte Carlo Rendering and Denoising
Jon Hasselgren
, Nikolai Hofmann,
Jacob Munkberg
NeurIPS
Variable Bitrate Neural Fields
Towaki Takikawa, Alex Evans,
Jonathan Tremblay
,
Thomas Müller
, Morgan McGuire, Alec Jacobson, Sanja Fidler
An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion
Rinon Gal
, Yuval Alaluf,
Yuval Atzmon
, Or Patashnik, Amit H. Bermano,
Gal Chechik
, Daniel Cohen-Or
ICLR
Top 25%
Instant Neural Graphics Primitives with a Multiresolution Hash Encoding
Thomas Müller
, Alex Evans, Christoph Schied,
Alex Keller
SIGGRAPH
Best Technical Paper, SIGGRAPH 2022
THE BEST INVENTIONS OF 2022, TIME
CoordGAN: Self-Supervised Dense Correspondences Emerge from GANs
Jiteng Mu,
Shalini De Mello
,
Zhiding Yu
, Nuno Vasconcelos, Xiaolong Wang,
Sifei Liu
CVPR
Whose Track Is It Anyway? Improving Robustness to Tracking Errors with Affinity-Based Prediction
Xinshuo Weng
,
Boris Ivanovic
, Kris Kitani,
Marco Pavone
CVPR
ScePT: Scene-consistent, Policy-based Trajectory Predictions for Planning
Yuxiao Chen
,
Boris Ivanovic
,
Marco Pavone
Polymorphic-GAN: Generating Aligned Samples across Multiple Domains with Learned Morph Maps
Seung Wook Kim,
Karsten Kreis
, Daiqing Li, Antonio Torralba, Sanja Fidler
CVPR
GLAMR: Global Occlusion-Aware Human Mesh Recovery with Dynamic Cameras
Ye Yuan
,
Umar Iqbal
,
Pavlo Molchanov
, Kris Kitani,
Jan Kautz
CVPR
Ifor: Iterative flow minimization for robotic object rearrangement
Ankit Goyal
,
Arsalan Mousavian
, Chris Paxton,
Yu-Wei Chao
, Brian Okorn, Jia Deng,
Dieter Fox
GroupViT: Semantic Segmentation Emerges from Text Supervision
Jiarui Xu,
Shalini De Mello
,
Sifei Liu
,
Wonmin Byeon
,
Thomas Breuel
,
Jan Kautz
, Xiaolong Wang
CVPR
Pagination
First page
« First
Previous page
‹ Previous
Page
1
Page
2
Current page
3
Page
4
Page
5
Page
6
Page
7
Page
8
Page
9
…
Next page
Next ›
Last page
Last »