Research Labs
All Research Labs
3D Deep Learning
Applied Research
Autonomous Vehicles
Deep Imagination
Publications
AI Playground
New and Featured
AI Art Gallery
NGC Demos
Research Areas
AI & Machine Learning
3D Deep Learning
Computer Vision
Robotics
All Areas
Careers
Academic Collaborations
Government Collaborations
Graduate Fellowship
Internships
Research Openings
Research Scientists
Meet the Team
Licensing
Skip to main content
Publications
Our publications provide insight into some of our leading-edge research.
Filters
Search
Apply
Filters
Filters
Publication Year
2025
(9)
2024
(22)
2023
(46)
2022
(52)
2021
(31)
2020
(48)
2019
(37)
2018
(43)
2017
(19)
2016
(6)
2015
(7)
2014
(2)
2013
(3)
2012
(3)
2011
(1)
2010
(1)
Facet Publication Year
Research Areas
Computer Vision
(74)
Artificial Intelligence and Machine Learning
(36)
Generative AI
(19)
Robotics
(14)
Autonomous Vehicles
(12)
Computer Graphics
(11)
Applied Perception
(3)
Computational Photography and Imaging
(3)
Hyperscale Graphics
(3)
Natural Language Processing
(3)
Real-Time Rendering
(3)
Resilience and Safety
(3)
Algorithms and Numerical Methods
(2)
High Performance Computing
(2)
VR, AR and Display Technology
(2)
Speech Processing
(1)
Events
CORL
(4)
CVPR
(15)
ECCV
(6)
ICLR
(6)
ICML
(1)
ICRA
(6)
IROS
(2)
NeurIPS
(8)
RSS
(1)
SIGGRAPH
(3)
74 results found
Computer Vision
Clear all
2024
2022
Computer Vision
2024
L4GM: Large 4D Gaussian Reconstruction Model
Jiawei Ren, Kevin Xie, Ashkan Mirzaei, Hanxue Liang, Xiaohui Zeng,
Karsten Kreis
, Ziwei Liu, Antonio Torralba, Sanja Fidler, Seung Wook Kim, Huan Ling
NeurIPS
Warped Diffusion: Solving Video Inverse Problems with Image Diffusion Models
Giannis Daras,
Weili Nie
,
Karsten Kreis
, Alexandros G. Dimakis,
Morteza Mardani
,
Nikola Kovachki
,
Arash Vahdat
NeurIPS
Fast Encoder-Based 3D from Casual Videos via Point Track Processing
Yoni Kasten
, Wuyue Lu,
Haggai Maron
NeurIPS
Bayesian Example Selection Improves In-Context Learning for Speech, Text, and Visual Modalities
Siyin Wang,
Huck Yang
, Ji Wu, Chao Zhang
From Descriptive Richness to Bias: Unveiling the Dark Side of Generative Image Caption Enrichment
Yusuke Hirota,
Ryo Hachiuma
,
Huck Yang
, Yuta Nakashima
ReMatching Dynamic Reconstruction Flow
Sara Oblak, Despoina Paschalidou, Sanja Fidler, Matan Atzmon
ICLR
Proto-CLIP: Vision-Language Prototypical Network for Few-Shot Learning
Jishnu Jaykumar P, Kamalesh Palanisamy,
Yu-Wei Chao
, Xinya Du, Yu Xiang
IROS
Mitigating Covariate Shift in Imitation Learning for Autonomous Vehicles Using Latent Space Generative World Models
Alexander Popov, Alperen Degirmenci, David Wehr, Shashank Hegde , Ryan Oldja, Alexey Kamenev, Bertrand Douillard, David Nistér, Urs Muller, Ruchi Bhargava,
Stan Birchfield
, Nikolai Smolyanskiy
TurboEdit: Text-Based Image Editing Using Few-Step Diffusion Models
Gilad Deutch,
Rinon Gal
, Daniel Garibi, Or Patashnik, Daniel Cohen-Or
SIGGRAPH
DoRA: Weight-Decomposed Low-Rank Adaptation
Shih-Yang Liu, Chien-Yi Wang,
Hongxu Danny Yin
,
Pavlo Molchanov
,
Frank Wang
, Kwang-Ting Cheng,
Min-Hung Chen
ICML
RVT-2: Learning Precise Manipulation from Few Examples
Ankit Goyal
,
Valts Blukis
,
Jie Xu
,
Yijie Guo
,
Yu-Wei Chao
, Dieter Fox
RSS
Breathing Life Into Sketches Using Text-to-Video Priors
Rinon Gal
, Yael Vinker, Yuval Alaluf, Amit Bermano, Daniel Cohen-Or, Ariel Shamir,
Gal Chechik
CVPR
Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models
Huan Ling, Seung Wook Kim, Antonio Torralba, Sanja Fidler,
Karsten Kreis
CVPR
Outdoor Scene Extrapolation with Hierarchical Generative Cellular Automata
Dongsu Zhang, Francis Williams, Zan Gojcic,
Karsten Kreis
, Sanja Fidler, Young Min Kim, Amlan Kar
CVPR
FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects
Bowen Wen
,
Wei Yang
,
Jan Kautz
,
Stan Birchfield
CVPR
NeRFDeformer: NeRF Transformation from a Single View via 3D Scene Flows
Zhenggang Tang, Zhongzheng Ren, Xiaoming Zhao,
Bowen Wen
,
Jonathan Tremblay
,
Stan Birchfield
, Alexander Schwing
CVPR
Neural Implicit Representation for Building Digital Twins of Unknown Articulated Objects
Yijia Weng,
Bowen Wen
,
Jonathan Tremblay
,
Valts Blukis
, Dieter Fox, Leo Guibas,
Stan Birchfield
CVPR
SynH2R: Synthesizing Hand-Object Motions for Learning Human-to-Robot Handovers
Sammy Christen, Lan Feng,
Wei Yang
,
Yu-Wei Chao
, Otmar Hilliges, Jie Song
ICRA
FasterViT: Fast Vision Transformers with Hierarchical Attention
Ali Hatamizadeh
,
Greg Heinrich
,
Hongxu Danny Yin
, Andrew Tao, Jose M. Alvarez,
Jan Kautz
,
Pavlo Molchanov
ICLR
WildFusion: Learning 3D-Aware Latent Diffusion Models in View Space
Katja Schwarz, Seung Wook Kim, Jun Gao, Sanja Fidler, Andreas Geiger,
Karsten Kreis
ICLR
LCM-Lookahead for Encoder-based Text-to-Image Personalization
Rinon Gal
, Or Lichter, Elad Richardson, Or Patashnik, Amit H Bermano,
Gal Chechik
, Daniel Cohen-Or
ECCV
Consolidating Attention Features for Multi-view Image Editing
Or Patashnik,
Rinon Gal
, Daniel Cohen-Or, Jun-Yan Zhu, Fernando De la Torre
SIGGRAPH
2022
Learning Robust Real-World Dexterous Grasping Policies via Implicit Shape Augmentation
Qiuyu Chen, Karl Van Wyk,
Yu-Wei Chao
,
Wei Yang
, Arsalan Mousavian, Abhishek Gupta, Dieter Fox
CORL
Task-Relevant Failure Detection for Trajectory Predictors in Autonomous Vehicles
Alec Farid,
Sushant Veer
,
Boris Ivanovic
,
Karen Leung
,
Marco Pavone
CORL
Robust Trajectory Prediction against Adversarial Attacks
Yulong Cao
,
Danfei Xu
,
Xinshuo Weng
, Z. Morely Mao, Anima Anandkumar,
Chaowei Xiao
,
Marco Pavone
CORL
Selected for Oral Presentation
MegaPose: 6D Pose Estimation of Novel Objects via Render & Compare
Yann Labbe, Lucas Manuelli, Arsalan Mousavian,
Stephen Tyree
,
Stan Birchfield
,
Jonathan Tremblay
, et al.
CORL
Motion Policy Networks
Adam Fishman,
Adithya Murali
, Clemens Eppner, Bryan Peele, Byron Boots, Dieter Fox
"This is my unicorn, Fluffy": Personalizing frozen vision-language representations
Niv Cohen,
Rinon Gal
,
Eli Meirom
,
Gal Chechik
,
Yuval Atzmon
ECCV
Paraphrasing Is All You Need for Novel Object Captioning
Cheng-Fu Yang, Yao-Hung Hubert Tsai, Wan-Cyuan Fan, Ruslan Salakhutdinov, Louis-Philippe Morency,
Frank Wang
NeurIPS
Structural Pruning via Latency-Saliency Knapsack
Maying Shen,
Hongxu Danny Yin
,
Pavlo Molchanov
, Lei Mao, Jianna Liu, Jose M. Alvarez
Embodied Scene-aware Human Pose Estimation
Zhengyi Luo, Shun Iwase,
Ye Yuan
, Kris Kitani
NeurIPS
SPoVT: Semantic-Prototype Variational Transformer for Dense Point Cloud Semantic Completion
Sheng-Yu Huang, Hao-Yu Hsu,
Yu-Chiang Frank Wang
NeurIPS
Pagination
Current page
1
Page
2
Page
3
Next page
Next ›
Last page
Last »