Research Labs
All Research Labs
3D Deep Learning
Applied Research
Autonomous Vehicles
Deep Imagination
Publications
AI Playground
New and Featured
AI Art Gallery
NGC Demos
Research Areas
AI & Machine Learning
3D Deep Learning
Computer Vision
Robotics
All Areas
Careers
Academic Collaborations
Government Collaborations
Graduate Fellowship
Internships
Research Openings
Research Scientists
Meet the Team
Licensing
Artificial Intelligence Computing Leadership from NVIDIA
Publications
Our publications provide insight into some of our leading-edge research.
18 results found
Filters: Computer Vision, 2024
Fast Encoder-Based 3D from Casual Videos via Point Track Processing
Yoni Kasten, Wuyue Lu, Haggai Maron
NeurIPS
Bayesian Example Selection Improves In-Context Learning for Speech, Text, and Visual Modalities
Siyin Wang, Huck Yang, Ji Wu, Chao Zhang
From Descriptive Richness to Bias: Unveiling the Dark Side of Generative Image Caption Enrichment
Yusuke Hirota, Ryo Hachiuma, Huck Yang, Yuta Nakashima
Proto-CLIP: Vision-Language Prototypical Network for Few-Shot Learning
Jishnu Jaykumar P, Kamalesh Palanisamy, Yu-Wei Chao, Xinya Du, Yu Xiang
IROS
Mitigating Covariate Shift in Imitation Learning for Autonomous Vehicles Using Latent Space Generative World Models
Alexander Popov, Alperen Degirmenci, David Wehr, Shashank Hegde, Ryan Oldja, Alexey Kamenev, Bertrand Douillard, David Nistér, Urs Muller, Ruchi Bhargava, Stan Birchfield, Nikolai Smolyanskiy
TurboEdit: Text-Based Image Editing Using Few-Step Diffusion Models
Gilad Deutch, Rinon Gal, Daniel Garibi, Or Patashnik, Daniel Cohen-Or
SIGGRAPH
DoRA: Weight-Decomposed Low-Rank Adaptation
Shih-Yang Liu, Chien-Yi Wang, Hongxu Danny Yin, Pavlo Molchanov, Frank Wang, Kwang-Ting Cheng, Min-Hung Chen
ICML
RVT-2: Learning Precise Manipulation from Few Examples
Ankit Goyal, Valts Blukis, Jie Xu, Yijie Guo, Yu-Wei Chao, Dieter Fox
RSS
Breathing Life Into Sketches Using Text-to-Video Priors
Rinon Gal, Yael Vinker, Yuval Alaluf, Amit Bermano, Daniel Cohen-Or, Ariel Shamir, Gal Chechik
CVPR
Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models
Huan Ling, Seung Wook Kim, Antonio Torralba, Sanja Fidler, Karsten Kreis
CVPR
FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects
Bowen Wen, Wei Yang, Jan Kautz, Stan Birchfield
CVPR
NeRFDeformer: NeRF Transformation from a Single View via 3D Scene Flows
Zhenggang Tang, Zhongzheng Ren, Xiaoming Zhao, Bowen Wen, Jonathan Tremblay, Stan Birchfield, Alexander Schwing
CVPR
Neural Implicit Representation for Building Digital Twins of Unknown Articulated Objects
Yijia Weng, Bowen Wen, Jonathan Tremblay, Valts Blukis, Dieter Fox, Leo Guibas, Stan Birchfield
CVPR
SynH2R: Synthesizing Hand-Object Motions for Learning Human-to-Robot Handovers
Sammy Christen, Lan Feng, Wei Yang, Yu-Wei Chao, Otmar Hilliges, Jie Song
ICRA
FasterViT: Fast Vision Transformers with Hierarchical Attention
Ali Hatamizadeh, Greg Heinrich, Hongxu Danny Yin, Andrew Tao, Jose M. Alvarez, Jan Kautz, Pavlo Molchanov
ICLR
WildFusion: Learning 3D-Aware Latent Diffusion Models in View Space
Katja Schwarz, Seung Wook Kim, Jun Gao, Sanja Fidler, Andreas Geiger, Karsten Kreis
ICLR
LCM-Lookahead for Encoder-based Text-to-Image Personalization
Rinon Gal, Or Lichter, Elad Richardson, Or Patashnik, Amit H Bermano, Gal Chechik, Daniel Cohen-Or
ECCV
Consolidating Attention Features for Multi-view Image Editing
Or Patashnik, Rinon Gal, Daniel Cohen-Or, Jun-Yan Zhu, Fernando De la Torre
SIGGRAPH