Research Labs
All Research Labs
3D Deep Learning
Applied Research
Autonomous Vehicles
Deep Imagination
Publications
AI Playground
New and Featured
AI Art Gallery
NGC Demos
Research Areas
AI & Machine Learning
3D Deep Learning
Computer Vision
Robotics
All Areas
Careers
Academic Collaborations
Government Collaborations
Graduate Fellowship
Internships
Research Openings
Research Scientists
Meet the Team
Licensing
Skip to main content
Artificial Intelligence Computing Leadership from NVIDIA
Login
Research Labs
All Research Labs
3D Deep Learning
Applied Research
Autonomous Vehicles
Deep Imagination
Publications
AI Playground
New and Featured
AI Art Gallery
NGC Demos
Research Areas
AI & Machine Learning
3D Deep Learning
Computer Vision
Robotics
All Areas
Careers
Academic Collaborations
Government Collaborations
Graduate Fellowship
Internships
Research Openings
Research Scientists
Meet the Team
Licensing
Search
Search
Enter the terms you wish to search for.
Publications
Our publications provide insight into some of our leading-edge research.
Filters
Search
Apply
Filters
Filters
Publication Year
2025
(4)
2024
(4)
2023
(33)
2022
(28)
2021
(18)
2020
(33)
2019
(22)
2018
(27)
2017
(6)
2016
(3)
2015
(2)
Facet Publication Year
Research Areas
Artificial Intelligence and Machine Learning
(180)
Computer Vision
(180)
Generative AI
(40)
Computer Graphics
(30)
Computational Photography and Imaging
(11)
Autonomous Vehicles
(10)
Human Computer Interaction
(9)
Robotics
(9)
Applied Perception
(5)
Real-Time Rendering
(5)
Hyperscale Graphics
(4)
Resilience and Safety
(4)
Natural Language Processing
(3)
Esports
(2)
High Performance Computing
(2)
Medical
(2)
VR, AR and Display Technology
(2)
Events
CORL
(3)
CVPR
(20)
ECCV
(4)
ICCV
(3)
ICLR
(8)
ICML
(2)
ICRA
(2)
NeurIPS
(12)
RSS
(1)
SIGGRAPH
(5)
180 results found
Artificial Intelligence and Machine Learning
Computer Vision
Clear all
Artificial Intelligence and Machine Learning
Computer Vision
2025
MambaVision: A Hybrid Mamba-Transformer Vision Backbone
Ali Hatamizadeh
,
Jan Kautz
CVPR
Spatio-Temporal Context Prompting for Zero-Shot Action Detection
Wei-Jhe Huang,
Min-Hung Chen
, Shang-Hong Lai
Semantic Prompt Learning for Weakly-Supervised Semantic Segmentation
Ci-Siang Lin, Chien-Yi Wang,
Frank Wang
,
Min-Hung Chen
CorrFill: Enhancing Faithfulness in Reference-based Inpainting with Correspondence Guidance in Diffusion Models
Kuan-Hung Liu, Cheng-Kun Yang,
Min-Hung Chen
, Yu-Lun Liu, Yen-Yu Lin
2024
DoRA: Weight-Decomposed Low-Rank Adaptation
Shih-Yang Liu, Chien-Yi Wang,
Hongxu Danny Yin
,
Pavlo Molchanov
,
Frank Wang
, Kwang-Ting Cheng,
Min-Hung Chen
ICML
Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models
Huan Ling, Seung Wook Kim, Antonio Torralba, Sanja Fidler,
Karsten Kreis
CVPR
FasterViT: Fast Vision Transformers with Hierarchical Attention
Ali Hatamizadeh
,
Greg Heinrich
,
Hongxu Danny Yin
, Andrew Tao, Jose M. Alvarez,
Jan Kautz
,
Pavlo Molchanov
ICLR
WildFusion: Learning 3D-Aware Latent Diffusion Models in View Space
Katja Schwarz, Seung Wook Kim, Jun Gao, Sanja Fidler, Andreas Geiger,
Karsten Kreis
ICLR
2023
Point-Cloud Completion with Pretrained Text-to-image Diffusion Models
Yoni Kasten
, Ohad Rahamim,
Gal Chechik
NeurIPS
SceneScape: Text-Driven Consistent Scene Generation
Rafail Fridman, Amit Abecasis,
Yoni Kasten
, Tali Dekel
NeurIPS
XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies
Xuanchi Ren, Jiahui Huang, Xiaohui Zeng, Ken Museth, Sanja Fidler, Francis Williams
CVPR
DreamTeacher: Pretraining Image Backbones with Deep Generative Models
Daiqing Li, Huan Ling, Amlan Kar, David Acuna, Seung Wook Kim,
Karsten Kreis
, Antonio Torralba, Sanja Fidler
ICCV
2D-3D Interlaced Transformer for Point Cloud Segmentation with Scene-Level Supervision
Cheng-Kun Yang,
Min-Hung Chen
, Yung-Yu Chaung, Yen-Yu Lin
ICCV
ATT3D: Amortized Text-To-3D Object Synthesis
Jonathan Lorraine, Kevin Xie, Xiaohui Zeng,
Chen-Hsuan Lin
, Towaki Takikawa, Nicholas Sharp,
Tsung-Yi Lin
,
Ming-Yu Liu
, Sanja Fidler, James Lucas
ICCV
Syntactic Binding in Diffusion Models: Enhancing Attribute Correspondence through Attention Map Alignment
Royi Rassin, Eran Hirsch, Daniel Glickman, Shauli Ravfogel, Yoav Goldberg,
Gal Chechik
NeurIPS
Oral presentation
Norm-guided latent space exploration for text-to-image generation
Dvir Samuel, Rami Ben-Ari, Nir Darshan,
Haggai Maron
,
Gal Chechik
NeurIPS
Online Overexposed Pixels Hallucination in Videos with Adaptive Reference Frame Selection
Yazhou Xing,
Amrita Mazumdar
, Anjul Patney,
Chao Liu
,
Hongxu Danny Yin
, Qifeng Chen,
Jan Kautz
,
Iuri Frosio
CORL
Differentially Private Diffusion Models
Tim Dockhorn, Tianshi Cao,
Arash Vahdat
,
Karsten Kreis
Flexible Isosurface Extraction for Gradient-Based Mesh Optimization
Tianchang Shen,
Jacob Munkberg
,
Jon Hasselgren
, Kangxue Yin, Zian Wang, Wenzheng Chen, Zan Gojcic, Sanja Fidler, Nicholas Sharp, Jun Gao
SIGGRAPH
Learning Physically Simulated Tennis Players from Broadcast Videos
Haotian Zhang,
Ye Yuan
, Viktor Makoviychuk, Yunrong Guo, Sanja Fidler, Jason Peng, Kayvon Fatahalian
SIGGRAPH
Live 3D Portrait: Real-Time Radiance Fields for Single-Image Portrait View Synthesis
Alexander Trevithick, Matthew Chan,
Michael Stengel
, Eric R. Chan,
Chao Liu
,
Zhiding Yu
, Sameh Khamis, Manmohan Chandraker, Ravi Ramamoorthi,
Koki Nagano
SIGGRAPH
SSIF: Single-shot Implicit Morphable Faces With Consistent Texture Parameterization
Connor Zhizhen Lin,
Koki Nagano
,
Jan Kautz
, Eric R. Chan,
Umar Iqbal
, Leonidas Guibas, Gordon Wetzstein, Sameh Khamis
SIGGRAPH
Global Context Vision Transformers
Ali Hatamizadeh
,
Hongxu Danny Yin
,
Greg Heinrich
,
Jan Kautz
,
Pavlo Molchanov
ICML
Task-Aware Risk Estimation of Perception Failures for Autonomous Vehicles
Pasquale Antonante,
Sushant Veer
,
Karen Leung
,
Xinshuo Weng
, Luca Carlone,
Marco Pavone
RSS
Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models
Andreas Blattmann, Robin Rombach, Huan Ling, Tim Dockhorn, Seung Wook Kim, Sanja Fidler,
Karsten Kreis
CVPR
FreeNeRF: Improving Few-shot Neural Rendering with Free Frequency Regularization
Jiawei Yang,
Marco Pavone
,
Yue Wang
CVPR
Object Pose Estimation with Statistical Guarantees: Conformal Keypoint Detection and Geometric Uncertainty Propagation
Heng Yang
,
Marco Pavone
CVPR
Selected as a Highlight Paper
NeuralField-LDM: Scene Generation with Hierarchical Latent Diffusion Models
Seung Wook Kim, Bradley Brown, Kangxue Yin,
Karsten Kreis
, Katja Schwarz, Daiqing Li, Robin Rombach, Antonio Torralba, Sanja Fidler
CVPR
Learning Human-to-Robot Handovers from Point Clouds
Sammy Christen,
Wei Yang
,
Claudia Pérez D’Arpino
, Otmar Hilliges,
Dieter Fox
,
Yu-Wei Chao
CVPR
Highlight
Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models
Jiarui Xu,
Sifei Liu
,
Arash Vahdat
,
Wonmin Byeon
, Xiaolong Wang,
Shalini De Mello
CVPR
Hightlight top 10%
Zero-shot Pose Transfer for Unrigged Stylized 3D Characters
Jiashun Wang,
Xueting Li
,
Sifei Liu
,
Shalini De Mello
,
Orazio Gallo
, Xiaolong Wang,
Jan Kautz
CVPR
Neuralangelo: High-Fidelity Neural Surface Reconstruction
Max Zhaoshuo Li
,
Thomas Müller
, Alex Evans, Russell H. Taylor, Mathias Unberath,
Ming-Yu Liu
,
Chen-Hsuan Lin
CVPR
The Best Inventions of 2023, TIME Magazine
Pagination
Current page
1
Page
2
Page
3
Page
4
Page
5
Page
6
Next page
Next ›
Last page
Last »