Research Labs
All Research Labs
3D Deep Learning
Applied Research
Autonomous Vehicles
Deep Imagination
Publications
AI Playground
New and Featured
AI Art Gallery
NGC Demos
Research Areas
AI & Machine Learning
3D Deep Learning
Computer Vision
Robotics
All Areas
Careers
Academic Collaborations
Government Collaborations
Graduate Fellowship
Internships
Research Openings
Research Scientists
Meet the Team
Licensing
Skip to main content
Publications
Our publications provide insight into some of our leading-edge research.
Filters
Search
Apply
Filters
Filters
Publication Year
2025
(29)
2024
(28)
2023
(48)
2022
(52)
2021
(31)
2020
(48)
2019
(37)
2018
(43)
2017
(19)
2016
(6)
2015
(7)
2014
(2)
2013
(3)
2012
(3)
2011
(1)
2010
(1)
Facet Publication Year
Research Areas
Computer Vision
(359)
Artificial Intelligence and Machine Learning
(211)
Generative AI
(62)
Robotics
(57)
Computer Graphics
(52)
Autonomous Vehicles
(20)
Computational Photography and Imaging
(20)
VR, AR and Display Technology
(19)
Human Computer Interaction
(15)
Applied Perception
(12)
Medical
(7)
Real-Time Rendering
(7)
Natural Language Processing
(6)
Resilience and Safety
(6)
Algorithms and Numerical Methods
(5)
Hyperscale Graphics
(5)
Physical AI
(4)
High Performance Computing
(3)
Esports
(2)
World Simulation
(2)
Computer Architecture
(1)
Speech Processing
(1)
Events
CORL
(6)
CVPR
(55)
ECCV
(7)
ICCV
(8)
ICLR
(12)
ICML
(3)
ICRA
(17)
IROS
(7)
NeurIPS
(23)
RSS
(3)
SIGGRAPH
(11)
359 results found
Computer Vision
Clear all
Computer Vision
2025
Play4D: Accelerated and Interactive Free-viewpoint Video Streaming for Virtual Reality and Light Field Displays
Jonghyun Kim
,
Michael Stengel
,
Amrita Mazumdar
,
Tianye Li
,
Cheng Sun
,
David Luebke
,
Shalini De Mello
SIGGRAPH
RaySt3R: Predicting Novel Depth Maps for Zero-Shot Object Completion
Bardienus P. Duisterhof, Jan Oberst,
Bowen Wen
,
Stan Birchfield
, Deva Ramanan, Jeffrey Ichnowski
NeurIPS
Attention on the Sphere
Boris Bonev
, Max Rietmann, Andrea Paris, Alberto Carpentieri, Thorsten Kurth
NeurIPS
Seeing What Matters: Generalizable AI-generated Video Detection with Forensic-Oriented Augmentation
Riccardo Corvi, Davide Cozzolino,
Ekta Prashnani
,
Shalini De Mello
,
Koki Nagano
, Luisa Verdoliva
NeurIPS
Alpamayo-R1: Bridging Reasoning and Action Prediction for Generalizable Autonomous Driving in the Long Tail
Marco Pavone
, Many other contributors found on Page 33
Task-Oriented Human Grasp Synthesis via Context- and Task-Aware Diffusers
An-Lun Liu,
Yu-Wei Chao
, Yi-Ting Chen
ICCV
Pedestrian Collision Avoidance in Hemianopia during Natural Walking in Immersive Virtual Reality
Jonathan K. Doyon, Sujin Kim, Alex D. Hwang,
Jae-Hyun Jung
Real-time 3D Visualization of Radiance Fields on Light Field Displays
Jonghyun Kim
,
Cheng Sun
,
Michael Stengel
, Matthew Chan, Andrew Russell, Jaehyun Jung, Wil Braithewaite,
Shalini De Mello
,
David Luebke
GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control
Xuanchi Ren, Tianchang Shen, Jiahui Huang, Huan Ling, Yifan Lu,
Merlin Nimier-David
,
Thomas Müller
,
Alex Keller
,
Sanja Fidler
, Jun Gao
CVPR
Radiance Surfaces: Optimizing Surface Representations with a 5D Radiance Field Loss
Ziyi Zhang, Nicolas Roussel,
Thomas Müller
,
Tizian Zeltner
,
Merlin Nimier-David
,
Fabrice Rousselle
, Wenzel Jakob
SIGGRAPH
Identity-Motion Trade-offs in Text-to-Video Generation
Yuval Atzmon
, Rinon Gal,
Yoad Tewel
,
Yoni Kasten
,
Gal Chechik
FoundationStereo: Zero-Shot Stereo Matching
Bowen Wen
, Matthew Trepte, Joseph Aribido,
Jan Kautz
, Orazio Gallo,
Stan Birchfield
CVPR
Best Paper Nomination
Adapting to the Unknown: Training-Free Audio-Visual Event Perception with Dynamic Thresholds
Eitan Shaar, Ariel Shaulov,
Gal Chechik
, Lior Wolf
CVPR
RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression
Uri Gadot, Assaf Shocher,
Shie Mannor
,
Gal Chechik
,
Assaf Hallak
CVPR
TriTex: Learning Texture from a Single Mesh via Triplane Semantic Features
Dana Cohen-Bar, Daniel Cohen-Or,
Gal Chechik
,
Yoni Kasten
CVPR
BLADE: Single-view Body Mesh Estimation through Accurate Depth Estimation
Shengze Wang,
Jiefeng Li
,
Tianye Li
,
Ye Yuan
, Henry Fuchs,
Koki Nagano
,
Shalini De Mello
,
Michael Stengel
CVPR
Coherent 3D Portrait Video Reconstruction via Triplane Fusion
Shengze Wang,
Xueting Li
,
Chao Liu
, Matthew Chan,
Michael Stengel
, Henry Fuchs,
Shalini De Mello
,
Koki Nagano
CVPR
SimAvatar: Simulation-Ready Clothed Gaussian Avatars from Text
Xueting Li
,
Ye Yuan
,
Shalini De Mello
, Gilles Daviet, Jonathan Leaf, Miles Macklin,
Jan Kautz
,
Umar Iqbal
CVPR
GRS: Generating robotic simulation tasks from real-world images
Alex Zook
,
Josef Spjut
,
Jonathan Tremblay
CVPR
MambaVision: A Hybrid Mamba-Transformer Vision Backbone
Ali Hatamizadeh
,
Jan Kautz
CVPR
RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for Robotics
Chan Hee Song,
Valts Blukis
,
Jonathan Tremblay
,
Stephen Tyree
, Yu Su,
Stan Birchfield
CVPR
SPOT: SE(3) Pose Trajectory Diffusion for Object-Centric Manipulation
Cheng-Chun Hsu,
Bowen Wen
,
Jie Xu
,
Yashraj Narang
, ,
Yuke Zhu
, Joydeep Biswas,
Stan Birchfield
ICRA
AI 3D Selfie: Real-Time Single-Image 3D Face Reconstruction for Light-Field Displays
Jonghyun Kim
,
Michael Stengel
, Matthew Chan,
Koki Nagano
,
Shalini De Mello
,
David Luebke
LongVILA: Scaling Long-Context Visual Language Models for Long Videos
Yukang Chen
, Fuzhao Xue, Dacheng Li, Qinghao Hu,
Ligeng Zhu
, Xiuyu Li, Yunhao Fang, Haotian Tang, Shang Yang, Zhijian Liu, Ethan He, Hongxu Yin,
Pavlo Molchanov
,
Jan Kautz
, Linxi Fan,
Yuke Zhu
, Yao Lu (Jason),
Song Han
ICLR
LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models
Zhengyi Wang, Jonathan Lorraine, Yikai Wang, Hang Su, Jun Zhu, Sanja Fidler,
Xiaohui Zeng
Multi-student Diffusion Distillation for Better One-step Generators
Yanke Song, Jonathan Lorraine,
Weili Nie
,
Karsten Kreis
, James Lucas
ICML
Spatio-Temporal Context Prompting for Zero-Shot Action Detection
Wei-Jhe Huang,
Min-Hung Chen
, Shang-Hong Lai
Semantic Prompt Learning for Weakly-Supervised Semantic Segmentation
Ci-Siang Lin, Chien-Yi Wang,
Frank Wang
,
Min-Hung Chen
CorrFill: Enhancing Faithfulness in Reference-based Inpainting with Correspondence Guidance in Diffusion Models
Kuan-Hung Liu, Cheng-Kun Yang,
Min-Hung Chen
, Yu-Lun Liu, Yen-Yu Lin
2024
L4GM: Large 4D Gaussian Reconstruction Model
Jiawei Ren, Kevin Xie, Ashkan Mirzaei, Hanxue Liang, Xiaohui Zeng,
Karsten Kreis
, Ziwei Liu, Antonio Torralba, Sanja Fidler, Seung Wook Kim, Huan Ling
NeurIPS
Warped Diffusion: Solving Video Inverse Problems with Image Diffusion Models
Giannis Daras,
Weili Nie
,
Karsten Kreis
, Alexandros G. Dimakis,
Morteza Mardani
,
Nikola Kovachki
,
Arash Vahdat
NeurIPS
QUEEN: QUantized Efficient ENcoding for Streaming Free-viewpoint Videos
Sharath Girish,
Tianye Li
,
Amrita Mazumdar
, Abhinav Shrivastava,
David Luebke
,
Shalini De Mello
NeurIPS
Pagination
Current page
1
Page
2
Page
3
Page
4
Page
5
Page
6
Page
7
Page
8
Page
9
…
Next page
Next ›
Last page
Last »