Research Labs
All Research Labs
3D Deep Learning
Applied Research
Autonomous Vehicles
Deep Imagination
Publications
AI Playground
New and Featured
AI Art Gallery
NGC Demos
Research Areas
AI & Machine Learning
3D Deep Learning
Computer Vision
Robotics
All Areas
Careers
Academic Collaborations
Government Collaborations
Graduate Fellowship
Internships
Research Openings
Research Scientists
Meet the Team
Licensing
Skip to main content
Artificial Intelligence Computing Leadership from NVIDIA
Login
Research Labs
All Research Labs
3D Deep Learning
Applied Research
Autonomous Vehicles
Deep Imagination
Publications
AI Playground
New and Featured
AI Art Gallery
NGC Demos
Research Areas
AI & Machine Learning
3D Deep Learning
Computer Vision
Robotics
All Areas
Careers
Academic Collaborations
Government Collaborations
Graduate Fellowship
Internships
Research Openings
Research Scientists
Meet the Team
Licensing
Search
Search
Enter the terms you wish to search for.
Publications
Our publications provide insight into some of our leading-edge research.
Filters
Search
Apply
Filters
Filters
Publication Year
2025
(8)
2024
(15)
2023
(19)
2022
(8)
2021
(5)
2020
(4)
2018
(2)
Facet Publication Year
Research Areas
Computer Vision
(62)
Generative AI
(62)
Artificial Intelligence and Machine Learning
(54)
Computer Graphics
(16)
Natural Language Processing
(3)
Applied Perception
(2)
VR, AR and Display Technology
(2)
Algorithms and Numerical Methods
(1)
Autonomous Vehicles
(1)
Computational Photography and Imaging
(1)
Physical AI
(1)
Robotics
(1)
Events
CVPR
(18)
ECCV
(2)
ICCV
(2)
ICLR
(7)
ICML
(2)
ICRA
(1)
NeurIPS
(12)
SIGGRAPH
(4)
62 results found
Computer Vision
Generative AI
Clear all
Computer Vision
Generative AI
2025
Seeing What Matters: Generalizable AI-generated Video Detection with Forensic-Oriented Augmentation
Riccardo Corvi, Davide Cozzolino,
Ekta Prashnani
,
Shalini De Mello
,
Koki Nagano
, Luisa Verdoliva
NeurIPS
Alpamayo-R1: Bridging Reasoning and Action Prediction for Generalizable Autonomous Driving in the Long Tail
Marco Pavone
, Many other contributors found on Page 33
Identity-Motion Trade-offs in Text-to-Video Generation
Yuval Atzmon
, Rinon Gal,
Yoad Tewel
,
Yoni Kasten
,
Gal Chechik
Coherent 3D Portrait Video Reconstruction via Triplane Fusion
Shengze Wang,
Xueting Li
,
Chao Liu
, Matthew Chan,
Michael Stengel
, Henry Fuchs,
Shalini De Mello
,
Koki Nagano
CVPR
SimAvatar: Simulation-Ready Clothed Gaussian Avatars from Text
Xueting Li
,
Ye Yuan
,
Shalini De Mello
, Gilles Daviet, Jonathan Leaf, Miles Macklin,
Jan Kautz
,
Umar Iqbal
CVPR
LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models
Zhengyi Wang, Jonathan Lorraine, Yikai Wang, Hang Su, Jun Zhu, Sanja Fidler,
Xiaohui Zeng
Multi-student Diffusion Distillation for Better One-step Generators
Yanke Song, Jonathan Lorraine,
Weili Nie
,
Karsten Kreis
, James Lucas
ICML
CorrFill: Enhancing Faithfulness in Reference-based Inpainting with Correspondence Guidance in Diffusion Models
Kuan-Hung Liu, Cheng-Kun Yang,
Min-Hung Chen
, Yu-Lun Liu, Yen-Yu Lin
2024
L4GM: Large 4D Gaussian Reconstruction Model
Jiawei Ren, Kevin Xie, Ashkan Mirzaei, Hanxue Liang, Xiaohui Zeng,
Karsten Kreis
, Ziwei Liu, Antonio Torralba, Sanja Fidler, Seung Wook Kim, Huan Ling
NeurIPS
Warped Diffusion: Solving Video Inverse Problems with Image Diffusion Models
Giannis Daras,
Weili Nie
,
Karsten Kreis
, Alexandros G. Dimakis,
Morteza Mardani
,
Nikola Kovachki
,
Arash Vahdat
NeurIPS
From Descriptive Richness to Bias: Unveiling the Dark Side of Generative Image Caption Enrichment
Yusuke Hirota,
Ryo Hachiuma
,
Huck Yang
, Yuta Nakashima
TurboEdit: Text-Based Image Editing Using Few-Step Diffusion Models
Gilad Deutch, Rinon Gal, Daniel Garibi, Or Patashnik, Daniel Cohen-Or
SIGGRAPH
DoRA: Weight-Decomposed Low-Rank Adaptation
Shih-Yang Liu, Chien-Yi Wang,
Hongxu Danny Yin
,
Pavlo Molchanov
,
Frank Wang
, Kwang-Ting Cheng,
Min-Hung Chen
ICML
Breathing Life Into Sketches Using Text-to-Video Priors
Rinon Gal, Yael Vinker, Yuval Alaluf, Amit Bermano, Daniel Cohen-Or, Ariel Shamir,
Gal Chechik
CVPR
Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models
Huan Ling, Seung Wook Kim, Antonio Torralba, Sanja Fidler,
Karsten Kreis
CVPR
Outdoor Scene Extrapolation with Hierarchical Generative Cellular Automata
Dongsu Zhang, Francis Williams, Zan Gojcic,
Karsten Kreis
, Sanja Fidler, Young Min Kim, Amlan Kar
CVPR
What You See is What You GAN: Rendering Every Pixel for High-Fidelity Geometry in 3D GANs
Alexander Trevithick
, Matthew Chan, Towaki Takikawa,
Umar Iqbal
,
Shalini De Mello
, Manmohan Chandraker, Ravi Ramamoorthi,
Koki Nagano
CVPR
RegionGPT: Towards Region Understanding Vision Language Model
Qiushan Guo,
Shalini De Mello
,
Hongxu Danny Yin
,
Wonmin Byeon
, Ka Chun Cheung, Yizhou Yu, Ping Luo,
Sifei Liu
CVPR
GAvatar: Animatable 3D Gaussian Avatars with Implicit Mesh Learning
Ye Yuan
,
Xueting Li
, Yangyi Huang,
Shalini De Mello
,
Koki Nagano
,
Jan Kautz
,
Umar Iqbal
CVPR
Highlight
WildFusion: Learning 3D-Aware Latent Diffusion Models in View Space
Katja Schwarz, Seung Wook Kim, Jun Gao, Sanja Fidler, Andreas Geiger,
Karsten Kreis
ICLR
3D Reconstruction with Generalizable Neural Fields using Scene Priors
Yang Fu,
Shalini De Mello
,
Xueting Li
, Amey Kulkarni,
Jan Kautz
, Xiaolong Wang,
Sifei Liu
ICLR
LCM-Lookahead for Encoder-based Text-to-Image Personalization
Rinon Gal, Or Lichter, Elad Richardson, Or Patashnik, Amit H Bermano,
Gal Chechik
, Daniel Cohen-Or
ECCV
Consolidating Attention Features for Multi-view Image Editing
Or Patashnik, Rinon Gal, Daniel Cohen-Or, Jun-Yan Zhu, Fernando De la Torre
SIGGRAPH
2023
Point-Cloud Completion with Pretrained Text-to-image Diffusion Models
Yoni Kasten
, Ohad Rahamim,
Gal Chechik
NeurIPS
SceneScape: Text-Driven Consistent Scene Generation
Rafail Fridman, Amit Abecasis,
Yoni Kasten
, Tali Dekel
NeurIPS
XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies
Xuanchi Ren, Jiahui Huang, Xiaohui Zeng, Ken Museth, Sanja Fidler, Francis Williams
CVPR
DreamTeacher: Pretraining Image Backbones with Deep Generative Models
Daiqing Li, Huan Ling, Amlan Kar, David Acuna, Seung Wook Kim,
Karsten Kreis
, Antonio Torralba, Sanja Fidler
ICCV
ATT3D: Amortized Text-To-3D Object Synthesis
Jonathan Lorraine, Kevin Xie, Xiaohui Zeng,
Chen-Hsuan Lin
, Towaki Takikawa, Nicholas Sharp,
Tsung-Yi Lin
,
Ming-Yu Liu
, Sanja Fidler, James Lucas
ICCV
Syntactic Binding in Diffusion Models: Enhancing Attribute Correspondence through Attention Map Alignment
Royi Rassin, Eran Hirsch, Daniel Glickman, Shauli Ravfogel, Yoav Goldberg,
Gal Chechik
NeurIPS
Oral presentation
Norm-guided latent space exploration for text-to-image generation
Dvir Samuel, Rami Ben-Ari, Nir Darshan,
Haggai Maron
,
Gal Chechik
NeurIPS
Differentially Private Diffusion Models
Tim Dockhorn, Tianshi Cao,
Arash Vahdat
,
Karsten Kreis
Flexible Isosurface Extraction for Gradient-Based Mesh Optimization
Tianchang Shen,
Jacob Munkberg
,
Jon Hasselgren
, Kangxue Yin, Zian Wang, Wenzheng Chen, Zan Gojcic, Sanja Fidler, Nicholas Sharp, Jun Gao
SIGGRAPH
Pagination
Current page
1
Page
2
Next page
Next ›
Last page
Last »