Artificial Intelligence Computing Leadership from NVIDIA
Publications
Our publications provide insight into some of our leading-edge research.
150 results found
Generative AI
2025
Elucidated Rolling Diffusion Models for Probabilistic Weather Forecasting
Salva Rühling Cachay, Miika Aittala, Karsten Kreis, Noah Brenowitz, Arash Vahdat, Morteza Mardani, Rose Yu
NeurIPS
Align Your Flow: Scaling Continuous-Time Flow Map Distillation
Amirmojtaba Sabour, Sanja Fidler, Karsten Kreis
NeurIPS
ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning
Chi-Pin Huang, Yueh-Hua Wu, Min-Hung Chen, Frank Wang, Fred Yang
NeurIPS
Seeing What Matters: Generalizable AI-generated Video Detection with Forensic-Oriented Augmentation
Riccardo Corvi, Davide Cozzolino, Ekta Prashnani, Shalini De Mello, Koki Nagano, Luisa Verdoliva
NeurIPS
Alpamayo-R1: Bridging Reasoning and Action Prediction for Generalizable Autonomous Driving in the Long Tail
Marco Pavone, many other contributors found on Page 33
VoiceNoNG: Robust High-Quality Speech Editing Model without Hallucinations
Sung-Feng Huang, Heng-Cheng Kuo, Zhehuai Chen, Xuesong Yang, Pin-Jui Ku, Ante Jukić, Huck Yang, Yu Tsao, Frank Wang, Hung-yi Lee, Szu-Wei Fu
Assessing Learned Models for Phase-only Hologram Compression
Zicong Peng, Yicheng Zhan, Josef Spjut, Kaan Akşit
SIGGRAPH
GAIA: Generative Animatable Interactive Avatars with Expression-conditioned Gaussians
Zhengming Yu, Tianye Li, Jingxiang Sun, Omer Shapira, Seonwook Park, Michael Stengel, Matthew Chan, Xin Li, Wenping Wang, Koki Nagano, Shalini De Mello
SIGGRAPH
Fly, Fail, Fix: Iterative Game Repair with Reinforcement Learning and Large Multimodal Models
Alex Zook, Josef Spjut, Jonathan Tremblay
Identity-Motion Trade-offs in Text-to-Video Generation
Yuval Atzmon, Rinon Gal, Yoad Tewel, Yoni Kasten, Gal Chechik
FourCastNet 3: A geometric approach to probabilistic machine-learning weather forecasting at scale
Boris Bonev, Thorsten Kurth, Ankur Mahesh, Mauro Bisson, Jean Kossaifi, Karthik Kashinath, Anima Anandkumar, William D. Collins, Mike Pritchard, Alex Keller
GenMol: A Drug Discovery Generalist with Discrete Diffusion
Seul Lee, Karsten Kreis, Srimukh Prasad Veccham, Meng Liu, Danny Reidenbach, Yuxing Peng, Saee Paliwal, Weili Nie, Arash Vahdat
ICML
Efficient Molecular Conformer Generation with SO(3)-Averaged Flow Matching and Reflow
Zhonglin Cao, Mario Geiger, Allan Dos Santos Costa, Danny Reidenbach, Karsten Kreis, Tomas Geffner, Franco Pellegrini, Guoqing Zhou, Emine Kucukbenli
ICML
Score-based Diffusion Models in Function Space
Jae Hyun Lim, Nikola Kovachki, Ricardo Baptista, Christopher Beckham, Kamyar Azizzadenesheli, Jean Kossaifi, Vikram Voleti, Jiaming Song, Karsten Kreis, Jan Kautz, Christopher Pal, Arash Vahdat, Anima Anandkumar
Make It Count: Text-to-Image Generation with an Accurate Number of Objects
Lital Binyamin, Yoad Tewel, Eran Hirsch, Royi Rassin, Gal Chechik
CVPR
Coherent 3D Portrait Video Reconstruction via Triplane Fusion
Shengze Wang, Xueting Li, Chao Liu, Matthew Chan, Michael Stengel, Henry Fuchs, Shalini De Mello, Koki Nagano
CVPR
SimAvatar: Simulation-Ready Clothed Gaussian Avatars from Text
Xueting Li, Ye Yuan, Shalini De Mello, Gilles Daviet, Jonathan Leaf, Miles Macklin, Jan Kautz, Umar Iqbal
CVPR
A Generative AI Game Jam Case Study from October 2024
Josef Spjut
CVPR
Beyond the Buzz: A Pragmatic Take on Inference Disaggregation
Tiyasa Mitra, Ritika Borkar, Nidhi Bhatia, Ramon Matas, Shivam Raj, Dheevatsa Mudigere, Ritchie Zhao, Maximilian Golub, Arpan Dutta, Sailaja Madduri, Dharmesh Jani, Brian Pharris, Bita Darvish Rouhani
Inference-Time Policy Steering through Human Interactions
Yanwei Wang, Lirui Wang, Yilun Du, Balakumar Sundaralingam, Xuning Yang, Yu-Wei Chao, Claudia Pérez D’Arpino, Dieter Fox, Julie Shah
ICRA
Score Distillation Sampling for Audio: Source Separation, Synthesis, and Beyond
Jessie Richter-Powell, Antonio Torralba, Jonathan Lorraine
ICML
Fugatto 1 - Foundational Generative Audio Transformer Opus 1
Rafael Valle, Rohan Badlani, Zhifeng Kong, Sang-gil Lee, Arushi Goel, Sungwon Kim, Joao Felipe Santos, Shuqi Dai, Siddharth Gururani, Aya AlJa'fari, Alex Liu, Kevin Shih, Wei Ping, Huck Yang, Bryan Catanzaro
ICLR
UniWav: Towards Unified Pre-training for Speech Representation Learning and Generation
Alexander H. Liu, Sang-gil Lee, Huck Yang, Yuan Gong, Frank Wang, James R. Glass, Rafael Valle
ICLR
Lightning-Fast Image Inversion and Editing for Text-to-Image Diffusion Models
Dvir Samuel, Barak Meiri, Haggai Maron, Yoad Tewel, Nir Darshan, Shai Avidan, Gal Chechik, Rami Ben-Ari
ICLR
Cosmos Transfer 1: World-to-World Transfer with Adaptive Multi-Control for Physical AI
Ming-Yu Liu
Cosmos-Reason 1: From Physical AI Common Sense to Embodied Decisions
Tsung-Yi Lin, Ming-Yu Liu
NVIDIA Isaac GR00T N1: An Open Foundation Model for Humanoid Robots
Yuke Zhu, Linxi "Jim" Fan, NVIDIA GEAR Team
LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models
Zhengyi Wang, Jonathan Lorraine, Yikai Wang, Hang Su, Jun Zhu, Sanja Fidler, Xiaohui Zeng
Multi-student Diffusion Distillation for Better One-step Generators
Yanke Song, Jonathan Lorraine, Weili Nie, Karsten Kreis, James Lucas
ICML
CorrFill: Enhancing Faithfulness in Reference-based Inpainting with Correspondence Guidance in Diffusion Models
Kuan-Hung Liu, Cheng-Kun Yang, Min-Hung Chen, Yu-Lun Liu, Yen-Yu Lin
Energy-Based Diffusion Language Models for Text Generation
Minkai Xu, Tomas Geffner, Karsten Kreis, Weili Nie, Yilun Xu, Jure Leskovec, Stefano Ermon, Arash Vahdat
ICLR
Truncated Consistency Models
Sangyun Lee, Yilun Xu, Tomas Geffner, Giulia Fanti, Karsten Kreis, Arash Vahdat, Weili Nie
ICLR