Publications
Our publications provide insight into some of our leading-edge research.
113 results found for Artificial Intelligence and Machine Learning, Generative AI
2025
Elucidated Rolling Diffusion Models for Probabilistic Weather Forecasting
Salva Rühling Cachay, Miika Aittala, Karsten Kreis, Noah Brenowitz, Arash Vahdat, Morteza Mardani, Rose Yu
NeurIPS
Align Your Flow: Scaling Continuous-Time Flow Map Distillation
Amirmojtaba Sabour, Sanja Fidler, Karsten Kreis
NeurIPS
ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning
Chi-Pin Huang, Yueh-Hua Wu, Min-Hung Chen, Frank Wang, Fred Yang
NeurIPS
Alpamayo-R1: Bridging Reasoning and Action Prediction for Generalizable Autonomous Driving in the Long Tail
Marco Pavone, many other contributors found on page 33
VoiceNoNG: Robust High-Quality Speech Editing Model without Hallucinations
Sung-Feng Huang, Heng-Cheng Kuo, Zhehuai Chen, Xuesong Yang, Pin-Jui Ku, Ante Jukić, Huck Yang, Yu Tsao, Frank Wang, Hung-yi Lee, Szu-Wei Fu
Assessing Learned Models for Phase-only Hologram Compression
Zicong Peng, Yicheng Zhan, Josef Spjut, Kaan Akşit
SIGGRAPH
Fly, Fail, Fix: Iterative Game Repair with Reinforcement Learning and Large Multimodal Models
Alex Zook, Josef Spjut, Jonathan Tremblay
FourCastNet 3: A geometric approach to probabilistic machine-learning weather forecasting at scale
Boris Bonev, Thorsten Kurth, Ankur Mahesh, Mauro Bisson, Jean Kossaifi, Karthik Kashinath, Anima Anandkumar, William D. Collins, Mike Pritchard, Alex Keller
GenMol: A Drug Discovery Generalist with Discrete Diffusion
Seul Lee, Karsten Kreis, Srimukh Prasad Veccham, Meng Liu, Danny Reidenbach, Yuxing Peng, Saee Paliwal, Weili Nie, Arash Vahdat
ICML
Efficient Molecular Conformer Generation with SO(3)-Averaged Flow Matching and Reflow
Zhonglin Cao, Mario Geiger, Allan Dos Santos Costa, Danny Reidenbach, Karsten Kreis, Tomas Geffner, Franco Pellegrini, Guoqing Zhou, Emine Kucukbenli
ICML
Score-based Diffusion Models in Function Space
Jae Hyun Lim, Nikola Kovachki, Ricardo Baptista, Christopher Beckham, Kamyar Azizzadenesheli, Jean Kossaifi, Vikram Voleti, Jiaming Song, Karsten Kreis, Jan Kautz, Christopher Pal, Arash Vahdat, Anima Anandkumar
Make It Count: Text-to-Image Generation with an Accurate Number of Objects
Lital Binyamin, Yoad Tewel, Eran Hirsch, Royi Rassin, Gal Chechik
CVPR
Beyond the Buzz: A Pragmatic Take on Inference Disaggregation
Tiyasa Mitra, Ritika Borkar, Nidhi Bhatia, Ramon Matas, Shivam Raj, Dheevatsa Mudigere, Ritchie Zhao, Maximilian Golub, Arpan Dutta, Sailaja Madduri, Dharmesh Jani, Brian Pharris, Bita Darvish Rouhani
Score Distillation Sampling for Audio: Source Separation, Synthesis, and Beyond
Jessie Richter-Powell, Antonio Torralba, Jonathan Lorraine
ICML
Lightning-Fast Image Inversion and Editing for Text-to-Image Diffusion Models
Dvir Samuel, Barak Meiri, Haggai Maron, Yoad Tewel, Nir Darshan, Shai Avidan, Gal Chechik, Rami Ben-Ari
ICLR
NVIDIA Isaac GR00T N1: An Open Foundation Model for Humanoid Robots
Yuke Zhu, Linxi "Jim" Fan, NVIDIA GEAR Team
LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models
Zhengyi Wang, Jonathan Lorraine, Yikai Wang, Hang Su, Jun Zhu, Sanja Fidler, Xiaohui Zeng
Multi-student Diffusion Distillation for Better One-step Generators
Yanke Song, Jonathan Lorraine, Weili Nie, Karsten Kreis, James Lucas
ICML
CorrFill: Enhancing Faithfulness in Reference-based Inpainting with Correspondence Guidance in Diffusion Models
Kuan-Hung Liu, Cheng-Kun Yang, Min-Hung Chen, Yu-Lun Liu, Yen-Yu Lin
Energy-Based Diffusion Language Models for Text Generation
Minkai Xu, Tomas Geffner, Karsten Kreis, Weili Nie, Yilun Xu, Jure Leskovec, Stefano Ermon, Arash Vahdat
ICLR
Truncated Consistency Models
Sangyun Lee, Yilun Xu, Tomas Geffner, Giulia Fanti, Karsten Kreis, Arash Vahdat, Weili Nie
ICLR
Proteina: Scaling Flow-based Protein Structure Generative Models
Tomas Geffner, Kieran Didi, Zuobai Zhang, Danny Reidenbach, Zhonglin Cao, Jason Yim, Mario Geiger, Christian Dallago, Emine Kucukbenli, Arash Vahdat, Karsten Kreis
ICLR
ProtComposer: Compositional Protein Structure Generation with 3D Ellipsoids
Hannes Stark, Bowen Jing, Tomas Geffner, Jason Yim, Tommi Jaakkola, Arash Vahdat, Karsten Kreis
ICLR
Directed Graph Generation with Heat Kernels
Marc T. Law, Karsten Kreis, Haggai Maron
Cosmos World Foundation Model Platform for Physical AI
Ming-Yu Liu, many other contributors at https://d1qx31qr3h6wln.cloudfront.net/publications/NVIDIA%20Cosmos_4.pdf
2024
Aligning Target-Aware Molecule Diffusion Models with Exact Energy Optimization
Siyi Gu, Minkai Xu, Alexander Powers, Weili Nie, Tomas Geffner, Karsten Kreis, Jure Leskovec, Arash Vahdat, Stefano Ermon
NeurIPS
Molecule Generation with Fragment Retrieval Augmentation
Seul Lee, Karsten Kreis, Srimukh Prasad Veccham, Meng Liu, Danny Reidenbach, Saee Paliwal, Arash Vahdat, Weili Nie
NeurIPS
L4GM: Large 4D Gaussian Reconstruction Model
Jiawei Ren, Kevin Xie, Ashkan Mirzaei, Hanxue Liang, Xiaohui Zeng, Karsten Kreis, Ziwei Liu, Antonio Torralba, Sanja Fidler, Seung Wook Kim, Huan Ling
NeurIPS
Warped Diffusion: Solving Video Inverse Problems with Image Diffusion Models
Giannis Daras, Weili Nie, Karsten Kreis, Alexandros G. Dimakis, Morteza Mardani, Nikola Kovachki, Arash Vahdat
NeurIPS
FactorSim: Generative Simulation via Factorized Representation
Fan-Yun Sun, S. I. Harini, Angela Yi, Yihan Zhou, Alex Zook, Jonathan Tremblay, Logan Cross, Jiajun Wu, Nick Haber
NeurIPS
Diffusion-Reward Adversarial Imitation Learning
Chun-Mao Lai, Hsiang-Chun Wang, Ping-Chun Hsieh, Frank Wang, Min-Hung Chen, Shao-Hua Sun
NeurIPS
MaskedMimic: Unified Physics-Based Character Control Through Masked Motion Inpainting
Chen Tessler, Kelly Guo, Ofir Nabati, Gal Chechik, Jason Peng
SIGGRAPH