Research Labs
All Research Labs
Spatial Intelligence
Applied Research
Autonomous Vehicles
Deep Imagination
Publications
AI Playground
New and Featured
AI Art Gallery
NGC Demos
Research Areas
AI & Machine Learning
3D Deep Learning
Computer Vision
Robotics
All Areas
Careers
Academic Collaborations
Government Collaborations
Graduate Fellowship
Internships
Research Openings
Research Scientists
Meet the Team
Licensing
Skip to main content
Artificial Intelligence Computing Leadership from NVIDIA
Login
Research Labs
All Research Labs
Spatial Intelligence
Applied Research
Autonomous Vehicles
Deep Imagination
Publications
AI Playground
New and Featured
AI Art Gallery
NGC Demos
Research Areas
AI & Machine Learning
3D Deep Learning
Computer Vision
Robotics
All Areas
Careers
Academic Collaborations
Government Collaborations
Graduate Fellowship
Internships
Research Openings
Research Scientists
Meet the Team
Licensing
Search
Search
Enter the terms you wish to search for.
Publications
Our publications provide insight into some of our leading-edge research.
Filters
Search
Apply
Filters
Filters
Publication Year
2026
(8)
2025
(39)
2024
(43)
2023
(39)
2022
(13)
2021
(7)
2020
(6)
2019
(1)
2018
(3)
Facet Publication Year
Research Areas
Generative AI
(161)
Artificial Intelligence and Machine Learning
(131)
Computer Vision
(66)
Computer Graphics
(42)
Robotics
(13)
Natural Language Processing
(11)
Autonomous Vehicles
(9)
Circuits and VLSI Design
(9)
Physical AI
(9)
Speech Processing
(9)
Applied Perception
(7)
VR, AR and Display Technology
(5)
Machine Translation
(4)
Algorithms and Numerical Methods
(3)
Climate Simulation
(3)
High Performance Computing
(2)
Human Computer Interaction
(2)
World Simulation
(2)
Computational Photography and Imaging
(1)
Computer Architecture
(1)
Esports
(1)
Medical
(1)
Resilience and Safety
(1)
Storage and Systems
(1)
Events
CVPR
(23)
ECCV
(4)
ICCV
(4)
ICLR
(18)
ICML
(8)
ICRA
(6)
NeurIPS
(27)
RSS
(1)
SIGGRAPH
(18)
161 results found
Generative AI
Clear all
Generative AI
2025
UniWav: Towards Unified Pre-training for Speech Representation Learning and Generation
Alexander H. Liu, Sang-gil Lee,
Huck Yang
, Yuan Gong,
Frank Wang
, James R. Glas, Rafael Valle
ICLR
Minitron-SSM: Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning
Ali Taghibakhshi, Sharath Turuvekere Sreenivas,
Saurav Muralidharan
, Marcin Chochowski, Yashaswi Karnati, Raviraj Joshi, Ameya Sunil Mahabaleshwarkar, Zijia Chen, Yoshi Suhara, Oluwatobi Olabiyi, Daniel Korzekwa, Mostofa Patwary, Mohammad Shoeybi,
Jan Kautz
, Bryan Catanzaro, Ashwath Aithal, Nima Tajbakhsh,
Pavlo Molchanov
NeurIPS
Lightning-Fast Image Inversion and Editing for Text-to-Image Diffusion Models,
Dvir Samuel, Barak Meiri,
Haggai Maron
,
Yoad Tewel
, Nir Darshan, Shai Avidan,
Gal Chechik
, Rami Ben-Ari
ICLR
Cosmos Transfer 1: World-to-World Transfer with Adaptive Multi-Control for Physical AI
Ming-Yu Liu
Cosmos-Reason 1: From Physical AI Common Sense to Embodied Decisions
Tsung-Yi Lin
,
Ming-Yu Liu
NVIDIA Isaac GR00T N1: An Open Foundation Model for Humanoid Robots
Yuke Zhu
,
Linxi "Jim" Fan
, NVIDIA GEAR Team
LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models
Zhengyi Wang, Jonathan Lorraine, Yikai Wang, Hang Su, Jun Zhu, Sanja Fidler, Xiaohui Zeng
Multi-student Diffusion Distillation for Better One-step Generators
Yanke Song, Jonathan Lorraine, Weili Nie,
Karsten Kreis
, James Lucas
ICML
CorrFill: Enhancing Faithfulness in Reference-based Inpainting with Correspondence Guidance in Diffusion Models
Kuan-Hung Liu, Cheng-Kun Yang,
Min-Hung Chen
, Yu-Lun Liu, Yen-Yu Lin
Energy-Based Diffusion Language Models for Text Generation
Minkai Xu,
Tomas Geffner
,
Karsten Kreis
, Weili Nie, Yilun Xu, Jure Leskovec, Stefano Ermon,
Arash Vahdat
ICLR
Truncated Consistency Models
Sangyun Lee, Yilun Xu,
Tomas Geffner
, Giulia Fanti,
Karsten Kreis
,
Arash Vahdat
, Weili Nie
ICLR
Proteina: Scaling Flow-based Protein Structure Generative Models
Tomas Geffner
,
Kieran Didi
, Zuobai Zhang, Danny Reidenbach, Zhonglin Cao, Jason Yim, Mario Geiger, Christian Dallago, Emine Kucukbenli,
Arash Vahdat
,
Karsten Kreis
ICLR
ProtComposer: Compositional Protein Structure Generation with 3D Ellipsoids
Hannes Stark, Bowen Jing,
Tomas Geffner
, Jason Yim, Tommi Jaakkola,
Arash Vahdat
,
Karsten Kreis
ICLR
Directed Graph Generation with Heat Kernels
Marc T. Law,
Karsten Kreis
,
Haggai Maron
Cosmos World Foundation Model Platform for Physical AI
Ming-Yu Liu
, Many other contributors at https://d1qx31qr3h6wln.cloudfront.net/publications/NVIDIA%20Cosmos_4.pdf,
Jing Zhang
2024
Aligning Target-Aware Molecule Diffusion Models with Exact Energy Optimization
Siyi Gu, Minkai Xu, Alexander Powers, Weili Nie,
Tomas Geffner
,
Karsten Kreis
, Jure Leskovec,
Arash Vahdat
, Stefano Ermon
NeurIPS
Molecule Generation with Fragment Retrieval Augmentation
Seul Lee,
Karsten Kreis
, Srimukh Prasad Veccham, Meng Liu, Danny Reidenbach, Saee Paliwal,
Arash Vahdat
, Weili Nie
NeurIPS
L4GM: Large 4D Gaussian Reconstruction Model
Jiawei Ren, Kevin Xie, Ashkan Mirzaei, Hanxue Liang, Xiaohui Zeng,
Karsten Kreis
, Ziwei Liu, Antonio Torralba, Sanja Fidler, Seung Wook Kim, Huan Ling
NeurIPS
Warped Diffusion: Solving Video Inverse Problems with Image Diffusion Models
Giannis Daras, Weili Nie,
Karsten Kreis
, Alexandros G. Dimakis,
Morteza Mardani
,
Nikola Kovachki
,
Arash Vahdat
NeurIPS
FactorSim: Generative Simulation via Factorized Representation
Fan-Yun Sun, S. I. Harini, Angela Yi, Yihan Zhou,
Alex Zook
,
Jonathan Tremblay
, Logan Cross, Jiajun Wu, Nick Haber
NeurIPS
Diffusion-Reward Adversarial Imitation Learning
Chun-Mao Lai, Hsiang-Chun Wang, Ping-Chun Hsieh,
Frank Wang
,
Min-Hung Chen
, Shao-Hua Sun
NeurIPS
Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models
Yuchen Hu, Chen Chen,
Huck Yang
, Chengwei Qin, Pin-Yu Chen, Eng Siong Chng, Chao Zhang
NeurIPS
MaskedMimic: Unified Physics-Based Character Control Through Masked Motion Inpainting
Chen Tessler
, Kelly Guo, Ofir Nabati,
Gal Chechik
, Jason Peng
SIGGRAPH
Detecting the Undetectable: Assessing the Efficacy of Current Spoof Detection Methods Against Seamless Speech Edits
Sung-Feng Huang
, Heng-Cheng Kuo, Zhehuai Chen, Xuesong Yang,
Huck Yang
, Yu Tsao,
Frank Wang
, Hung-yi Lee,
Szu-Wei Fu
DRC-Coder: Automated DRC Checker Code Generation Using LLM Autonomous Agent
Chen-Chia Chang,
Chia-Tung (Mark) Ho
, Yaguang Li, Yiran Chen, Mark Haoxing Ren
From Descriptive Richness to Bias: Unveiling the Dark Side of Generative Image Caption Enrichment
Yusuke Hirota,
Ryo Hachiuma
,
Huck Yang
, Yuta Nakashima
Avatar Fingerprinting for Authorized Use of Synthetic Talking-Head Videos
Ekta Prashnani
,
Koki Nagano
,
Shalini De Mello
,
David Luebke
, Orazio Gallo
ECCV
Learning to Move Like Professional Counter-Strike Players
David Durst, F. Xie, V. Sarukkai, Brennan Shacklett,
Iuri Frosio
,
Chen Tessler
,
Joohwan Kim
, C. Taylor, G. Bernstein, S. Choudhury, P. Hanrahan,, Kayvon Fatahalian
Kilometer-Scale Convection Allowing Model Emulation using Generative Diffusion Modeling
Jaideep Pathak
, Yair Cohen, Piyush Garg, Peter Harrington,
Noah Brenowitz
,
Dale Durran
,
Morteza Mardani
,
Arash Vahdat
, Shaoming Xu, Karthik Kashinath,
Mike Pritchard
VerilogCoder: Autonomous Verilog Coding Agents with Graph-based Planning and Abstract Syntax Tree (AST)-based Waveform Tracing Tool
Chia-Tung (Mark) Ho
, Mark Haoxing Ren,
Brucek Khailany
GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators
Yuchen Hu, Chen Chen,
Huck Yang
, Ruizhe Li, Zhehuai Chen, Eng Siong Chng
TurboEdit: Text-Based Image Editing Using Few-Step Diffusion Models
Gilad Deutch, Rinon Gal, Daniel Garibi, Or Patashnik, Daniel Cohen-Or
SIGGRAPH
Pagination
First page
« First
Previous page
‹ Previous
Page
1
Current page
2
Page
3
Page
4
Page
5
Page
6
Next page
Next ›
Last page
Last »