Research Labs
All Research Labs
3D Deep Learning
Applied Research
Autonomous Vehicles
Deep Imagination
Publications
AI Playground
New and Featured
AI Art Gallery
NGC Demos
Research Areas
AI & Machine Learning
3D Deep Learning
Computer Vision
Robotics
All Areas
Careers
Academic Collaborations
Government Collaborations
Graduate Fellowship
Internships
Research Openings
Research Scientists
Meet the Team
Licensing
Skip to main content
Artificial Intelligence Computing Leadership from NVIDIA
Login
Research Labs
All Research Labs
3D Deep Learning
Applied Research
Autonomous Vehicles
Deep Imagination
Publications
AI Playground
New and Featured
AI Art Gallery
NGC Demos
Research Areas
AI & Machine Learning
3D Deep Learning
Computer Vision
Robotics
All Areas
Careers
Academic Collaborations
Government Collaborations
Graduate Fellowship
Internships
Research Openings
Research Scientists
Meet the Team
Licensing
Search
Search
Enter the terms you wish to search for.
Publications
Our publications provide insight into some of our leading-edge research.
Filters
Search
Apply
Filters
Filters
Publication Year
2025
(2)
2024
(17)
2023
(34)
2022
(11)
2021
(7)
2020
(6)
2019
(1)
2018
(3)
Facet Publication Year
Research Areas
Artificial Intelligence and Machine Learning
(81)
Generative AI
(81)
Computer Vision
(40)
Computer Graphics
(26)
Applied Perception
(7)
Circuits and VLSI Design
(6)
Natural Language Processing
(6)
Robotics
(5)
Speech Processing
(5)
Autonomous Vehicles
(4)
Machine Translation
(3)
VR, AR and Display Technology
(2)
Climate Simulation
(1)
Computational Photography and Imaging
(1)
Computer Architecture
(1)
Esports
(1)
Human Computer Interaction
(1)
Physical AI
(1)
Storage and Systems
(1)
Events
CVPR
(10)
ECCV
(2)
ICCV
(4)
ICLR
(7)
ICML
(3)
ICRA
(4)
NeurIPS
(15)
SIGGRAPH
(8)
81 results found
Artificial Intelligence and Machine Learning
Generative AI
Clear all
Artificial Intelligence and Machine Learning
Generative AI
2025
CorrFill: Enhancing Faithfulness in Reference-based Inpainting with Correspondence Guidance in Diffusion Models
Kuan-Hung Liu, Cheng-Kun Yang,
Min-Hung Chen
, Yu-Lun Liu, Yen-Yu Lin
Cosmos World Foundation Model Platform for Physical AI
Ming-Yu Liu
, Many other contributors at https://d1qx31qr3h6wln.cloudfront.net/publications/NVIDIA%20Cosmos_4.pdf
2024
Diffusion-Reward Adversarial Imitation Learning
Chun-Mao Lai, Hsiang-Chun Wang, Ping-Chun Hsieh,
Frank Wang
,
Min-Hung Chen
, Shao-Hua Sun
NeurIPS
MaskedMimic: Unified Physics-Based Character Control Through Masked Motion Inpainting
Chen Tessler
, Kelly Guo, Ofir Nabati,
Gal Chechik
, Jason Peng
SIGGRAPH
DRC-Coder: Automated DRC Checker Code Generation Using LLM Autonomous Agent
Chen-Chia Chang,
Chia-Tung (Mark) Ho
, Yaguang Li, Yiran Chen,
Haoxing (Mark) Ren
Learning to Move Like Professional Counter-Strike Players
David Durst, F. Xie, V. Sarukkai, Brennan Shacklett,
Iuri Frosio
,
Chen Tessler
,
Joohwan Kim
, C. Taylor, G. Bernstein, S. Choudhury, P. Hanrahan,, Kayvon Fatahalian
Kilometer-Scale Convection Allowing Model Emulation using Generative Diffusion Modeling
Jaideep Pathak
,
Yair Cohen
, Piyush Garg, Peter Harrington,
Noah Brenowitz
,
Dale Durran
,
Morteza Mardani
,
Arash Vahdat
, Shaoming Xu, Karthik Kashinath,
Mike Pritchard
VerilogCoder: Autonomous Verilog Coding Agents with Graph-based Planning and Abstract Syntax Tree (AST)-based Waveform Tracing Tool
Chia-Tung (Mark) Ho
,
Haoxing (Mark) Ren
,
Brucek Khailany
GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators
Yuchen Hu, Chen Chen,
Huck Yang
, Ruizhe Li, Zhehuai Chen, Eng Siong Chng
Align Your Steps: Optimizing Sampling Schedules in Diffusion Models
Amirmojtaba Sabour, Sanja Fidler,
Karsten Kreis
ICML
DoRA: Weight-Decomposed Low-Rank Adaptation
Shih-Yang Liu, Chien-Yi Wang,
Hongxu Danny Yin
,
Pavlo Molchanov
,
Frank Wang
, Kwang-Ting Cheng,
Min-Hung Chen
ICML
Large Language Model (LLM) for Standard Cell Layout Design Optimization
Chia-Tung (Mark) Ho
,
Haoxing (Mark) Ren
Best Paper Award
Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models
Huan Ling, Seung Wook Kim, Antonio Torralba, Sanja Fidler,
Karsten Kreis
CVPR
Nemotron-4 340B
WildFusion: Learning 3D-Aware Latent Diffusion Models in View Space
Katja Schwarz, Seung Wook Kim, Jun Gao, Sanja Fidler, Andreas Geiger,
Karsten Kreis
ICLR
Large Language Models are Efficient Learners of Noise-Robust Speech Recognition
YuChen Hu, Chen Chen,
Huck Yang
, Ruizhe Li, Chao Zhang, Pin-Yu Chen, EnSiong Chng
ICLR
It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition
Chen Chen, Ruizhe Li, Yuchen Hu, Sabato Marco Siniscalchi, Pin-Yu Chen, Ensiong Chng,
Huck Yang
ICLR
LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis
Kevin Xie, Jonathan Lorraine, Tianshi Cao, Jun Gao, James Lucas, Antonio Torralba, Sanja Fidler, Xiaohui Zeng
ECCV
Generating images of rare concepts using pre-trained diffusion models
Dvir Samuel, Rami Ben-Ari, Simon Raviv, Nir Darshan,
Gal Chechik
2023
Point-Cloud Completion with Pretrained Text-to-image Diffusion Models
Yoni Kasten
, Ohad Rahamim,
Gal Chechik
NeurIPS
SceneScape: Text-Driven Consistent Scene Generation
Rafail Fridman, Amit Abecasis,
Yoni Kasten
, Tali Dekel
NeurIPS
HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models
Chen Chen, YuChen Hu,
Huck Yang
, Sabato Marco Siniscalchi, Pin-Yu Chen, Ensiong Chng
NeurIPS
XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies
Xuanchi Ren, Jiahui Huang, Xiaohui Zeng, Ken Museth, Sanja Fidler, Francis Williams
CVPR
Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition
Srijith Radhakrishnan,
Huck Yang
, Sumeer Khan, Rohit Kumar, Narsis Kiani, David Gomez-Cabrero, Jesper Tegnér
ChipNeMo: Domain-Adapted LLMs for Chip Design
Mingjie Liu
, Teo Ene, Robert Kirby, Chris Cheng,
Nathaniel Pinckney
,
Rongjian Liang
, Jonah Alben, Himyanshu Anand, Sanmitra Banerjee, Ismet Bayraktaroglu, Bonita Bhaskaran, Bryan Catanzaro, Arjun Chaudhuri, Sharon Clay, Bill Dally, Laura Dang, Parikshit Deshpande, Siddhanth Dhodhi, Sameer Halepete, Eric Hill, Jiashang Hu, Sumit Jain,
Brucek Khailany
, George Kokai, Kishor Kunal, Xiaowei Li, Charley Lind, Hao Liu, Stuart Oberman, Sujeet Omar, Sreedhar Pratty, Jonathan Raman, Ambar Sarkar, Zhengjiang Shao, Hanfei Sun, Pratik P Suthar, Varun Tej,
Walker Turner
, Kaizhe Xu,
Haoxing (Mark) Ren
TexFusion: Synthesizing 3D Textures with Text-Guided Image Diffusion Models
Tianshi Cao,
Karsten Kreis
, Sanja Fidler, Nicholas Sharp, Kangxue Yin
ICCV
Generative Novel View Synthesis with 3D-Aware Diffusion Models
Eric R. Chan,
Koki Nagano
, Matthew Chan, Alexander W. Bergman, Jeong Joon Park, Axel Levy,
Miika Aittala
,
Shalini De Mello
,
Tero Karras
, Gordon Wetzstein
ICCV
Oral
DreamTeacher: Pretraining Image Backbones with Deep Generative Models
Daiqing Li, Huan Ling, Amlan Kar, David Acuna, Seung Wook Kim,
Karsten Kreis
, Antonio Torralba, Sanja Fidler
ICCV
ATT3D: Amortized Text-To-3D Object Synthesis
Jonathan Lorraine, Kevin Xie, Xiaohui Zeng,
Chen-Hsuan Lin
, Towaki Takikawa, Nicholas Sharp,
Tsung-Yi Lin
,
Ming-Yu Liu
, Sanja Fidler, James Lucas
ICCV
Syntactic Binding in Diffusion Models: Enhancing Attribute Correspondence through Attention Map Alignment
Royi Rassin, Eran Hirsch, Daniel Glickman, Shauli Ravfogel, Yoav Goldberg,
Gal Chechik
NeurIPS
Oral presentation
Norm-guided latent space exploration for text-to-image generation
Dvir Samuel, Rami Ben-Ari, Nir Darshan,
Haggai Maron
,
Gal Chechik
NeurIPS
Differentially Private Diffusion Models
Tim Dockhorn, Tianshi Cao,
Arash Vahdat
,
Karsten Kreis
Pagination
Current page
1
Page
2
Page
3
Next page
Next ›
Last page
Last »