Research Labs
All Research Labs
3D Deep Learning
Applied Research
Autonomous Vehicles
Deep Imagination
Publications
AI Playground
New and Featured
AI Art Gallery
NGC Demos
Research Areas
AI & Machine Learning
3D Deep Learning
Computer Vision
Robotics
All Areas
Careers
Academic Collaborations
Government Collaborations
Graduate Fellowship
Internships
Research Openings
Research Scientists
Meet the Team
Licensing
Skip to main content
Artificial Intelligence Computing Leadership from NVIDIA
Login
Research Labs
All Research Labs
3D Deep Learning
Applied Research
Autonomous Vehicles
Deep Imagination
Publications
AI Playground
New and Featured
AI Art Gallery
NGC Demos
Research Areas
AI & Machine Learning
3D Deep Learning
Computer Vision
Robotics
All Areas
Careers
Academic Collaborations
Government Collaborations
Graduate Fellowship
Internships
Research Openings
Research Scientists
Meet the Team
Licensing
Search
Search
Enter the terms you wish to search for.
Publications
Our publications provide insight into some of our leading-edge research.
Filters
Search
Apply
Filters
Filters
Publication Year
2025
(36)
2024
(43)
2023
(39)
2022
(13)
2021
(7)
2020
(6)
2019
(1)
2018
(3)
Facet Publication Year
Research Areas
Generative AI
(150)
Artificial Intelligence and Machine Learning
(123)
Computer Vision
(62)
Computer Graphics
(41)
Robotics
(13)
Natural Language Processing
(10)
Circuits and VLSI Design
(9)
Speech Processing
(9)
Autonomous Vehicles
(8)
Applied Perception
(7)
Physical AI
(7)
VR, AR and Display Technology
(5)
Machine Translation
(4)
Climate Simulation
(3)
Algorithms and Numerical Methods
(2)
High Performance Computing
(2)
Computational Photography and Imaging
(1)
Computer Architecture
(1)
Esports
(1)
Human Computer Interaction
(1)
Resilience and Safety
(1)
Storage and Systems
(1)
World Simulation
(1)
Events
CVPR
(22)
ECCV
(4)
ICCV
(4)
ICLR
(16)
ICML
(8)
ICRA
(6)
NeurIPS
(25)
RSS
(1)
SIGGRAPH
(18)
150 results found
Generative AI
Clear all
Generative AI
2024
What You See is What You GAN: Rendering Every Pixel for High-Fidelity Geometry in 3D GANs
Alexander Trevithick
, Matthew Chan, Towaki Takikawa,
Umar Iqbal
,
Shalini De Mello
, Manmohan Chandraker, Ravi Ramamoorthi,
Koki Nagano
CVPR
RegionGPT: Towards Region Understanding Vision Language Model
Qiushan Guo,
Shalini De Mello
,
Hongxu Danny Yin
,
Wonmin Byeon
, Ka Chun Cheung, Yizhou Yu, Ping Luo,
Sifei Liu
CVPR
GAvatar: Animatable 3D Gaussian Avatars with Implicit Mesh Learning
Ye Yuan
,
Xueting Li
, Yangyi Huang,
Shalini De Mello
,
Koki Nagano
,
Jan Kautz
,
Umar Iqbal
CVPR
Highlight
Dream-in-4D: A Unified Approach for Text- and Image-guided 4D Scene Generation
Yufeng Zheng,
Xueting Li
,
Koki Nagano
,
Sifei Liu
, Otmar Hilliges,
Shalini De Mello
CVPR
Nemotron-4 340B
Flexible Motion In-betweening with Diffusion Models
Setareh Cohan, Guy Tevet, Daniele Reda, Xue Bin Peng, Michiel van de Panne
SIGGRAPH
WildFusion: Learning 3D-Aware Latent Diffusion Models in View Space
Katja Schwarz, Seung Wook Kim, Jun Gao, Sanja Fidler, Andreas Geiger,
Karsten Kreis
ICLR
Large Language Models are Efficient Learners of Noise-Robust Speech Recognition
YuChen Hu, Chen Chen,
Huck Yang
, Ruizhe Li, Chao Zhang, Pin-Yu Chen, EnSiong Chng
ICLR
It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition
Chen Chen, Ruizhe Li, Yuchen Hu, Sabato Marco Siniscalchi, Pin-Yu Chen, Ensiong Chng,
Huck Yang
ICLR
3D Reconstruction with Generalizable Neural Fields using Scene Priors
Yang Fu,
Shalini De Mello
,
Xueting Li
, Amey Kulkarni,
Jan Kautz
, Xiaolong Wang,
Sifei Liu
ICLR
LCM-Lookahead for Encoder-based Text-to-Image Personalization
Rinon Gal, Or Lichter, Elad Richardson, Or Patashnik, Amit H Bermano,
Gal Chechik
, Daniel Cohen-Or
ECCV
LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis
Kevin Xie, Jonathan Lorraine, Tianshi Cao, Jun Gao, James Lucas, Antonio Torralba, Sanja Fidler, Xiaohui Zeng
ECCV
Consolidating Attention Features for Multi-view Image Editing
Or Patashnik, Rinon Gal, Daniel Cohen-Or, Jun-Yan Zhu, Fernando De la Torre
SIGGRAPH
ConsiStory: Training-Free Consistent Text-to-Image Generation
Yoad Tewel, Omri Kaduri, Rinon Gal,
Yoni Kasten
, Lior Wolf,
Gal Chechik
,
Yuval Atzmon
SIGGRAPH
Generating images of rare concepts using pre-trained diffusion models
Dvir Samuel, Rami Ben-Ari, Simon Raviv, Nir Darshan,
Gal Chechik
2023
Point-Cloud Completion with Pretrained Text-to-image Diffusion Models
Yoni Kasten
, Ohad Rahamim,
Gal Chechik
NeurIPS
SceneScape: Text-Driven Consistent Scene Generation
Rafail Fridman, Amit Abecasis,
Yoni Kasten
, Tali Dekel
NeurIPS
HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models
Chen Chen, YuChen Hu,
Huck Yang
, Sabato Marco Siniscalchi, Pin-Yu Chen, Ensiong Chng
NeurIPS
Domain-Agnostic Tuning-Encoder for Fast Personalization of Text-To-Image Models
Moab Arar, Rinon Gal,
Yuval Atzmon
,
Gal Chechik
, Daniel Cohen-Or, Ariel Shamir, Amit Bermano
SIGGRAPH
XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies
Xuanchi Ren, Jiahui Huang, Xiaohui Zeng, Ken Museth, Sanja Fidler, Francis Williams
CVPR
Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition
Srijith Radhakrishnan,
Huck Yang
, Sumeer Khan, Rohit Kumar, Narsis Kiani, David Gomez-Cabrero, Jesper Tegnér
ChipNeMo: Domain-Adapted LLMs for Chip Design
Mingjie Liu
, Teo Ene, Robert Kirby, Chris Cheng,
Nathaniel Pinckney
,
Rongjian Liang
, Jonah Alben, Himyanshu Anand, Sanmitra Banerjee, Ismet Bayraktaroglu, Bonita Bhaskaran, Bryan Catanzaro, Arjun Chaudhuri, Sharon Clay, Bill Dally, Laura Dang, Parikshit Deshpande, Siddhanth Dhodhi, Sameer Halepete, Eric Hill, Jiashang Hu, Sumit Jain,
Brucek Khailany
, George Kokai, Kishor Kunal, Xiaowei Li, Charley Lind, Hao Liu, Stuart Oberman, Sujeet Omar, Sreedhar Pratty, Jonathan Raman, Ambar Sarkar, Zhengjiang Shao, Hanfei Sun, Pratik P Suthar, Varun Tej,
Walker Turner
, Kaizhe Xu,
Haoxing (Mark) Ren
TexFusion: Synthesizing 3D Textures with Text-Guided Image Diffusion Models
Tianshi Cao,
Karsten Kreis
, Sanja Fidler, Nicholas Sharp, Kangxue Yin
ICCV
Generative Novel View Synthesis with 3D-Aware Diffusion Models
Eric R. Chan,
Koki Nagano
, Matthew Chan, Alexander W. Bergman, Jeong Joon Park, Axel Levy,
Miika Aittala
,
Shalini De Mello
,
Tero Karras
, Gordon Wetzstein
ICCV
Oral
DreamTeacher: Pretraining Image Backbones with Deep Generative Models
Daiqing Li, Huan Ling, Amlan Kar, David Acuna, Seung Wook Kim,
Karsten Kreis
, Antonio Torralba, Sanja Fidler
ICCV
ATT3D: Amortized Text-To-3D Object Synthesis
Jonathan Lorraine, Kevin Xie, Xiaohui Zeng,
Chen-Hsuan Lin
, Towaki Takikawa, Nicholas Sharp,
Tsung-Yi Lin
,
Ming-Yu Liu
, Sanja Fidler, James Lucas
ICCV
Syntactic Binding in Diffusion Models: Enhancing Attribute Correspondence through Attention Map Alignment
Royi Rassin, Eran Hirsch, Daniel Glickman, Shauli Ravfogel, Yoav Goldberg,
Gal Chechik
NeurIPS
Oral presentation
Norm-guided latent space exploration for text-to-image generation
Dvir Samuel, Rami Ben-Ari, Nir Darshan,
Haggai Maron
,
Gal Chechik
NeurIPS
VerilogEval: Evaluating Large Language Models for Verilog Code Generation
Mingjie Liu
,
Nathaniel Pinckney
,
Brucek Khailany
,
Mark Haoxing Ren
Differentially Private Diffusion Models
Tim Dockhorn, Tianshi Cao,
Arash Vahdat
,
Karsten Kreis
Flexible Isosurface Extraction for Gradient-Based Mesh Optimization
Tianchang Shen,
Jacob Munkberg
,
Jon Hasselgren
, Kangxue Yin, Zian Wang, Wenzheng Chen, Zan Gojcic, Sanja Fidler, Nicholas Sharp, Jun Gao
SIGGRAPH
IGB: Addressing The Gaps In Labeling, Features, Heterogeneity, and Size of Public Graph Datasets for Deep Learning Research
Arpandeep Khatua,
Vikram Sharma Mailthody
, Bhagyashree Taleka, Tengfei Ma, Xiang Song,
Wen-mei Hwu
Pagination
First page
« First
Previous page
‹ Previous
Page
1
Page
2
Current page
3
Page
4
Page
5
Next page
Next ›
Last page
Last »