Artificial Intelligence Computing Leadership from NVIDIA
Publications
Our publications provide insight into some of our leading-edge research.
Publication Year
2024 (28)
2023 (153)
2022 (162)
2021 (151)
2020 (135)
2019 (123)
2018 (107)
2017 (85)
2016 (57)
2015 (52)
2014 (23)
2013 (26)
2012 (20)
2011 (29)
2010 (19)
2009 (9)
2008 (13)
2007 (10)
2006 (1)
2005 (3)
2003 (1)
2001 (1)
Research Areas
Artificial Intelligence and Machine Learning (432)
Computer Vision (305)
Computer Graphics (295)
Computer Architecture (222)
Robotics (107)
Circuits and VLSI Design (101)
High Performance Computing (98)
Real-Time Rendering (90)
Algorithms and Numerical Methods (87)
Generative AI (79)
VR, AR and Display Technology (62)
Resilience and Safety (61)
Computational Photography and Imaging (49)
Programming Languages, Systems and Tools (49)
Human Computer Interaction (46)
Autonomous Vehicles (39)
Speech Processing (35)
Applied Perception (26)
Esports (22)
Hyperscale Graphics (16)
Medical (16)
Natural Language Processing (14)
Telecommunications (13)
Networking (12)
Machine Translation (7)
Quantum Computing (5)
Climate Simulation (1)
Storage and Systems (1)
Events
CORL (12)
CVPR (38)
ECCV (6)
ICCV (9)
ICLR (13)
ICML (11)
ICRA (28)
IROS (9)
ISPD (7)
NeurIPS (26)
PLDI (1)
RSS (5)
SIGGRAPH (34)
VSS (2)
2024
DiffiT: Diffusion Vision Transformers for Image Generation
Ali Hatamizadeh, Jiaming Song, Guilin Liu, Jan Kautz, Arash Vahdat
ECCV
Align Your Steps: Optimizing Sampling Schedules in Diffusion Models
Amirmojtaba Sabour, Sanja Fidler, Karsten Kreis
ICML
DoRA: Weight-Decomposed Low-Rank Adaptation
Shih-Yang Liu, Chien-Yi Wang, Hongxu Danny Yin, Pavlo Molchanov, Frank Wang, Kwang-Ting Cheng, Min-Hung Chen
ICML
RVT-2: Learning Precise Manipulation from Few Examples
Ankit Goyal, Valts Blukis, Jie Xu, Yijie Guo, Yu-Wei Chao, Dieter Fox
Breathing Life Into Sketches Using Text-to-Video Priors
Rinon Gal, Yael Vinker, Yuval Alaluf, Amit Bermano, Daniel Cohen-Or, Ariel Shamir, Gal Chechik
CVPR
Improving Hyperparameter Optimization with Checkpointed Model Weights
Nikhil Mehta, Jonathan Lorraine, Steve Masson, Ramanathan Arunachalam, Zaid Pervaiz Bhat, James Lucas, Arun George Zachariah
Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models
Huan Ling, Seung Wook Kim, Antonio Torralba, Sanja Fidler, Karsten Kreis
CVPR
Nemotron-4 340B
An Empirical Study of Mamba-based Language Models
Roger Waleffe, Wonmin Byeon, Duncan Riach, Brandon Norick, Vijay Korthikanti, Tri Dao, Albert Gu, Ali Hatamizadeh, Sudhakar Singh, Deepak Narayanan, Garvit Kulshreshtha, Vartika Singh, Jared Casper, Jan Kautz, Mohammad Shoeybi, Bryan Catanzaro
Do Action Video Game Players Search Faster Than Non-Players?
Zoe (Jing) Xu, Josef Spjut, Ben Boudaoud, Simona Buetti, Alejandro Lleras, Ruth Rosenholtz
VSS
Full-colour 3D holographic augmented-reality displays with metasurface waveguides
Manu Gopakumar, Gun-Yeal Lee, Suyeon Choi, Brian Chao, Yifan Peng, Jonghyun Kim, Gordon Wetzstein
FasterViT: Fast Vision Transformers with Hierarchical Attention
Ali Hatamizadeh, Greg Heinrich, Hongxu Danny Yin, Andrew Tao, Jose M. Alvarez, Jan Kautz, Pavlo Molchanov
ICLR
Large Language Models are Efficient Learners of Noise-Robust Speech Recognition
YuChen Hu, Chen Chen, Huck Yang, Ruizhe Li, Chao Zhang, Pin-Yu Chen, EnSiong Chng
ICLR
It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition
Chen Chen, Ruizhe Li, Yuchen Hu, Sabato Marco Siniscalchi, Pin-Yu Chen, Ensiong Chng, Huck Yang
ICLR
WildFusion: Learning 3D-Aware Latent Diffusion Models in View Space
Katja Schwarz, Seung Wook Kim, Jun Gao, Sanja Fidler, Andreas Geiger, Karsten Kreis
ICLR
Filtering After Shading With Stochastic Texture Filtering
Matt Pharr, Bartlomiej Wronski, Marco Salvi, Marcos Fajardo
Best Paper
A 0.190-pJ/bit 25.2-Gb/s/wire Inverter-Based AC-Coupled Transceiver for Short-Reach Die-to-Die Interfaces in 5-nm CMOS
Yoshinori Nishi, John W. Poulton, Xi Chen, Sanquan Song, Brian Zimmer, Walker Turner, Stephen Tell, Nikola Nedovic, John Wilson, William Dally, Tom Gray
Is Less More? Rendering for Esports
Benjamin Watson, Josef Spjut, Joohwan Kim, Byungjoo Lee, Mijin Yoo, Peter Shirley, Rulon Raymond
LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis
Kevin Xie, Jonathan Lorraine, Tianshi Cao, Jun Gao, James Lucas, Antonio Torralba, Sanja Fidler, Xiaohui Zeng
A Chat about Boring Problems: Studying GPT-Based Text Normalization
Yang Zhang, Travis M. Bartley, Mariana Graterol-Fuenmayor, Vitaly Lavrukhin, Evelina Bakhturina, Boris Ginsburg
Novel Transformer Model Based Clustering Method for Standard Cell Design Automation
Chia-Tung (Mark) Ho, Ajay Chandna, David Guan, Alvin Ho, Minsoo Kim, Yaguang Li, Haoxing (Mark) Ren
ISPD
Best Paper Award
MedPart: A Multi-Level Evolutionary Differentiable Hypergraph Partitioner
Rongjian Liang, Anthony Agnesina, Haoxing (Mark) Ren
GPU/ML-Enhanced Large Scale Global Routing Contest
Rongjian Liang, Anthony Agnesina, Wen-Hao Liu, Haoxing (Mark) Ren
Evaluating and Improving Rendered Visual Experiences: Metrics, Compression, Higher Frame Rates & Recoloring
Pontus Ebelin
Estimates of Temporal Edge Detection Filters in Human Vision
Pontus Ebelin, Gyorgy Denes, Tomas Akenine-Möller, Kalle Åström, Magnus Oskarsson, William H. McIlhagga
Quantum computing with subwavelength atomic arrays
Freya Shah, Taylor Patti, Oriol Rubies-Bigorda, Susanne F. Yelin
Fast Entropy-Based Methods of Word-Level Confidence Estimation for End-to-End Automatic Speech Recognition
Aleksandr Laptev, Boris Ginsburg
Generating images of rare concepts using pre-trained diffusion models
Dvir Samuel, Rami Ben-Ari, Simon Raviv, Nir Darshan, Gal Chechik
2023
Stateful Conformer with Cache-based Inference for Streaming Automatic Speech Recognition
Vahid Noroozi, Somshubra Majumdar, Ankur Kumar, Jagadeesh Balam, Boris Ginsburg
Point-Cloud Completion with Pretrained Text-to-image Diffusion Models
Yoni Kasten, Ohad Rahamim, Gal Chechik
NeurIPS
Convolutional State Space Models for Long-Range Spatiotemporal Modeling
Jimmy T. H. Smith, Shalini De Mello, Jan Kautz, Scott Linderman, Wonmin Byeon
NeurIPS
Generalizable One-shot 3D Neural Head Avatar
Xueting Li, Shalini De Mello, Sifei Liu, Koki Nagano, Umar Iqbal, Jan Kautz
NeurIPS