Research Labs
All Research Labs
3D Deep Learning
Applied Research
Autonomous Vehicles
Deep Imagination
Publications
AI Playground
New and Featured
AI Art Gallery
NGC Demos
Research Areas
AI & Machine Learning
3D Deep Learning
Computer Vision
Robotics
All Areas
Careers
Academic Collaborations
Government Collaborations
Graduate Fellowship
Internships
Research Openings
Research Scientists
Meet the Team
Licensing
Skip to main content
Artificial Intelligence Computing Leadership from NVIDIA
Login
Research Labs
All Research Labs
3D Deep Learning
Applied Research
Autonomous Vehicles
Deep Imagination
Publications
AI Playground
New and Featured
AI Art Gallery
NGC Demos
Research Areas
AI & Machine Learning
3D Deep Learning
Computer Vision
Robotics
All Areas
Careers
Academic Collaborations
Government Collaborations
Graduate Fellowship
Internships
Research Openings
Research Scientists
Meet the Team
Licensing
Search
Search
Enter the terms you wish to search for.
Publications
Our publications provide insight into some of our leading-edge research.
Filters
Search
Apply
Filters
Filters
Publication Year
2025
(27)
2024
(107)
2023
(164)
2022
(164)
2021
(153)
2020
(135)
2019
(123)
2018
(107)
2017
(85)
2016
(57)
2015
(52)
2014
(23)
2013
(26)
2012
(20)
2011
(29)
2010
(19)
2009
(9)
2008
(14)
2007
(10)
2006
(1)
2005
(3)
2003
(1)
2001
(1)
Facet Publication Year
Research Areas
Artificial Intelligence and Machine Learning
(46)
Generative AI
(36)
Computer Graphics
(35)
Computer Vision
(22)
Robotics
(22)
Natural Language Processing
(14)
Applied Perception
(10)
Circuits and VLSI Design
(9)
Speech Processing
(9)
Human Computer Interaction
(8)
Esports
(7)
Machine Translation
(7)
VR, AR and Display Technology
(6)
Algorithms and Numerical Methods
(5)
Real-Time Rendering
(3)
Autonomous Vehicles
(2)
Climate Simulation
(1)
Computational Photography and Imaging
(1)
High Performance Computing
(1)
Networking
(1)
Physical AI
(1)
Quantum Computing
(1)
Events
CORL
(7)
CVPR
(7)
ECCV
(3)
ICLR
(5)
ICML
(3)
ICRA
(2)
IROS
(4)
ISPD
(1)
NeurIPS
(8)
RSS
(2)
SIGGRAPH
(27)
VSS
(1)
107 results found
Clear all
2024
2024
Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition
Huck Yang
, Taejin Park, Yuan Gong, Yuanchao Li, Zhehuai Chen, yen-ting Lin, Chen Chen, Yuchen Hu, Kunal Dhawan, Piotr Zelasko, Chao Zhang, Yun-Nung Chen, Yu Tsao, Jagadeesh Balam, Boris Ginsburg, Shinji Watanabe, Andreas Stolcke
Aligning Target-Aware Molecule Diffusion Models with Exact Energy Optimization
Siyi Gu, Minkai Xu, Alexander Powers,
Weili Nie
,
Tomas Geffner
,
Karsten Kreis
, Jure Leskovec,
Arash Vahdat
, Stefano Ermon
NeurIPS
Molecule Generation with Fragment Retrieval Augmentation
Seul Lee,
Karsten Kreis
, Srimukh Prasad Veccham, Meng Liu, Danny Reidenbach, Saee Paliwal,
Arash Vahdat
,
Weili Nie
NeurIPS
L4GM: Large 4D Gaussian Reconstruction Model
Jiawei Ren, Kevin Xie, Ashkan Mirzaei, Hanxue Liang, Xiaohui Zeng,
Karsten Kreis
, Ziwei Liu, Antonio Torralba, Sanja Fidler, Seung Wook Kim, Huan Ling
NeurIPS
Warped Diffusion: Solving Video Inverse Problems with Image Diffusion Models
Giannis Daras,
Weili Nie
,
Karsten Kreis
, Alexandros G. Dimakis,
Morteza Mardani
,
Nikola Kovachki
,
Arash Vahdat
NeurIPS
FactorSim: Generative Simulation via Factorized Representation
Fan-Yun Sun, S. I. Harini, Angela Yi, Yihan Zhou,
Alex Zook
,
Jonathan Tremblay
, Logan Cross, Jiajun Wu, Nick Haber
NeurIPS
Diffusion-Reward Adversarial Imitation Learning
Chun-Mao Lai, Hsiang-Chun Wang, Ping-Chun Hsieh,
Frank Wang
,
Min-Hung Chen
, Shao-Hua Sun
NeurIPS
Fast Encoder-Based 3D from Casual Videos via Point Track Processing
Yoni Kasten
, Wuyue Lu,
Haggai Maron
NeurIPS
Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models
Yuchen Hu, Chen Chen,
Huck Yang
, Chengwei Qin, Pin-Yu Chen, Eng Siong Chng, Chao Zhang
NeurIPS
MaskedMimic: Unified Physics-Based Character Control Through Masked Motion Inpainting
Chen Tessler
, Kelly Guo, Ofir Nabati,
Gal Chechik
, Jason Peng
SIGGRAPH
Large Étendue 3D Holographic Display with Content-adpative Dynamic Fourier Modulation
Brian Chao, Manu Gopakumar, Suyeon Choi, Liang Shi,
Jonghyun Kim
, Gordon Wetzstein
SIGGRAPH
SpecTrack: Learned Multi-Rotation Tracking via Speckle Imaging
Ziyang Chen, Doğa Doğan,
Josef Spjut
, Kaan Akşit
SIGGRAPH
Honorable Mention
DRC-Coder: Automated DRC Checker Code Generation Using LLM Autonomous Agent
Chen-Chia Chang,
Chia-Tung (Mark) Ho
, Yaguang Li, Yiran Chen,
Haoxing (Mark) Ren
Appearance Modeling of Iridescent Feathers with Diverse Nanostructures
Yunchen Yu,
Andrea Weidlich
, Bruce Walter,
Eugene d'Eon
, Steve Marschner
SIGGRAPH
SIGGRAPH Asia 2024 Best Paper Award
scene_synthesizer: A Python Library for Procedural Scene Generation in Robot Manipulation
Clemens Eppner
,
Adithya Murali
,
Caelan Garrett
,
Rowland O'Flaherty
,
Tucker Hermans
,
Wei Yang
,
Dieter Fox
Bayesian Example Selection Improves In-Context Learning for Speech, Text, and Visual Modalities
Siyin Wang,
Huck Yang
, Ji Wu, Chao Zhang
From Descriptive Richness to Bias: Unveiling the Dark Side of Generative Image Caption Enrichment
Yusuke Hirota,
Ryo Hachiuma
,
Huck Yang
, Yuta Nakashima
FastAdaSP: Multitask-Adapted Efficient Inference for Large Speech Language Model
Yichen Lu, Jiaqi Song,
Huck Yang
, Shinji Watanabe
Differentiable GPU-Parallelized Task and Motion Planning
William Shen,
Caelan Garrett
,
Ankit Goyal
,
Tucker Hermans
,
Fabio Ramos
CORL
SPIRE: Synergistic Planning, Imitation, and Reinforcement Learning for Long-Horizon Manipulation
Zihan Zhou, Animesh Garg,
Dieter Fox
,
Caelan Garrett
,
Ajay Mandlekar
CORL
SkillGen: Automated Demonstration Generation for Efficient Skill Learning and Deployment
Caelan Garrett
,
Ajay Mandlekar
,
Bowen Wen
,
Dieter Fox
CORL
NOD-TAMP: Generalizable Long-Horizon Planning with Neural Object Descriptors
Shuo Cheng,
Caelan Garrett
,
Ajay Mandlekar
,
Danfei Xu
CORL
Reconstructing Translucent Thin Objects from Photos
Xi Deng,
Lifan Wu
, Bruce Walter,
Eugene d'Eon
, Ravi Ramamoorthi, Steve Marschner,
Andrea Weidlich
SIGGRAPH
HAMSTER: Hierarchical Action Models for Open-World Robot Manipulation
Yi Li, Yuquan Deng, Jesse Zhang, Joel Jang, Marius Memmel,
Caelan Garrett
,
Fabio Ramos
,
Dieter Fox
,
Anqi Li
, Abhishek Gupta,
Ankit Goyal
CORL
Guiding Long-Horizon Task and Motion Planning with Vision Language Models
Zhutian Yang,
Caelan Garrett
,
Dieter Fox
, Tomás Lozano-Pérez, Leslie Pack Kaelbling
CORL
Open-World Task and Motion Planning via Vision-Language Model Inferred Constraints
Nishanth Kumar, William Shen,
Fabio Ramos
,
Dieter Fox
, Tomás Lozano-Pérez, Leslie Pack Kaelbling,
Caelan Garrett
CORL
Constructability-driven design of frame structures with state-space search methods
Yijiang Huang,
Caelan Garrett
, Caitlin Mueller
ReMatching Dynamic Reconstruction Flow
Sara Oblak, Despoina Paschalidou, Sanja Fidler, Matan Atzmon
ICLR
Interventional Data Generation for Robust and Data-Efficient Robot Imitation Learning
Ryan Hoque,
Ajay Mandlekar
,
Caelan Garrett
, Ken Goldberg,
Dieter Fox
IROS
Proto-CLIP: Vision-Language Prototypical Network for Few-Shot Learning
Jishnu Jaykumar P, Kamalesh Palanisamy,
Yu-Wei Chao
, Xinya Du, Yu Xiang
IROS
DiMSam: Diffusion Models as Samplers for Task and Motion Planning under Partial Observability
Xiaolin Fang,
Caelan Garrett
,
Clemens Eppner
, Tomás Lozano-Pérez, Leslie Pack Kaelbling,
Dieter Fox
IROS
Best Conference Paper Finalist
Best Student Paper Finalist
DiffiT: Diffusion Vision Transformers for Image Generation
Ali Hatamizadeh
, Jiaming Song, Guilin Liu,
Jan Kautz
,
Arash Vahdat
ECCV
Pagination
Current page
1
Page
2
Page
3
Page
4
Next page
Next ›
Last page
Last »