Research Labs
All Research Labs
3D Deep Learning
Applied Research
Autonomous Vehicles
Deep Imagination
Publications
AI Playground
New and Featured
AI Art Gallery
NGC Demos
Research Areas
AI & Machine Learning
3D Deep Learning
Computer Vision
Robotics
All Areas
Careers
Academic Collaborations
Government Collaborations
Graduate Fellowship
Internships
Research Openings
Research Scientists
Meet the Team
Licensing
Skip to main content
Publications
Our publications provide insight into some of our leading-edge research.
Filters
Search
Apply
Filters
Filters
Publication Year
2025
(6)
2024
(10)
2023
(7)
2022
(4)
2021
(3)
2020
(2)
2019
(2)
2018
(2)
2017
(1)
2016
(1)
Facet Publication Year
Research Areas
Applied Perception
(17)
Computer Graphics
(9)
Artificial Intelligence and Machine Learning
(6)
Generative AI
(6)
Natural Language Processing
(5)
Speech Processing
(5)
Computer Vision
(4)
Esports
(4)
Real-Time Rendering
(4)
VR, AR and Display Technology
(4)
Machine Translation
(3)
Human Computer Interaction
(2)
Robotics
(2)
Autonomous Vehicles
(1)
Computational Photography and Imaging
(1)
Events
CVPR
(2)
ICLR
(2)
NeurIPS
(1)
VSS
(1)
17 results found
Applied Perception
Clear all
2024
2023
Applied Perception
2024
Mitigating Covariate Shift in Imitation Learning for Autonomous Vehicles Using Latent Space Generative World Models
Alexander Popov, Alperen Degirmenci, David Wehr, Shashank Hegde , Ryan Oldja, Alexey Kamenev, Bertrand Douillard, David Nistér, Urs Muller, Ruchi Bhargava,
Stan Birchfield
, Nikolai Smolyanskiy
GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators
Yuchen Hu, Chen Chen,
Huck Yang
, Ruizhe Li, Zhehuai Chen, Eng Siong Chng
Variable Frame Timing Affects Perception of Smoothness in First-Person Gaming
Devi Klein,
Josef Spjut
,
Ben Boudaoud
,
Joohwan Kim
FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects
Bowen Wen
,
Wei Yang
,
Jan Kautz
,
Stan Birchfield
CVPR
Do Action Video Game Players Search Faster Than Non-Players?
Zoe (Jing) Xu,
Josef Spjut
,
Ben Boudaoud
, Simona Buetti, Alejandro Lleras,
Ruth Rosenholtz
VSS
Large Language Models are Efficient Learners of Noise-Robust Speech Recognition
YuChen Hu, Chen Chen,
Huck Yang
, Ruizhe Li, Chao Zhang, Pin-Yu Chen, EnSiong Chng
ICLR
It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition
Chen Chen, Ruizhe Li, Yuchen Hu, Sabato Marco Siniscalchi, Pin-Yu Chen, Ensiong Chng,
Huck Yang
ICLR
Is Less More? Rendering for Esports
Benjamin Watson,
Josef Spjut
,
Joohwan Kim
, Byungjoo Lee, Mijin Yoo, Peter Shirley, Rulon Raymond
Evaluating and Improving Rendered Visual Experiences: Metrics, Compression, Higher Frame Rates & Recoloring
Pontus Ebelin
Estimates of Temporal Edge Detection Filters in Human Vision
Pontus Ebelin
, Gyorgy Denes,
Tomas Akenine-Möller
, Kalle Åström, Magnus Oskarsson, William H. McIlhagga
2023
HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models
Chen Chen, YuChen Hu,
Huck Yang
, Sabato Marco Siniscalchi, Pin-Yu Chen, Ensiong Chng
NeurIPS
Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition
Srijith Radhakrishnan,
Huck Yang
, Sumeer Khan, Rohit Kumar, Narsis Kiani, David Gomez-Cabrero, Jesper Tegnér
Constant Field of View Display Size Effects on First-Person Aiming Time
Josef Spjut
,
Ben Boudaoud
,
Joohwan Kim
BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects
Bowen Wen
,
Jonathan Tremblay
,
Valts Blukis
,
Stephen Tyree
,
Thomas Müller
, Alex Evans, Dieter Fox,
Jan Kautz
,
Stan Birchfield
CVPR
Subpixel Deblurring of Anti-Aliased Raster Clip Art
Jinfan Yang,
Nicholas Vining
, Shakiba Kheradmand, Nathan Carr, Leonid Sigal, Alla Sheffer
Luminance-Preserving and Temporally Stable Daltonization
Pontus Ebelin
,
Cyril Crassin
, Gyorgy Denes, Magnus Oskarsson, Kalle Åström,
Tomas Akenine-Möller
Efficient Dataflow Modeling of Peripheral Encoding in the Human Visual System
Rachel Brown, Vasha DuTell, Bruce Walter,
Ruth Rosenholtz
, Peter Shirley, Morgan McGuire,
David Luebke