Artificial Intelligence Computing Leadership from NVIDIA
Publications
Our publications provide insight into some of our leading-edge research.
37 results found
Applied Perception
2025
Toward Understanding Display Size for FPS Esports Aiming
Arjun Madhusudan, Josef Spjut, Benjamin Watson, Seth Schneider, Ben Boudaoud, Joohwan Kim
Pushing the Limits? Frame Rate Benefits to Players for up to 500 Hz in First Person Shooter Games
Samin Shahriar Tokey, Ben Boudaoud, Joohwan Kim, Josef Spjut, Mark Claypool
Spatio-Temporal Context Prompting for Zero-Shot Action Detection
Wei-Jhe Huang, Min-Hung Chen, Shang-Hong Lai
Semantic Prompt Learning for Weakly-Supervised Semantic Segmentation
Ci-Siang Lin, Chien-Yi Wang, Frank Wang, Min-Hung Chen
CorrFill: Enhancing Faithfulness in Reference-based Inpainting with Correspondence Guidance in Diffusion Models
Kuan-Hung Liu, Cheng-Kun Yang, Min-Hung Chen, Yu-Lun Liu, Yen-Yu Lin
2024
Mitigating Covariate Shift in Imitation Learning for Autonomous Vehicles Using Latent Space Generative World Models
Alexander Popov, Alperen Degirmenci, David Wehr, Shashank Hegde, Ryan Oldja, Alexey Kamenev, Bertrand Douillard, David Nistér, Urs Muller, Ruchi Bhargava, Stan Birchfield, Nikolai Smolyanskiy
GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators
Yuchen Hu, Chen Chen, Huck Yang, Ruizhe Li, Zhehuai Chen, Eng Siong Chng
Variable Frame Timing Affects Perception of Smoothness in First-Person Gaming
Devi Klein, Josef Spjut, Ben Boudaoud, Joohwan Kim
FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects
Bowen Wen, Wei Yang, Jan Kautz, Stan Birchfield
CVPR
Do Action Video Game Players Search Faster Than Non-Players?
Zoe (Jing) Xu, Josef Spjut, Ben Boudaoud, Simona Buetti, Alejandro Lleras, Ruth Rosenholtz
VSS
Large Language Models are Efficient Learners of Noise-Robust Speech Recognition
YuChen Hu, Chen Chen, Huck Yang, Ruizhe Li, Chao Zhang, Pin-Yu Chen, EnSiong Chng
ICLR
It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition
Chen Chen, Ruizhe Li, Yuchen Hu, Sabato Marco Siniscalchi, Pin-Yu Chen, Ensiong Chng, Huck Yang
ICLR
Is Less More? Rendering for Esports
Benjamin Watson, Josef Spjut, Joohwan Kim, Byungjoo Lee, Mijin Yoo, Peter Shirley, Rulon Raymond
Evaluating and Improving Rendered Visual Experiences: Metrics, Compression, Higher Frame Rates & Recoloring
Pontus Ebelin
Estimates of Temporal Edge Detection Filters in Human Vision
Pontus Ebelin, Gyorgy Denes, Tomas Akenine-Möller, Kalle Åström, Magnus Oskarsson, William H. McIlhagga
2023
HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models
Chen Chen, YuChen Hu, Huck Yang, Sabato Marco Siniscalchi, Pin-Yu Chen, Ensiong Chng
NeurIPS
Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition
Srijith Radhakrishnan, Huck Yang, Sumeer Khan, Rohit Kumar, Narsis Kiani, David Gomez-Cabrero, Jesper Tegnér
Constant Field of View Display Size Effects on First-Person Aiming Time
Josef Spjut, Ben Boudaoud, Joohwan Kim
BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects
Bowen Wen, Jonathan Tremblay, Valts Blukis, Stephen Tyree, Thomas Müller, Alex Evans, Dieter Fox, Jan Kautz, Stan Birchfield
CVPR
Subpixel Deblurring of Anti-Aliased Raster Clip Art
Jinfan Yang, Nicholas Vining, Shakiba Kheradmand, Nathan Carr, Leonid Sigal, Alla Sheffer
Luminance-Preserving and Temporally Stable Daltonization
Pontus Ebelin, Cyril Crassin, Gyorgy Denes, Magnus Oskarsson, Kalle Åström, Tomas Akenine-Möller
Efficient Dataflow Modeling of Peripheral Encoding in the Human Visual System
Rachel Brown, Vasha DuTell, Bruce Walter, Ruth Rosenholtz, Peter Shirley, Morgan McGuire, David Luebke
2022
Image Features Influence Reaction Time: A Learned Probabilistic Perceptual Model for Saccade Latency
Budmonde Duinkharjav, Praneeth Chakravarthula, Rachel Brown, Anjul Patney, Qi Sun
Best Technical Paper, SIGGRAPH 2022
As-Locally-Uniform-as-Possible Reshaping of Vector Clip Art
Chrystiano Araujo, Nicholas Vining, Enrique Rosales, Giorgio Gori, Alla Sheffer
Detecting Viewer-Perceived Intended Vector Sketch Connectivity
Jerry Yin, Chenxi Liu, Rebecca Liu, Nicholas Vining, Helge Rhodin, Alla Sheffer
SIGGRAPH
PredictionNet: Real-Time Joint Probabilistic Traffic Prediction for Planning, Control, and Simulation
Alexey Kamenev, Lirui Wang, Ollin Boer Bohan, Ishwar Kulkarni, Bilal Kartal, Artem Molchanov, Stan Birchfield, David Nistér, Nikolai Smolyanskiy
ICRA
2021
StrokeStrip: Joint Parameterization and Fitting of Stroke Clusters
Dave Pagurek van Mossel, Chenxi Liu, Nicholas Vining, Mikhail Bessmeltsev, Alla Sheffer
SIGGRAPH
Known unknowns: Learning novel concepts using exploratory reasoning-by-elimination
Harsh Agrawal, Eli Meirom, Yuval Atzmon, Shie Mannor, Gal Chechik
Oral
Robust Vision-Based Cheat Detection in Competitive Gaming
Aditya Jonnalagadda, Iuri Frosio, Seth Schneider, Morgan McGuire, Joohwan Kim
2020
MVLidarNet: Real-Time Multi-Class Scene Understanding for Autonomous Driving Using Multiple Views
Ke Chen, Ryan Oldja, Nikolai Smolyanskiy, Stan Birchfield, Alexander Popov, David Wehr, Ibrahim Eden, Joachim Pehserl
IROS
Eccentricity Effects on Blur and Depth Perception
Qi Sun, Fu-Chung Huang, Li-Yi Wei, David Luebke, Arie Kaufman, Joohwan Kim
2019
FirstPersonScience: Quantifying Psychophysics for First Person Shooter Tasks
Josef Spjut, Ben Boudaoud, Kamran Binaee, Zander Majercik, Morgan McGuire, Joohwan Kim