Research Labs
All Research Labs
3D Deep Learning
Applied Research
Autonomous Vehicles
Deep Imagination
Publications
AI Playground
New and Featured
AI Art Gallery
NGC Demos
Research Areas
AI & Machine Learning
3D Deep Learning
Computer Vision
Robotics
All Areas
Careers
Academic Collaborations
Government Collaborations
Graduate Fellowship
Internships
Research Openings
Research Scientists
Meet the Team
Licensing
Skip to main content
Artificial Intelligence Computing Leadership from NVIDIA
Login
Research Labs
All Research Labs
3D Deep Learning
Applied Research
Autonomous Vehicles
Deep Imagination
Publications
AI Playground
New and Featured
AI Art Gallery
NGC Demos
Research Areas
AI & Machine Learning
3D Deep Learning
Computer Vision
Robotics
All Areas
Careers
Academic Collaborations
Government Collaborations
Graduate Fellowship
Internships
Research Openings
Research Scientists
Meet the Team
Licensing
Search
Search
Enter the terms you wish to search for.
Publications
Our publications provide insight into some of our leading-edge research.
Filters
Search
Apply
Filters
Filters
Publication Year
2025
(3)
2024
(3)
2023
(3)
2022
(1)
2021
(2)
Facet Publication Year
Research Areas
Applied Perception
(12)
Artificial Intelligence and Machine Learning
(12)
Generative AI
(7)
Natural Language Processing
(7)
Computer Vision
(5)
Speech Processing
(5)
Computer Graphics
(3)
Machine Translation
(3)
Computational Photography and Imaging
(1)
Esports
(1)
Human Computer Interaction
(1)
Events
ICLR
(2)
NeurIPS
(1)
SIGGRAPH
(1)
12 results found
Applied Perception
Artificial Intelligence and Machine Learning
Clear all
Applied Perception
Artificial Intelligence and Machine Learning
2025
Spatio-Temporal Context Prompting for Zero-Shot Action Detection
Wei-Jhe Huang,
Min-Hung Chen
, Shang-Hong Lai
Semantic Prompt Learning for Weakly-Supervised Semantic Segmentation
Ci-Siang Lin, Chien-Yi Wang,
Frank Wang
,
Min-Hung Chen
CorrFill: Enhancing Faithfulness in Reference-based Inpainting with Correspondence Guidance in Diffusion Models
Kuan-Hung Liu, Cheng-Kun Yang,
Min-Hung Chen
, Yu-Lun Liu, Yen-Yu Lin
2024
GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators
Yuchen Hu, Chen Chen,
Huck Yang
, Ruizhe Li, Zhehuai Chen, Eng Siong Chng
Large Language Models are Efficient Learners of Noise-Robust Speech Recognition
YuChen Hu, Chen Chen,
Huck Yang
, Ruizhe Li, Chao Zhang, Pin-Yu Chen, EnSiong Chng
ICLR
It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition
Chen Chen, Ruizhe Li, Yuchen Hu, Sabato Marco Siniscalchi, Pin-Yu Chen, Ensiong Chng,
Huck Yang
ICLR
2023
HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models
Chen Chen, YuChen Hu,
Huck Yang
, Sabato Marco Siniscalchi, Pin-Yu Chen, Ensiong Chng
NeurIPS
Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition
Srijith Radhakrishnan,
Huck Yang
, Sumeer Khan, Rohit Kumar, Narsis Kiani, David Gomez-Cabrero, Jesper Tegnér
Subpixel Deblurring of Anti-Aliased Raster Clip Art
Jinfan Yang,
Nicholas Vining
, Shakiba Kheradmand, Nathan Carr, Leonid Sigal, Alla Sheffer
2022
Detecting Viewer-Perceived Intended Vector Sketch Connectivity
Jerry Yin, Chenxi Liu, Rebecca Liu,
Nicholas Vining
, Helge Rhodin, Alla Sheffer
SIGGRAPH
2021
Known unknowns: Learning novel concepts using exploratory reasoning-by-elimination
Harsh Agrawal,
Eli Meirom
,
Yuval Atzmon
,
Shie Mannor
,
Gal Chechik
Oral
Robust Vision-Based Cheat Detection in Competitive Gaming
Aditya Jonnalagadda,
Iuri Frosio
, Seth Schenider, Morgan McGuire,
Joohwan Kim