Research Labs
All Research Labs
3D Deep Learning
Applied Research
Autonomous Vehicles
Deep Imagination
Publications
AI Playground
New and Featured
AI Art Gallery
NGC Demos
Research Areas
AI & Machine Learning
3D Deep Learning
Computer Vision
Robotics
All Areas
Careers
Academic Collaborations
Government Collaborations
Graduate Fellowship
Internships
Research Openings
Research Scientists
Meet the Team
Licensing
Skip to main content
Artificial Intelligence Computing Leadership from NVIDIA
Login
Research Labs
All Research Labs
3D Deep Learning
Applied Research
Autonomous Vehicles
Deep Imagination
Publications
AI Playground
New and Featured
AI Art Gallery
NGC Demos
Research Areas
AI & Machine Learning
3D Deep Learning
Computer Vision
Robotics
All Areas
Careers
Academic Collaborations
Government Collaborations
Graduate Fellowship
Internships
Research Openings
Research Scientists
Meet the Team
Licensing
Search
Search
Enter the terms you wish to search for.
Research Areas
Natural Language Processing
Associated Publications
2024
Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models
Yuchen Hu, Chen Chen,
Huck Yang
, Chengwei Qin, Pin-Yu Chen, Eng Siong Chng, Chao Zhang
NeurIPS
From Descriptive Richness to Bias: Unveiling the Dark Side of Generative Image Caption Enrichment
Yusuke Hirota,
Ryo Hachiuma
,
Huck Yang
, Yuta Nakashima
EMNLP
Bayesian Example Selection Improves In-Context Learning for Speech, Text, and Visual Modalities
Siyin Wang,
Huck Yang
, Ji Wu, Chao Zhang
EMNLP
FastAdaSP: Multitask-Adapted Efficient Inference for Large Speech Language Model
Yichen Lu, Jiaqi Song,
Huck Yang
, Shinji Watanabe
EMNLP
Open-World Task and Motion Planning via Vision-Language Model Inferred Constraints
Nishanth Kumar,
Fabio Ramos
,
Dieter Fox
,
Caelan Garrett
CoRL 2024 Workshop on Language and Robot Learning Language as an Interface
HAMSTER: Hierarchical Action Models for Open-World Robot Manipulation
Yi Li, Yuquan Deng, Jesse Zhang, Joel Jang, Marius Memmel,
Caelan Garrett
,
Fabio Ramos
,
Dieter Fox
,
Anqi Li
, Abhishek Gupta,
Ankit Goyal
CoRL 2024 Workshop on Language and Robot Learning Language as an Interface
Guiding Long-Horizon Task and Motion Planning with Vision Language Models
Zhutian Yang,
Caelan Garrett
,
Dieter Fox
, Tomás Lozano-Pérez, Leslie Pack Kaelbling
CoRL 2024 Workshop on Language and Robot Learning Language as an Interface
GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators
Yuchen Hu, Chen Chen,
Huck Yang
, Ruizhe Li, Zhehuai Chen, Eng Siong Chng
ACL 2024
DoRA: Weight-Decomposed Low-Rank Adaptation
Shih-Yang Liu,
Chien-Yi Wang
,
Hongxu Danny Yin
,
Pavlo Molchanov
,
Frank Wang
, Kwang-Ting Cheng,
Min-Hung Chen
International Conference on Machine Learning (ICML) 2024
An Empirical Study of Mamba-based Language Models
Roger Waleffe,
Wonmin Byeon
, Duncan Riach, Brandon Norick, Vijay Korthikanti, Tri Dao, Albert Gu,
Ali Hatamizadeh
, Sudhakar Singh, Deepak Narayanan, Garvit Kulshreshtha, Vartika Singh, Jared Casper,
Jan Kautz
, Mohammad Shoeybi, Bryan Catanzaro
https://arxiv.org/pdf/2406.07887
It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition
Chen Chen, Ruizhe Li, Yuchen Hu, Sabato Marco Siniscalchi, Pin-Yu Chen, Ensiong Chng,
Huck Yang
ICLR 2024
Large Language Models are Efficient Learners of Noise-Robust Speech Recognition
YuChen Hu, Chen Chen,
Huck Yang
, Ruizhe Li, Chao Zhang, Pin-Yu Chen, EnSiong Chng
ICLR 2024
A Chat about Boring Problems: Studying GPT-Based Text Normalization
Yang Zhang, Travis M. Bartley, Mariana Graterol-Fuenmayor, Vitaly Lavrukhin, Evelina Bakhturina, Boris Ginsburg
ICASSP
2023
HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models
Chen Chen, YuChen Hu,
Huck Yang
, Sabato Marco Siniscalchi, Pin-Yu Chen, Ensiong Chng
NeurIPS 2023
Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition
Srijith Radhakrishnan,
Huck Yang
, Sumeer Khan, Rohit Kumar, Narsis Kiani, David Gomez-Cabrero, Jesper Tegnér
EMNLP
NeMo Guardrails: A Toolkit for Controllable and Safe LLM Applications with Programmable Rails
Traian Rebedea, Razvan Dinu, Makesh Sreedhar, Christopher Parisien, Jonathan Cohen
SteerLM: Attribute Conditioned SFT as an (User-Steerable) Alternative to RLHF
Yi Dong, Zhilin Wang, Makesh Narsimhan Sreedhar, Xianchao Wu, Oleksii Kuchaiev
2022
Evaluating Parameter Efficient Learning for Generation
Peng Xu, Mostofa Patwary, Shrimai Prabhumoye, Virginia Adams, Ryan J. Prenger, Wei Ping, Nayeon Lee, Mohammad Shoeybi, Bryan Catanzaro
Thutmose Tagger: Single-pass neural model for Inverse Text Normalization
Alexandra Antonova, Evelina Bakhturina, Boris Ginsburg
Shallow Fusion of Weighted Finite-State Transducer and Language Model for Text Normalization
Evelina Bakhturina, Yang Zhang, Boris Ginsburg
2021
Text Mining Drug/Chemical-Protein Interactions using an Ensemble of BERT and T5 Based Models
Virginia Adams, Hoo-Chang Shin, Carol Anderson, Bo Liu, Anas Abidin
A Unified Transformer-based Framework for Duplex Text Normalization
Tuan Manh Lai, Yang Zhang, Evelina Bakhturina , Boris Ginsburg, Heng Ji
SGD-QA: Fast Schema-Guided Dialogue State Tracking for Unseen Services
Yang Zhang, Vahid Noroozi, Evelina Bakhturina, Boris Ginsburg
NeMo Inverse Text Normalization: From Development To Production
Yang Zhang, Evelina Bakhturina, Kyle Gorman, Boris Ginsburg
2020
BioMegatron: Larger Biomedical Domain Language Model
Hoo-Chang Shin, Yang Zhang, Evelina Bakhturina, Raul Puri, Mostofa Patwary, Mohammad Shoeybi, Raghav Mani
ACL Anthology
A Fast and Robust BERT-based Dialogue State Tracker for Schema-Guided Dialogue Dataset
Vahid Noroozi, Yang Zhang, Evelina Bakhturina, Tomasz Kornuta
Researchers
Huck Yang
Jaesung Choe
Shizhe Diao
Yejin Choi