  Yonggan Fu  

 



  ![](/sites/default/files/person/yonggan.jpg)

  

 Yonggan Fu obtained his PhD from [Georgia Institute of Technology](https://www.gatech.edu/) in May 2025. Prior to that, he received his Bachelor's degree with a dual major in Applied Physics and Computer Science from the School of The Gifted Young at the University of Science and Technology of China in 2019. He is a recipient of [IBM PhD Fellowship](https://research.ibm.com/university/awards/fellowships.html) and was selected as [Machine Learning and Systems Rising Stars 2023](https://mlcommons.org/en/rising-stars-2023/).

His research focuses on democratizing cutting-edge AI technology on everyday devices by developing efficient foundation models. His research work has been featured as spotlight papers at ICLR ([2025](https://arxiv.org/pdf/2411.13676) &amp; [2021](https://arxiv.org/pdf/2101.09868) &amp; [2021](https://arxiv.org/pdf/2103.10584) &amp; [2020](https://arxiv.org/pdf/1909.11957)), an oral paper at ECCV ([2024](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/02444.pdf)), and selected as an IEEE Micro Top Pick ([2023](https://arxiv.org/pdf/2206.00877)).



   Research Area(s)

[Artificial Intelligence and Machine Learning ](/research-area/machine-learning-artificial-intelligence)

[Generative AI](/research-area/generative-ai)

[Natural Language Processing](/research-area/natural-language-processing)

 

 

  

 Main Field of Interest

[Artificial Intelligence and Machine Learning ](/research-area/machine-learning-artificial-intelligence)

 

  

 Google Scholar

[https://scholar.google.com/citations?user=pt3GfXcAAAAJ&amp;hl=en&amp;oi=ao](https://scholar.google.com/citations?user=pt3GfXcAAAAJ&hl=en&oi=ao)

 

  

 

 

 



 ### Publications

 

### 2026 

[Nemotron-Labs-Diffusion: A Tri-Mode Language Model Unifying Autoregressive, Diffusion, and Self-Speculation Decoding](/publication/2026-05_nemotron-labs-diffusion-tri-mode-language-model-unifying-autoregressive)

[Yonggan Fu](/person/yonggan-fu), Lexington Whalen, Abhinav Garg, Chengyue Wu, Maksim Khadkevich, Nicolai Oswald, Enze Xie, Daniel Egert, Sharath Turuvekere Sreenivas,, Shizhe Diao, Chenhan Yu, Ye Yu, Weijia Chen, Sajad Norouzi, Jingyu Liu, Shiyi Lan, Ligeng Zhu, Jin Wang, Jindong Jiang, Morteza Mardani, Mehran Maghoumi, Song Han, Ante Jukić, Nima Tajbakhsh, Jan Kautz, [Pavlo Molchanov](/person/pavlo-molchanov)













### 2025 

[Hymba: A Hybrid-head Architecture for Small Language Models](/publication/2025-04_hymba-hybrid-head-architecture-small-language-models)

Xin Dong, [Yonggan Fu\*](/person/yonggan-fu), Shizhe Diao, [Wonmin Byeon](/person/wonmin-byeon), Zijia Chen, Ameya Sunil Mahabaleshwarkar, Shih-Yang Liu, [Matthijs Van keirsbilck](/person/matthijs-van-keirsbilck), [Min-Hung Chen](/person/min-hung-chen), [Yoshi Nishi](/person/yoshi-nishi), Yingyan Celine Lin, [Jan Kautz](/person/jan-kautz), [Pavlo Molchanov](/person/pavlo-molchanov)



[Hymba - ICLR 2025](https://jankautz.com/publications/Hymba_ICLR25.pdf)



ICLR spotlight paper