  Chi-Pin Huang  

 



  ![](/sites/default/files/person/%E5%A4%A7%E9%A0%AD%E8%B2%BC.jpg)

  

 Chi-Pin Huang is a Research Scientist at [NVIDIA Research Taiwan](https://research.nvidia.com/labs/twn/). His research focuses on Vision-Language Generative Models and Vision-Language-Action Models (VLAs), with particular interest in bridging perception, generation, and decision-making. He received his Ph.D. degree from [National Taiwan University](https://www.ntu.edu.tw/english/) in 2026 under the supervision of [Prof. Yu-Chiang Frank Wang](https://vllab.ee.ntu.edu.tw/ycwang.html), and earned his B.S. degree in [Computer Science from National Taiwan University](https://www.csie.ntu.edu.tw/) in 2022.

\[[Personal Page](https://jasper0314-huang.github.io/)\]



   Research Area(s)

[Artificial Intelligence and Machine Learning ](/research-area/machine-learning-artificial-intelligence)

[Computer Vision](/research-area/computer-vision)

[Robotics](/research-area/robotics)

 

 

  

 Main Field of Interest

[Artificial Intelligence and Machine Learning ](/research-area/machine-learning-artificial-intelligence)

 

  

 Google Scholar

<https://scholar.google.com/citations?user=s8-yTSwAAAAJ>

 

  

 

 

 



 ### Publications

 

### 2025 

[ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning](/index.php/publication/2025-12_thinkact-vision-language-action-reasoning-reinforced-visual-latent-planning)

[Chi-Pin Huang](/index.php/person/chi-pin-huang), Yueh-Hua Wu, [Min-Hung Chen](/index.php/person/min-hung-chen), [Frank Wang](/index.php/person/frank-wang), [Fred Yang](/index.php/person/fred-yang)



[Neural Information Processing Systems (NeurIPS) 2025](https://arxiv.org/pdf/2507.16815)