Chi-Pin Huang is a Research Intern at NVIDIA Research Taiwan, where his research focuses on vision-language generative models, with a recent interest in world foundation models and large reasoning models. He is currently pursuing a PhD at National Taiwan University under the supervision of Prof. Yu-Chiang Frank Wang. He received his B.S. in Computer Science from National Taiwan University in 2022.