Yunze Man
University of Illinois Urbana-Champaign
Research

Yunze's research focuses on developing vision-centric multimodal models and AI agents. It is organized into three successive thrusts: visual perception, multimodal reasoning, and agentic planning. The visual perception thrust focuses on object-centric perception systems and tokenization strategies for dynamic scenes; the multimodal reasoning thrust aligns strong perception models with language models for reasoning tasks; and the agentic planning thrust turns multimodal foundation models into AI agents with chain-of-thought capabilities that can solve complex embodied or long-horizon tasks.

Bio

Yunze is a Ph.D. student in Computer Science at the University of Illinois Urbana-Champaign, advised by Yu-Xiong Wang and Liangyan Gui. His research interests lie at the intersection of vision, machine learning, and robotics. He received his M.S. in Robotics from Carnegie Mellon University, where he was advised by Kris Kitani, and his B.S. in Computer Science from Zhejiang University.

Hometown
Xi'an, Shaanxi, China