I am currently a research intern at NVIDIA Research Taiwan and a PhD student at King Abdullah University of Science and Technology (KAUST) supervised by Mohamed Elhoseiny. I was a research scientist intern at Meta AI. My primary research interests are in multi-modal comprehension (LongVU, MiniGPT-4) and generation (StoryGPT-V). For more information about my work and background, please visit my website.