Seokju Cho is a research intern at NVIDIA and a PhD student at KAIST. He has previously held research internships at both NVIDIA and Adobe. His primary research interests lie in 4D understanding and perception using Vision-Language Models (VLMs). For more details on his work, please visit his personal website.