Yu-Chiang Frank Wang
Latest
Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning
OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM
Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation
Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks
DoRA: Weight-decomposed Low-rank Adaptation