Home
News
Members
Publications
NVIDIA Research
Light
Dark
Automatic
De-An Huang
NVIDIA
Interests
Video Understanding
Embodied AI
Latest
I^2SB: Image-to-Image Schrödinger Bridge
Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning
MinVIS: A minimal video instance segmentation framework without video-based training
Test-time prompt tuning for zero-shot generalization in vision-language models
Cite
×