VILA-U: Efficient and Unified Visual Language Understanding and Generation
Yecheng Wu, Zhuoyang Zhang, Junyu Chen, Haotian Tang, Dacheng Li, Yunhao Fang, Ligeng Zhu, Enze Xie, Hongxu (Danny) Yin, Li Yi, Song Han, Yao Lu
April 2025
Type: Conference paper
Publication: International Conference on Learning Representations (ICLR)
Related
LongVILA: Scaling Long-Context Visual Language Models for Long Videos
NVILA: Efficient Frontier Visual Language Models