NVIDIA Research
VILA: On pretraining for vision language models
Ji Lin, Hongxu (Danny) Yin, Wei Ping, Yao Lu, Pavlo Molchanov, Andrew Tao, Huizi Mao, Jan Kautz, Mohammad Shoeybi, Song Han
June 2024
Type: Conference paper
Publication: IEEE Conference on Computer Vision and Pattern Recognition (CVPR)