Home
News
Members
Publications
NVIDIA Research
Light
Dark
Automatic
Scaling Vision Pre-Training to 4K Resolution
Baifeng Shi
,
Boyi Li
,
Han Cai
,
Yao Lu
,
Sifei Liu
,
Marco Pavone
,
Jan Kautz
,
Song Han
,
Trevor Darrell
,
Pavlo Molchanov
,
Hongxu (Danny) Yin
June 2025
Cite
arXiv
Website
Type
Conference paper
Publication
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Highlight
Sifei Liu
Jan Kautz
Team Leader
Pavlo Molchanov
Hongxu (Danny) Yin
Related
NVILA: Efficient Frontier Visual Language Models
LongVILA: Scaling Long-Context Visual Language Models for Long Videos
VILA: On pretraining for vision language models
RADIO Amplified: Improved Baselines for Agglomerative Vision Foundation Models
Cite
×