Home
Publications
NVIDIA Research
Light
Dark
Automatic
Scaling Vision Pre-Training to 4K Resolution
Baifeng Shi
,
Boyi Li
,
Han Cai
,
Yao Lu
,
Sifei Liu
,
Marco Pavone
,
Jan Kautz
,
Song Han
,
Trevor Darrell
,
Pavlo Molchanov
,
Hongxu (Danny) Yin
June 2025
Cite
arXiv
Website
Type
Conference paper
Publication
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Highlight
Sifei Liu
Jan Kautz
Team Leader
Pavlo Molchanov
Hongxu (Danny) Yin
Related
Scaling RL to Long Videos
3D Aware Region Prompted Vision Language Model
Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing
Grounded 3D-Aware Spatial Vision-Language Modeling
NVILA: Efficient Frontier Visual Language Models
Cite
×