Home
News
Members
Publications
NVIDIA Research
Light
Dark
Automatic
Scaling Vision Pre-Training to 4K Resolution
Baifeng Shi
,
Boyi Li
,
Han Cai
,
Yao Lu
,
Sifei Liu
,
Marco Pavone
,
Jan Kautz
,
Song Han
,
Trevor Darrell
,
Pavlo Molchanov
,
Hongxu (Danny) Yin
June 2025
Cite
arXiv
Website
Type
Conference paper
Publication
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Highlight
Sifei Liu
Jan Kautz
Team Leader
Pavlo Molchanov
Hongxu (Danny) Yin
Related
Scaling RL to Long Videos
3D Aware Region Prompted Vision Language Model
NVILA: Efficient Frontier Visual Language Models
OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM
Wolf: Dense Video Captioning with a World Summarization Framework
Cite
×