Home
Publications
NVIDIA Research
Light
Dark
Automatic
Token-Efficient VLM: High-Resolution Image Understanding via Dynamic Region Proposal
Yitong Jiang
,
Jinwei Gu
,
Tianfan Xue
,
Ka Chun Cheung
,
Pavlo Molchanov
,
Hongxu (Danny) Yin
,
Sifei Liu
October 2025
Cite
Pdf
Type
Conference paper
Publication
IEEE International Conference on Computer Vision (ICCV)
Pavlo Molchanov
Hongxu (Danny) Yin
Sifei Liu
Related
Scaling Parallel Sequence Models to Vision Foundation Models
GSPN-2: Efficient Parallel Sequence Modeling
Cite
×