Home
Publications
NVIDIA Research
Light
Dark
Automatic
Scaling Parallel Sequence Models to Vision Foundation Models
Yitong Jiang
,
Collin McCarthy
,
Hongjun Wang
,
Hanrong Ye
,
Qi Dou
,
Tianfan Xue
,
Jinwei Gu
,
Jan Kautz
,
Hongxu (Danny) Yin
,
Pavlo Molchanov
,
Sifei Liu
June 2026
Cite
Pdf
Type
Conference paper
Publication
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Jan Kautz
Team Leader
Hongxu (Danny) Yin
Pavlo Molchanov
Sifei Liu
Related
GSPN-2: Efficient Parallel Sequence Modeling
Token-Efficient VLM: High-Resolution Image Understanding via Dynamic Region Proposal
OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM
Scaling RL to Long Videos
3D Aware Region Prompted Vision Language Model
Cite
×