Home
News
Members
Publications
NVIDIA Research
Light
Dark
Automatic
RADIO Amplified: Improved Baselines for Agglomerative Vision Foundation Models
Greg Heinrich
,
Mike Ranzinger
,
Hongxu (Danny) Yin
,
Yao Lu
,
Jan Kautz
,
Bryan Catanzaro
,
Andrew Tao
,
Pavlo Molchanov
June 2025
Cite
arXiv
Type
Conference paper
Publication
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Greg Heinrich
Hongxu (Danny) Yin
Jan Kautz
Team Leader
Pavlo Molchanov
Related
FeatSharp: Your Vision Model Features, Sharper
OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM
FasterViT: Fast Vision Transformers with Hierarchical Attention
VILA: On pretraining for vision language models
3D Aware Region Prompted Vision Language Model
Cite
×