Home
News
Members
Publications
NVIDIA Research
Light
Dark
Automatic
Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
Min Shi
,
Fuxiao Liu
,
Shihao Wang
,
Shijia Liao
,
Subhashree Radhakrishnan
,
De-An Huang
,
Hongxu (Danny) Yin
,
Karan Sapra
,
Yaser Yacoob
,
Humphrey Shi
,
Bryan Catanzaro
,
Andrew Tao
,
Jan Kautz
,
Guilin Liu
,
Zhiding Yu
April 2025
Cite
arXiv
Type
Conference paper
Publication
International Conference on Learning Representations (ICLR)
De-An Huang
Hongxu (Danny) Yin
Jan Kautz
Team Leader
Zhiding Yu
Related
LITA: Language Instructed Temporal-localization Assistant
Partial Convolution for Padding, Inpainting, and Image Synthesis
Transposer: Universal Texture Synthesis Using Feature Maps as Transposed Convolution Filter
Argus: Vision-Centric Reasoning with Grounded Chain-of-Thought
OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counter Factual Reasoning
Cite
×