Efficient AI
CVPR2025
CoT-VLA: Visual Chain-of-Thought Reasoning for Vision-Language-Action Models
Vision-language-action models (VLAs) have shown potential in leveraging pre-trained vision-language models and diverse robot …
Qingqing Zhao, Yao (Jason) Lu, Moo Jin Kim, Zipeng Fu, Zhuoyang Zhang, Yecheng Wu, Max Li, Qianli Ma, Song Han, Chelsea Finn, Ankur Handa, Ming-Yu Liu, Donglai Xiang, Gordon Wetzstein, Tsung-Yi Lin
PS3: Vision Pre-Training at 4K Resolution
High-resolution perception of visual details is crucial for daily tasks. Current vision pre-training, however, is still limited to low …
Baifeng Shi, Boyi Li, Han Cai, Yao (Jason) Lu, Sifei Liu, Marco Pavone, Jan Kautz, Song Han, Trevor Darrell, Pavlo Molchanov, Hongxu Yin
NVILA: Efficient Frontier Visual Language Models
Visual language models (VLMs) have made significant advances in accuracy in recent years. However, their efficiency has received much …
Zhijian Liu, Ligeng Zhu, Baifeng Shi, Zhuoyang Zhang, Yuming Lou, Shang Yang, Haocheng Xi, Shiyi Cao, Yuxian Gu, Dacheng Li, Xiuyu Li, Yunhao Fang, Yukang Chen, Cheng-Yu Hsieh, De-an Huang, An-Chieh Cheng, Vishwesh Nath, Jinyi Hu, Sifei Liu, Ranjay Krishna, Daguang Xu, Xiaolong Wang, Pavlo Molchanov, Jan Kautz, Hongxu Yin, Song Han, Yao (Jason) Lu