Do What You Say: Steering Vision-Language-Action Models via Runtime Reasoning-Action Alignment Verification

Publication
Proceedings of the IEEE International Conference on Robotics and Automation (ICRA)
Yilin Wu
Yilin Wu
CMU