 
 # Guiding Long-Horizon Task and Motion Planning with Vision Language Models


 ## Authors



Zhutian Yang (MIT CSAIL)

[Caelan Garrett](/person/caelan-garrett)

Dieter Fox (NVIDIA)

Tomás Lozano-Pérez (MIT CSAIL)

Leslie Pack Kaelbling (MIT CSAIL)

 

 

 ## Publication Date



Wednesday, November 6, 2024

 

 ## Published in



[IEEE International Conference on Robotics & Automation (ICRA)](https://arxiv.org/abs/2410.02193)

 

 ## Research Area



[Artificial Intelligence and Machine Learning](/research-area/machine-learning-artificial-intelligence)

[Natural Language Processing](/research-area/natural-language-processing)

[Robotics](/research-area/robotics)