Summarize the Past to Predict the Future: Natural Language Descriptions of Context Boost Multimodal Object Interaction Anticipation

Publication
Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR)