BOP-ASK: Object-Interaction Reasoning for Vision-Language Models

Publication
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)