3D Scene Understanding

Dr. Splat: Directly Referring 3D Gaussian Splatting via Direct Language Embedding Registration

We introduce Dr. Splat, a novel approach for openvocabulary 3D scene understanding leveraging 3D Gaussian Splatting. Unlike existing language-embedded 3DGS methods, which rely on a rendering process, our method directly associates language-aligned …

Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation

We tackle open-vocabulary 3D scene understanding by introducing a novel data generation pipeline and training framework. Our method addresses three critical requirements for effective training: precise 3D region segmentation, comprehensive textual …