](https://deep-paper.org/en/paper/2502.16652/images/cover.png)
Dr. Splat: A Prescription for Faster, Semantic 3D Scene Understanding
Imagine walking into a room and asking a robot, “Find the red mug near the sink.” To us, this is effortless. To a computer vision system, it requires bridging the gap between 2D visual data, 3D spatial geometry, and natural language. This is the challenge of Open-Vocabulary 3D Scene Understanding. In recent years, 3D Gaussian Splatting (3DGS) has revolutionized how we represent 3D scenes. It offers high-quality rendering by representing scenes as millions of 3D Gaussian blobs. However, attaching semantic meaning (language) to these blobs has been a bottleneck. Existing methods rely on rendering 2D feature maps to “teach” the 3D model what it is looking at. This process is computationally expensive, slow to search, and often results in blurry or inaccurate semantic features. ...
](https://deep-paper.org/en/paper/2412.05826/images/cover.png)
](https://deep-paper.org/en/paper/file-1991/images/cover.png)
](https://deep-paper.org/en/paper/2502.20256/images/cover.png)
](https://deep-paper.org/en/paper/2411.18180/images/cover.png)
](https://deep-paper.org/en/paper/2504.08541/images/cover.png)
](https://deep-paper.org/en/paper/file-1985/images/cover.png)
](https://deep-paper.org/en/paper/file-1984/images/cover.png)
](https://deep-paper.org/en/paper/2503.08257/images/cover.png)
](https://deep-paper.org/en/paper/file-1982/images/cover.png)
](https://deep-paper.org/en/paper/2503.07978/images/cover.png)
](https://deep-paper.org/en/paper/2409.02095/images/cover.png)
](https://deep-paper.org/en/paper/2503.13985/images/cover.png)
](https://deep-paper.org/en/paper/file-1978/images/cover.png)
](https://deep-paper.org/en/paper/2503.00643/images/cover.png)
](https://deep-paper.org/en/paper/2503.23751/images/cover.png)
](https://deep-paper.org/en/paper/2502.20653/images/cover.png)
](https://deep-paper.org/en/paper/2503.18402/images/cover.png)
](https://deep-paper.org/en/paper/2411.08227/images/cover.png)
](https://deep-paper.org/en/paper/2503.08344/images/cover.png)