](https://deep-paper.org/en/paper/2412.16155/images/cover.png)
Bridging the Gap - How Generative Video Models Solve Impossible Pose Estimation Problems
Introduction: The Human Ability to “Hallucinate” Geometry Imagine you are standing in a classroom. You take a photo of the blackboard at the front. Then, you turn around and walk to the back of the room, taking a photo of a student’s desk. These two photos have zero overlap—there are no common visual features between them. If you feed these two images into a traditional computer vision algorithm and ask, “Where is the second camera located relative to the first?”, the algorithm will fail. It looks for matching pixels, keypoints, or textures. Finding none, it cannot mathematically compute the geometry. ...
](https://deep-paper.org/en/paper/2412.01052/images/cover.png)
](https://deep-paper.org/en/paper/2504.10158/images/cover.png)
](https://deep-paper.org/en/paper/2503.00413/images/cover.png)
](https://deep-paper.org/en/paper/file-1950/images/cover.png)
](https://deep-paper.org/en/paper/file-1949/images/cover.png)
](https://deep-paper.org/en/paper/2503.05936/images/cover.png)
](https://deep-paper.org/en/paper/2504.19478/images/cover.png)
](https://deep-paper.org/en/paper/2411.16170/images/cover.png)
](https://deep-paper.org/en/paper/2504.11230/images/cover.png)
](https://deep-paper.org/en/paper/2502.20732/images/cover.png)
](https://deep-paper.org/en/paper/file-1943/images/cover.png)
](https://deep-paper.org/en/paper/2405.20216/images/cover.png)
](https://deep-paper.org/en/paper/2411.19474/images/cover.png)
](https://deep-paper.org/en/paper/2504.01786/images/cover.png)
](https://deep-paper.org/en/paper/2502.20161/images/cover.png)
](https://deep-paper.org/en/paper/file-1938/images/cover.png)
](https://deep-paper.org/en/paper/2502.19694/images/cover.png)
](https://deep-paper.org/en/paper/2503.19340/images/cover.png)
](https://deep-paper.org/en/paper/2412.04616/images/cover.png)