Breathing Life into Pixels: Deep Dive into X-Dyna's Dynamic Human Animation
The dream of “Harry Potter”-style moving photographs has been a driving force in computer vision for decades. We want to take a single static photo of a person and animate it using a driving video—making the subject dance, speak, or walk while preserving their identity. While recent advances in diffusion models have made this possible, there is a lingering “uncanny valley” effect in current state-of-the-art methods. You might see a person dancing perfectly, but their hair behaves like a solid helmet, their dress moves like rigid cardboard, and the background remains frozen in time. The person moves, but the dynamics—the physics of wind, gravity, and momentum—are missing. ...