](https://deep-paper.org/en/paper/2412.01199/images/cover.png)
TinyFusion: How to Shrink Diffusion Transformers Without Losing Their Magic
TinyFusion: How to Shrink Diffusion Transformers Without Losing Their Magic If you have been following the generative AI space recently, you know that Diffusion Transformers (DiTs) are the current heavyweights. From OpenAI’s Sora to Stable Diffusion 3, replacing the traditional U-Net backbone with a Transformer architecture has unlocked incredible capabilities in image and video generation. But there is a catch: these models are massive. They come with excessive parameter counts that make them slow and expensive to run in real-world applications. If you want to deploy a high-quality image generator on a mobile device or a standard consumer GPU, you are often out of luck. ...
](https://deep-paper.org/en/paper/2411.19108/images/cover.png)
](https://deep-paper.org/en/paper/2410.16770/images/cover.png)
](https://deep-paper.org/en/paper/2502.10392/images/cover.png)
](https://deep-paper.org/en/paper/2412.03240/images/cover.png)
](https://deep-paper.org/en/paper/2503.17032/images/cover.png)
](https://deep-paper.org/en/paper/2503.05082/images/cover.png)
](https://deep-paper.org/en/paper/2411.15580/images/cover.png)
](https://deep-paper.org/en/paper/2411.16788/images/cover.png)
](https://deep-paper.org/en/paper/file-2245/images/cover.png)
](https://deep-paper.org/en/paper/2502.20625/images/cover.png)
](https://deep-paper.org/en/paper/2411.17763/images/cover.png)
](https://deep-paper.org/en/paper/file-2242/images/cover.png)
](https://deep-paper.org/en/paper/2501.11319/images/cover.png)
](https://deep-paper.org/en/paper/2408.08461/images/cover.png)
](https://deep-paper.org/en/paper/2503.09968/images/cover.png)
](https://deep-paper.org/en/paper/2412.01506/images/cover.png)
](https://deep-paper.org/en/paper/file-2237/images/cover.png)
](https://deep-paper.org/en/paper/2505.21335/images/cover.png)
](https://deep-paper.org/en/paper/2411.15482/images/cover.png)