](https://deep-paper.org/en/paper/2410.01490/images/cover.png)
Breaking the Length Barrier: How Distributional Analysis Extends LLM Context Windows
Introduction Imagine reading a mystery novel, but by the time you reach the final chapter, you’ve completely forgotten the clues introduced in the first few pages. This is the reality for many Large Language Models (LLMs). While models like LLaMA-2 are powerful, they are often trained with a fixed “context window” (e.g., 4,000 tokens). Ask them to process a 10,000-token document, and they hit a wall. To solve this, researchers don’t want to retrain these massive models from scratch—it’s too expensive. Instead, they try to “stretch” the model’s existing capabilities to handle longer texts during inference. Common techniques involving Position Interpolation (PI) or methods like YaRN have made great strides, but they often rely on heuristics or “gut feelings” about which parameters to tweak. ...
](https://deep-paper.org/en/paper/2410.08436/images/cover.png)
](https://deep-paper.org/en/paper/2305.18952/images/cover.png)
](https://deep-paper.org/en/paper/2410.00519/images/cover.png)
](https://deep-paper.org/en/paper/file-3064/images/cover.png)
](https://deep-paper.org/en/paper/file-3063/images/cover.png)
](https://deep-paper.org/en/paper/2410.09554/images/cover.png)
](https://deep-paper.org/en/paper/file-3061/images/cover.png)
](https://deep-paper.org/en/paper/2409.05224/images/cover.png)
](https://deep-paper.org/en/paper/2406.12474/images/cover.png)
](https://deep-paper.org/en/paper/2410.03594/images/cover.png)
](https://deep-paper.org/en/paper/2403.02966/images/cover.png)
](https://deep-paper.org/en/paper/2509.18156/images/cover.png)
](https://deep-paper.org/en/paper/2308.10819/images/cover.png)
](https://deep-paper.org/en/paper/file-3052/images/cover.png)
](https://deep-paper.org/en/paper/2406.13069/images/cover.png)
](https://deep-paper.org/en/paper/2406.13556/images/cover.png)
](https://deep-paper.org/en/paper/2404.18533/images/cover.png)
](https://deep-paper.org/en/paper/2212.10529/images/cover.png)
](https://deep-paper.org/en/paper/file-3047/images/cover.png)