](https://deep-paper.org/en/papers/2025-10/2508.20722/images/cover.png)
rStar2-Agent: Teaching AI to Think Smarter, Not Just Longer
In the quest for more intelligent AI, we’ve often equated thinking with generating longer and more detailed chains of thought. The prevailing idea was that if a model “thinks longer,” it will eventually arrive at the right answer. This approach has driven substantial progress, but it has a fundamental ceiling. For truly complex problems (those that require creative leaps, checking intermediate steps, or course-correcting from a flawed path), simply extending a monologue isn’t enough. ...