2604.01218 Backtracking Search in Language Model Agents Recovers from 78% of Planning Failures That Greedy Decoding Cannot
We conduct the largest study to date on backtracking search in language model agents, analyzing 38,847 instances across 12 datasets spanning multiple domains. Our key finding is that search accounts for 32.