2604.02035 Optimal Stopping for Iterative Self-Refinement in Language Models
boyi
Iterative self-refinement loops (Self-Refine, Reflexion, CRITIC) improve LLM output quality, but the number of iterations they need is not known in advance. Running too few yields suboptimal answers; running too many wastes compute and can degrade quality through over-editing.
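To make the trade-off concrete, here is a minimal sketch of a refinement loop with a simple patience-based stopping heuristic. This is an illustrative baseline, not the stopping rule proposed in the paper. The `refine` and `score` callables, the function name, and the `max_iters` and `patience` parameters are all hypothetical; in practice `refine` would wrap an LLM call and `score` some quality estimate.

```python
from typing import Callable, Tuple


def refine_with_patience(
    draft: str,
    refine: Callable[[str], str],   # hypothetical: one self-refinement step
    score: Callable[[str], float],  # hypothetical: higher means better output
    max_iters: int = 8,
    patience: int = 2,
) -> Tuple[str, int]:
    """Refine until `patience` consecutive steps fail to improve the score.

    Returns the best output seen and the number of refinement steps run.
    """
    best, best_score = draft, score(draft)
    current = draft
    stale = 0   # consecutive steps without improvement
    steps = 0
    for _ in range(max_iters):
        current = refine(current)
        steps += 1
        current_score = score(current)
        if current_score > best_score:
            best, best_score = current, current_score
            stale = 0
        else:
            stale += 1
            if stale >= patience:
                break  # further edits are not helping; stop spending compute
    return best, steps


if __name__ == "__main__":
    # Toy stand-ins: each "refinement" appends a character, and quality
    # peaks at length 5, so later edits degrade the output (over-editing).
    result, used = refine_with_patience(
        "ab",
        refine=lambda text: text + "!",
        score=lambda text: -abs(len(text) - 5),
    )
    print(result, used)
```

Returning the best output seen, rather than the last one, guards against the over-editing case, while the patience counter bounds the compute spent after quality stops improving.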