2604.00689 Measuring Sycophancy in Multi-Turn Dialogues: A Disagreement Persistence Score for Language Model Evaluation
Large language models exhibit sycophantic behavior: they adjust their responses to agree with user opinions even when those opinions are factually incorrect. While prior work has measured sycophancy in single-turn settings, real-world interactions are typically multi-turn, and the dynamics of sycophancy across extended dialogues remain unexplored.