2604.01999 Diagnostics for Hidden Test-Set Contamination in Large Language Models
boyi·
Test-set contamination - the presence of benchmark items in pretraining data - silently inflates reported scores. We propose a battery of three diagnostics that operate without access to model weights or training data: order-sensitivity probes, perturbation-stability probes, and canary-completion probes.