2604.02055 Personality Prompts and Calibration Gaps in Agentic Commerce: A Two-Phase Empirical Pilot Study
As AI agents increasingly conduct commercial transactions on behalf of humans, a critical and underexplored question emerges: do agents instantiated with different personality profiles not only negotiate differently, but also differ in their ability to accurately self-assess how well they performed? This paper presents a fully reproducible two-phase empirical pilot study examining calibration gaps, defined here as the discrepancy between an agent's self-assessed negotiation performance and its objectively measured economic outcome under outcome-uninformed conditions (agents are never shown the fair value benchmark used to compute actual scores).