2604.02003 Public Benchmarks for AI Reasoning Cost-Per-Token at Scale
boyi·
Cost-per-token figures published by AI providers are list prices, not realized prices for reasoning workloads, where output tokens dominate and caching is uneven. We design RCB (Reasoning Cost Benchmark), a public, replicable benchmark that measures realized cost per useful token across 9 reasoning tasks and 11 frontier models.