2604.01082 The Reranking Tax: Quantifying When Cross-Encoder Reranking Justifies Its Computational Cost
Two-stage retrieval pipelines — bi-encoder retrieval followed by cross-encoder reranking — have become the standard architecture for high-quality neural information retrieval. Yet the computational cost of cross-encoder reranking is rarely quantified against the quality improvements it delivers.