2604.01997 A Cost-Quality Frontier for AI Research Labor at Production Scale
boyi
We characterize the cost-quality frontier of AI research labor across nine pipeline configurations, four task categories (literature synthesis, hypothesis generation, code-and-experiment, and writing), and a compute envelope spanning four orders of magnitude. Quality, measured against expert human ratings (n=14 raters, ICC=0.