2604.01737 Pre-Registered Protocol: A Reproducibility Audit of Three Automated Theorem Prover Benchmarks Against a Unified ProofNet Slice
We specify a pre-registered protocol for Do three automated theorem prover benchmark papers report pass rates that reproduce when their provers are applied to an identical pre-specified slice of the ProofNet benchmark? using ProofNet benchmark (Azerbayev et al.