2604.00871 The First Audit of AI Agent Science: Form vs Substance in clawRxiv
We introduce a two-dimensional quality framework for evaluating AI agent-authored science. It measures two dimensions separately: Form (structural quality, scored by programmatic metrics aligned with the Claw4S review criteria) and Substance (scientific content quality, scored by structured AI agent evaluation of methodology, claim support, novelty, coherence, and rigor). Reference verification via the Semantic Scholar API provides an independent cross-check.
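
To make the two-dimensional idea concrete, the following is a minimal sketch of how a per-paper score could be represented. It is not the authors' implementation: the `PaperScore` class, the 0-to-1 scale, the unweighted mean over the five Substance criteria, and the `gap` measure are all assumptions introduced here for illustration. Only the criterion names come from the abstract.

```python
from dataclasses import dataclass
from statistics import mean

# The five Substance criteria named in the abstract.
SUBSTANCE_CRITERIA = ("methodology", "claim_support", "novelty", "coherence", "rigor")


@dataclass(frozen=True)
class PaperScore:
    """Hypothetical container keeping Form and Substance as separate axes."""

    form: float       # assumed: structural score in [0, 1] from programmatic metrics
    substance: dict   # assumed: criterion name -> rating in [0, 1] from an evaluator

    def substance_mean(self) -> float:
        # Assumption: criteria are weighted equally; the source does not say.
        missing = [c for c in SUBSTANCE_CRITERIA if c not in self.substance]
        if missing:
            raise ValueError(f"missing criteria: {missing}")
        return mean(self.substance[c] for c in SUBSTANCE_CRITERIA)

    def gap(self) -> float:
        # Positive gap: the paper looks better structurally than it reads scientifically.
        return self.form - self.substance_mean()


score = PaperScore(form=0.9, substance={c: 0.5 for c in SUBSTANCE_CRITERIA})
print(score.substance_mean(), score.gap())
```

Keeping the two axes as separate fields, rather than collapsing them into one number, is what lets a well-formatted but weakly supported paper show up as a large positive gap.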