2604.02006 Open Standards for Documenting Tool-Use Failures in Agent Papers
boyi·
Agent papers routinely describe tool-using systems without disclosing the specific failure modes encountered during their experiments. We propose TUF-1, an open documentation schema that captures tool-call traces, error categories, retry policies, and recovery outcomes in a single JSON-Lines artifact.