TruthSeq: Validating Computational Gene Regulatory Predictions Against Genome-Scale Perturbation Data
Computational biology tools can find statistically significant patterns in any dataset, but many of these patterns do not replicate in experimental systems. TruthSeq is an open-source validation tool that checks gene regulatory predictions against real experimental data from the Replogle Perturb-seq atlas, which contains expression measurements from ~11,000 single-gene CRISPR knockdowns in human cells. Users supply a CSV of regulatory claims (Gene X controls Gene Y in direction Z), and TruthSeq tests each claim against up to three independent tiers of evidence: perturbation data, disease tissue expression, and genetic association scores. Each claim receives a confidence grade from VALIDATED to UNTESTABLE. The tool is designed for researchers, citizen scientists, and AI agents performing computational genomics who need a fast, independent check on whether their findings reflect real biology.
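
As an illustration of the input described above, the following minimal Python sketch parses a claims CSV and checks each claim's direction against a toy table of knockdown effects. The column names (`regulator`, `target`, `direction`), the `UP`/`DOWN` encoding, the placeholder gene names, the effect values, and the helper functions are all hypothetical; they are assumptions made for illustration, not TruthSeq's documented schema, data, or grading logic. The sketch also assumes that a claim of `UP` means the regulator activates the target, so knocking down the regulator should lower the target's expression.

```python
import csv
import io

# Hypothetical claims file. Column names and the UP/DOWN encoding are
# assumptions for illustration, not TruthSeq's documented input schema.
CLAIMS_CSV = """regulator,target,direction
GENE_A,GENE_B,UP
GENE_C,GENE_D,DOWN
"""

# Toy stand-in for perturbation data: log2 fold change of the target
# after knocking down the regulator. The value is made up.
KNOCKDOWN_LOG2FC = {("GENE_A", "GENE_B"): -1.2}


def load_claims(text):
    """Parse the claims CSV into a list of dicts, one per claim."""
    return list(csv.DictReader(io.StringIO(text)))


def check_direction(claim, effects):
    """Compare a claimed direction with the observed knockdown effect.

    Assumption: 'UP' means the regulator activates the target, so a
    knockdown should give a negative log2 fold change; 'DOWN' means it
    represses the target, so a knockdown should give a positive one.
    """
    key = (claim["regulator"], claim["target"])
    if key not in effects:
        return "no_data"
    log2fc = effects[key]
    expected_negative = claim["direction"] == "UP"
    observed_negative = log2fc < 0
    return "consistent" if expected_negative == observed_negative else "inconsistent"


results = [check_direction(c, KNOCKDOWN_LOG2FC) for c in load_claims(CLAIMS_CSV)]
print(results)
```

The sketch covers only a direction check against a single evidence source. It does not attempt to reproduce the three evidence tiers or the confidence grades, whose criteria are not specified here.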


