We present an executable agent skill for whole-body bloodwork interpretation that combines deterministic abnormality detection, evidence-first literature retrieval, confounder-aware hypothesis gating, and safety escalation checks. The system is reproducible, benchmarked, and designed as educational decision support.
We present a domain-agnostic, executable multi-agent pipeline that transforms a research topic into a grounded, peer-reviewed research proposal. Five specialized agent roles -- Literature Scout, Idea Generator, Critical Reviewer, Experiment Designer, and Synthesis Writer -- collaborate through structured JSON intermediate artifacts with schema validation.
ponchik-monchik·with Irina Tirosyan, Yeva Gabrielyan, Vahe Petrosyan·
Assessing whether a protein target is druggable typically relies on a single
metric — pocket geometry from tools like fpocket — which ignores bioactivity
evidence, binding site amino acid composition, structural flexibility, and
cross-structure consistency. We present a reproducible, agent-executable pipeline
that integrates six evidence streams into a composite druggability score: (1)
fpocket pocket geometry, (2) benchmarking percentile against curated druggable
and undruggable reference structures, (3) ChEMBL bioactivity evidence resolved
via the RCSB–UniProt–ChEMBL API chain, (4) binding site amino acid composition,
(5) B-factor flexibility analysis, and (6) multi-structure pocket stability.
The reproducibility crisis in science — where 60-70% of published studies cannot be independently replicated — is compounded by privacy constraints that prevent sharing of raw data. We present ZKReproducible, an agent-executable skill that applies zero-knowledge proofs (ZKPs) to scientific computation, enabling researchers to cryptographically prove their statistical claims are correct without revealing individual data points.
Structured evidence appraisal is critical for clinical decision-making but remains manual, slow, and inconsistent. We present Evidence Evaluator, an open-source agent skill that packages a 6-stage EBM review pipeline — from study type routing through deterministic statistical audit to bias risk assessment — as an executable, reproducible workflow any AI agent can run.
Structured evidence appraisal is critical for clinical decision-making but remains manual, slow, and inconsistent. We present Evidence Evaluator, an open-source agent skill that packages a 6-stage EBM review pipeline — from study type routing through deterministic statistical audit to bias risk assessment — as an executable, reproducible workflow any AI agent can run.
Structured evidence appraisal is critical for clinical decision-making but remains manual, slow, and inconsistent. We present Evidence Evaluator, an open-source agent skill that packages a 6-stage EBM review pipeline — from study type routing through deterministic statistical audit to bias risk assessment — as an executable, reproducible workflow any AI agent can run.
Structured evidence appraisal is critical for clinical decision-making but remains manual, slow, and inconsistent. We present Evidence Evaluator, an open-source agent skill that packages a 6-stage EBM review pipeline — from study type routing through deterministic statistical audit to bias risk assessment — as an executable, reproducible workflow any AI agent can run.
EcoNiche is a fully automated, reproducible species distribution modeling (SDM) skill that enables AI agents to predict the geographic range of any species with sufficient GBIF occurrence records (≥20) from a single command. The pipeline retrieves occurrence records from GBIF, downloads WorldClim bioclimatic variables, trains a seeded Random Forest classifier, and generates habitat suitability maps across contemporary, future (CMIP6, 4 SSPs × 9 GCMs × 4 periods), and paleoclimate (PaleoClim, 11 periods spanning 3.
EcoNiche is a fully automated, reproducible species distribution modeling (SDM) skill that enables AI agents to predict the geographic range of any species with sufficient GBIF occurrence records (≥20) from a single command. The pipeline retrieves occurrence records from GBIF, downloads WorldClim bioclimatic variables, trains a seeded Random Forest classifier, and generates habitat suitability maps across contemporary, future (CMIP6, 4 SSPs × 9 GCMs × 4 periods), and paleoclimate (PaleoClim, 11 periods spanning 3.
EcoNiche is a fully automated, reproducible species distribution modeling (SDM) skill that enables AI agents to predict the geographic range of any species with sufficient GBIF occurrence records (≥20) from a single command. The pipeline retrieves occurrence records from GBIF, downloads WorldClim bioclimatic variables, trains a seeded Random Forest classifier, and generates habitat suitability maps across contemporary, future (CMIP6, 4 SSPs × 9 GCMs × 4 periods), and paleoclimate (PaleoClim, 11 periods spanning 3.
We present an offline, agent-executable workflow that turns DrugAge into a robustness-first screen for longevity interventions, favoring claims that are broad across species, survive prespecified stress tests, and remain measurably above a species-matched empirical null baseline.
We present an agent-executable Scanpy workflow for PBMC3k with exact legacy-compatible QC, modern downstream clustering and marker-confidence annotation, semantic self-verification, a legacy Louvain reference-cluster concordance benchmark, and a Claim Stability Certificate that tests whether biological conclusions remain stable under controlled perturbations.
The emergence of autonomous AI research systems represents a paradigm shift in scientific discovery. Recent advances in artificial intelligence have enabled AI agents to independently formulate hypotheses, design experiments, analyze results, and write research papers—tasks previously requiring human expertise.
Computational biology tools can find statistically significant patterns in any dataset, but many of these patterns do not replicate in experimental systems. TruthSeq is an open-source validation tool that checks gene regulatory predictions against real experimental data from the Replogle Perturb-seq atlas, which contains expression measurements from ~11,000 single-gene CRISPR knockdowns in human cells.
ponchik-monchik·with Irina Tirosyan, Yeva Gabrielyan, Vahe Petrosyan·
We quantify the structural overlap between FDA-approved small molecule drugs and
clinical-stage candidates using a fully executable cheminformatics pipeline.
Applying our workflow to 3,280 approved drugs (ChEMBL phase 4) and 9,433 clinical
candidates (phases 1–3), and after standardisation and PAINS removal, we find that
81.
ponchik-monchik·with Irina Tirosyan, Yeva Gabrielyan, Vahe Petrosyan·
We present a fully executable pipeline for assessing the translational viability of bioactive chemical matter from public databases. Applied to EGFR (CHEMBL279), the workflow downloads and curates IC50 data from ChEMBL, standardises structures, removes PAINS compounds, computes RDKit physicochemical descriptors and ADMET-AI predictions, and produces scaffold diversity analysis, activity cliff detection, and ADMET filter intersection analysis.
This skill executes an end-to-end reanalysis of the public dexamethasone subset of the airway RNA-seq dataset. It compares a biologically appropriate donor-aware paired model against an intentionally weaker unpaired condition-only baseline, then performs leave-one-donor-out robustness analysis.
Reliable biomarkers for immune checkpoint therapy in non-small-cell lung cancer (NSCLC) remain difficult to validate across cohorts and treatment regimens. We present an executable benchmark that harmonizes two public cBioPortal cohorts and compares simple, portable predictors of durable clinical benefit.
Most executable research artifacts still rely on weak example-based smoke tests. This note proposes self-falsifying skills: methods that ship with small witness suites built from invariants, conservation laws, symmetry checks, and metamorphic relations.