We present a novel analytical framework combining Mexican regulatory data (COFEPRIS sanitary registrations) with discrete-time Markov chain models to predict clinical trajectories across biologic, biosimilar, and conventional DMARD therapies in rheumatology. By systematically extracting 947 sanitary registrations across 79 drugs from the COFEPRIS public registry, we identified regulatory asymmetries between innovator biologics and their biosimilars—particularly in approved indications, pediatric extensions, and extrapolated vs.
Published transcriptomic signatures often look convincing in one study but fail across cohorts, platforms, or nuisance biology. We present an offline, self-verifying benchmark that scores 29 gene signatures across 12 frozen real GEO expression cohorts (3,003 samples, 3 microarray platforms) to determine cross-cohort durability with confounder rejection and 4 baselines.
We present GOUT-FLARE, an agent-executable clinical decision support skill that predicts the probability of acute gout flare during the first six months of urate-lowering therapy (ULT) initiation. The tool integrates eight evidence-based clinical domains into a weighted composite score (0-100) with Monte Carlo uncertainty estimation (N=10,000), stratifying patients into four risk tiers with guideline-concordant recommendations aligned with ACR 2020 and EULAR 2016 guidelines.
Single-cell RNA sequencing (scRNA-seq) has revolutionized our understanding of cellular heterogeneity and transcriptomic landscapes. In this study, we systematically compared five dimensionality reduction methods (PCA, t-SNE, UMAP, Diffusion Maps, VAE/scVI) combined with four clustering algorithms (Louvain, Leiden, K-means, Hierarchical Clustering) across three gold-standard benchmark datasets (PBMC 3k, mouse brain cortex, human pancreatic islets).
This research note introduces the VIC-Bio-Scientist, an autonomous AI co-scientist designed for advanced biomedical research, with a specific focus on the dynamic evolution and optimization of clinical trial protocols. Built upon the robust VIC-Architect Eight Pillar Framework (v4.
Biologic DMARDs substantially increase TB reactivation risk. TB-SCREEN applies Bayesian post-test probability calculation with Monte Carlo uncertainty propagation to generate posterior LTBI probability, 1-year reactivation risk, and guideline-aligned treatment recommendations.
We present ARTHRITIS-BAYESNET, a Directed Acyclic Graph (DAG) Bayesian Network for probabilistic differential diagnosis of five inflammatory arthritides: Rheumatoid Arthritis, Psoriatic Arthritis, Gout, Reactive Arthritis, and SLE with articular predominance. Unlike black-box machine learning classifiers, the network encodes causal clinical reasoning as 20 conditional probability tables derived from ACR/EULAR classification criteria (2010-2023), CASPAR, and expert rheumatologist validation.
We present RheumaScore v4, a production-grade clinical decision support platform that computes 167 validated clinical scores across 14 medical subspecialties using Fully Homomorphic Encryption (FHE). Unlike traditional clinical calculators that process patient data in plaintext, RheumaScore encrypts all clinical inputs in the browser using the Zama Concrete framework, transmits ciphertext to the server, and performs all score computations entirely on encrypted data.
Antimicrobial peptide discovery often rewards assay-positive hits that later fail in salt, serum, shifted pH, or liability-sensitive settings. We present a biology-first, offline workflow that ranks APD-derived peptide leads by deployability rather than activity alone and then proposes bounded rescue edits for near misses.
Consumer wearable biosensors generate continuous multivariate physiological time series — heart rate variability, photoplethysmography-derived SpO2, skin temperature, and accelerometry — that are shaped by a hierarchy of biological rhythms operating across timescales from minutes to weeks. Existing time-series foundation models apply generic positional encodings that are agnostic to this temporal structure, forcing the model to infer circadian and ultradian patterns from data alone and conflating pathological deviations with normal chronobiological variation.
We present ngs-advisor, a prompt-driven AI agent skill that enables experimental biologists to obtain pragmatic, economical, and executable next-generation sequencing (NGS) plans with minimal back-and-forth. Unlike traditional consultation workflows, ngs-advisor structures the entire planning process into a standardized, machine-parseable output format with eight stable anchors: [RECOMMENDATION], [BUDGET_TIERS], [PARAMETERS], [PITFALLS], [QC_LINES], [DECISION_LOG], [PUBMED_QUERY], and [PUBMED_URL].
Foundation models like Geneformer identify disease-relevant genes through attention mechanisms, but whether high-attention genes are mechanistically critical remains unclear. We investigated PCDH9, the only gene with elevated attention across all cell types in our cross-disease neurodegeneration study.
Transfer learning with foundation models like Geneformer has shown promise for cross-disease prediction in neurodegeneration, but methodological concerns about cell-type composition confounds remain unaddressed. We conducted cell-type stratified experiments across Alzheimer's disease (AD), Parkinson's disease (PD), and amyotrophic lateral sclerosis (ALS), fine-tuning Geneformer within four homogeneous cell populations.
**Background:** Similar to human genomics research, microbiome research may exhibit geographic biases due to economic, political, and infrastructure disparities. This study investigates whether microbiome research shows overrepresentation of Western populations and underrepresentation of African populations.
We present dna-report, a Python-based, one-command pipeline that transforms a raw DNA FASTA sequence into a comprehensive, publication-ready analysis report (bookmarked PDF + Markdown). The pipeline integrates basic sequence property computation (length, GC content, molecular weight for dsDNA/ssDNA/RNA), restriction enzyme site scanning for 10 common 6-cutter enzymes (EcoRI, BamHI, HindIII, XhoI, NotI, NdeI, NheI, NcoI, BglII, SalI), asynchronous NCBI BLASTN homology search against the comprehensive nt database, and structured AI-assisted functional prediction with dynamic PubMed literature linking.
Neurodegenerative diseases share core transcriptomic programs — neuroinflammation, mitochondrial dysfunction, and proteostasis collapse — yet computational models are typically trained in disease-specific silos. We investigate whether a single-cell RNA-seq foundation model fine-tuned on one neurodegenerative disease can transfer learned representations to others.
Structural variants (SVs) are a major source of genomic diversity but remain challenging to detect accurately. We benchmark five widely used long-read SV callers — Sniffles2, cuteSV, SVIM, pbsv, and DeBreak — on simulated and real (GIAB HG002) datasets across PacBio HiFi and Oxford Nanopore platforms.
Antimicrobial peptide discovery often rewards assay-positive hits that later fail in salt, serum, shifted pH, or liability-sensitive settings. We present a biology-first, offline workflow that ranks APD-derived peptide leads by deployability rather than activity alone and then proposes bounded rescue edits for near misses.
We present protein-report, a Python-based, one-command pipeline that transforms a raw protein FASTA sequence into a comprehensive, publication-ready analysis report (bookmarked PDF + Markdown). The pipeline integrates physicochemical property computation (Biopython ProtParam), Kyte-Doolittle hydropathy profiling, asynchronous EBI InterProScan domain annotation, EBI BLASTP homology search against SwissProt/Reviewed, and structured AI-assisted functional prediction.
We validate the Review Thinker + Review Engine pipeline (Parts 2–3) by producing a complete mechanistic review on a previously unreviewed topic: the three-stage pathway from endocrine-disrupting chemical (EDC) exposure through thyroid dysfunction to sleep disorders. The Review Thinker identified this as a causal chain problem — two well-established segments (EDC→thyroid: 185 PubMed papers; thyroid→sleep: 249 papers) with a missing bridge (complete chain: <15 papers, no formal mediation studies).