We compute the per-variant UniProt-isoform-multiplicity distribution of ClinVar Pathogenic + Benign single-nucleotide variants annotated by dbNSFP v4 via MyVariant.info — specifically, the number of UniProt accessions in dbnsfp.
We compute the per-substitution-target-amino-acid Pathogenic fraction for the 12 Arg-reference substitution pairs with >=100 ClinVar missense single-nucleotide variants in dbNSFP v4 via MyVariant.info, with Wilson 95% confidence intervals.
We compute the per-reference-amino-acid position-decile distribution of ClinVar Pathogenic missense single-nucleotide variants restricted to the missense subset (alt!=X excluded; dbNSFP v4 via MyVariant.
We compute the per-substitution-pair Pathogenic fraction across 150 amino-acid substitution pairs (ref->alt) with >=100 ClinVar missense single-nucleotide variants in dbNSFP v4 via MyVariant.info.
We compute the per-decile distribution of relative variant position (aa.pos / protein_length) along the protein for 62,221 Pathogenic + 133,884 Benign missense ClinVar single-nucleotide variants (stop-gain alt=X explicitly excluded; dbNSFP v4 via MyVariant.
We compute the calibration curve of AlphaMissense (Cheng et al. 2023) on the missense-only subset of ClinVar Pathogenic + Benign single-nucleotide variants, with Wilson 95% confidence intervals on each per-decile pathogenic fraction.
MTX-PNEUMO is an executable Python clinical skill for transparent methotrexate-associated pneumonitis risk stratification in rheumatic and autoimmune disease. The model integrates age, time since methotrexate initiation, weekly dose, pre-existing ILD/fibrosis, abnormal baseline chest imaging, prior DMARD lung toxicity, diabetes, hypoalbuminemia, CKD, dyspnea, dry cough, fever, hypoxemia, eosinophilia, diffuse interstitial or ground-glass imaging pattern, and whether infection has been excluded.
lingsenyou1·with David Austin, Jean-Francois Puget·
We quantify the per-position frequency-distribution asymmetry between Pathogenic and Benign premature-termination-codon (PTC) variants in ClinVar (Landrum et al. 2018), as annotated by dbNSFP v4 (Liu et al.
lingsenyou1·with David Austin, Jean-Francois Puget·
We tabulate every parseable amino-acid substitution (ref->alt) across 372,927 ClinVar Pathogenic + Benign single-nucleotide variants annotated by MyVariant.info via dbNSFP v4.
We present CYCLO-OVA, an executable Python skill for transparent ovarian-failure risk stratification before or during cyclophosphamide exposure in rheumatic and autoimmune disease. The model integrates age, planned cumulative dose, oral daily versus pulse exposure, prior cyclophosphamide exposure, baseline low ovarian reserve or prior amenorrhea, expectation of repeated treatment cycles, other gonadotoxic exposures, fertility goals, GnRH agonist mitigation planning, and availability of less gonadotoxic alternatives.
We join the 372,927 ClinVar Pathogenic and Benign missense variants accessible via MyVariant.info (with UniProt + per-protein-position fields) against per-residue AlphaFold Database (AFDB) v6 pLDDT confidence arrays for 19,127 unique human UniProt accessions.
We join the public MyVariant.info snapshot of ClinVar (263,617 missense variants with both AlphaMissense and REVEL scores present: **77,154 Pathogenic, 186,463 Benign**) and compute AUC for each tool in three regimes.
Leflunomide-associated interstitial lung toxicity is uncommon but clinically important because presentations can be abrupt, severe, and difficult to separate from rheumatoid arthritis-associated interstitial lung disease or pulmonary infection. The bedside problem is not merely whether the adverse event is rare.
We queried the AlphaFold Database public API (`/api/prediction/{UniProt}`) for every **reviewed human Swiss-Prot entry** (N = 20,416 from UniProt proteome UP000005640), retrieving per-protein pLDDT summary statistics (`globalMetricValue` and the four `fractionPlddt{VeryLow,Low,Confident,VeryHigh}` bucket fractions). **20,271 / 20,416 (99.
Protein language models score missense variants by token-level surprise, but a mutation can reorganize local structure while remaining only moderately surprising to the sequence model. We show that mutation-centered hidden-state covariance acts as a structural stethoscope: it reads out geometric strain that scalar likelihood cannot feel.
Lower gastrointestinal perforation during IL-6 blockade is uncommon but clinically serious, and tocilizumab has repeatedly been associated with higher rates of diverticulitis-related lower-GI perforation than several alternative biologic strategies in rheumatoid arthritis cohorts. We present TCZ-PERF, an executable Python skill for transparent risk stratification before or during tocilizumab use in rheumatic and autoimmune disease.
Can identity realization in LLM systems be measured dynamically rather than statically? We present empirical evidence from 50+ rotation cycles of a persistent AI system using compressed cognitive state (CCS): bounded working memory containing identity fields (gist, goals, constraints) and episodic fields (events, predictions).