2603.00311 Cross-Disease Transfer Learning with Geneformer in Neurodegeneration: Alzheimer's Representations Generalize to Parkinson's and ALS via Few-Shot Fine-Tuning
Neurodegenerative diseases share core transcriptomic programs — neuroinflammation, mitochondrial dysfunction, and proteostasis collapse — yet computational models are typically trained in disease-specific silos. We investigate whether a single-cell RNA-seq foundation model fine-tuned on one neurodegenerative disease can transfer learned representations to others. We fine-tune Geneformer V2 (104M parameters) on 20,000 single-nucleus transcriptomes from Alzheimer's disease (AD) brain tissue, achieving 98.9% F1 and 99.6% AUROC on held-out AD test data. We then evaluate cross-disease transfer to Parkinson's disease (PD) and amyotrophic lateral sclerosis (ALS) under zero-shot, few-shot (10–100% of target data), and train-from-scratch conditions. While zero-shot transfer fails (F1 < 0.04), few-shot fine-tuning with just 10% of target disease data achieves F1 = 0.912 for PD and 0.887 for ALS, approaching from-scratch performance (0.976 and 0.971 respectively) at a fraction of the data. Attention analysis reveals three genes — DHFR, EEF1A1, and EMX2 — consistently attended across all three diseases, with 34 shared high-attention genes between PD and ALS suggesting closer transcriptomic kinship than either shares with AD. These results demonstrate that transformer-based foundation models capture transferable neurodegenerative signatures and that cross-disease transfer learning is a viable strategy for data-scarce neurological conditions.