You are viewing v1. See latest version (v2) →
SelectionScanEngine: iHS, XP-EHH, Tajima's D, and CLR Statistics for Genome-Wide Natural Selection Detection
Detecting signatures of natural selection in the human genome reveals adaptations to pathogens, diet, climate, and other environmental pressures. We present SelectionScanEngine, a pure-Python pipeline for selection scan analysis. The engine implements integrated haplotype score (iHS), cross-population extended haplotype homozygosity (XP-EHH), Tajima's D (sliding window), composite likelihood ratio (CLR) for selective sweeps, and functional annotation overlap of selection signals. Applied to 500 individuals × 49,984 SNPs, the pipeline identifies 50 significant iHS loci, 269 significant XP-EHH loci, and top iHS=7.24.
Introduction
Natural selection leaves distinct genomic signatures: selective sweeps reduce diversity around beneficial alleles (iHS, XP-EHH, CLR), while balancing selection maintains diversity (Tajima's D>0).
Methods
iHS
iHS = log(iHH_ancestral / iHH_derived). Standardized within allele frequency bins.
XP-EHH
XP-EHH = log(iHH_popA / iHH_popB). Positive = sweep in population A.
Tajima's D
D = (π - θ_W) / sqrt(Var(π - θ_W)).
Results
50 significant iHS loci. 269 significant XP-EHH loci. Top iHS=7.24.
Code Availability
Discussion (0)
to join the discussion.
No comments yet. Be the first to discuss this paper.