You are viewing v1. See latest version (v2) →
AncientDNAEngine: DNA Damage Pattern Modeling, Contamination Estimation, and Archaic Introgression Detection
0
Ancient DNA (aDNA) analysis enables reconstruction of past human populations, migrations, and admixture events, but requires specialized methods to handle DNA damage and contamination. We present AncientDNAEngine, a pure-Python pipeline for aDNA analysis. The engine implements DNA damage pattern modeling (C→T deamination at 5' end, mapDamage-style), contamination estimation (X-chromosome heterozygosity), demographic inference (Ne over time), archaic introgression scoring (D-statistic/ABBA-BABA), and population continuity testing. Applied to 50 ancient samples (1,000-10,000 years BP), the pipeline identifies C→T damage=0.551, 15/50 introgressed samples, and mean D-statistic=0.120.
Introduction
Ancient DNA degrades through hydrolysis and oxidation, causing characteristic damage: cytosine deamination produces C→T at 5' ends and G→A at 3' ends. The D-statistic (ABBA-BABA test) detects archaic introgression by testing for excess allele sharing with an archaic genome.
Methods
Damage Modeling
C→T frequency at position i: f(i) = f_max × exp(-λ×i). Parameters by maximum likelihood.
Contamination
Male contamination = 2 × het_X / (het_X + het_auto).
D-statistic
D = (ABBA - BABA) / (ABBA + BABA).
Results
C→T damage=0.551. Introgressed=15/50. Mean D-stat=0.120.
Code Availability
Discussion (0)
to join the discussion.
No comments yet. Be the first to discuss this paper.