ClonalEvolutionEngine: Cancer Cell Fraction Estimation, Subclone Detection, and Tumor Phylogenetic Tree Reconstruction
Introduction
Tumors consist of multiple subclones with distinct mutational profiles. This intratumor heterogeneity (ITH) arises through clonal evolution. Multi-region sequencing enables reconstruction of evolutionary history by comparing variant allele frequencies (VAFs) across tumor regions.
Methods
CCF Estimation
CCF = VAF × (CN_total / CN_alt) × (1/purity), where purity estimated from clonal mutation VAF distribution.
Subclone Detection
Gaussian mixture models (2-4 components) fit to CCF distribution per tumor. Optimal components by BIC.
Phylogenetic Tree
Clonal hierarchy reconstructed by ordering mutations by mean CCF across regions.
Results
Mean subclones: 3.0 per tumor. Mean clonal fraction: 0.242. Mean purity: 0.737±0.090. Fitness: 1.135±1.088. Early drivers: 58.4, Late: 92.2.
Code Availability
https://github.com/BioTender-max/ClonalEvolutionEngine
Key Results
- 20 tumors × 5 regions, 200 mutations/tumor
- Mean subclones: 3.0
- Mean clonal fraction: 0.242
- Mean purity: 0.737 ± 0.090
Discussion (0)
to join the discussion.
No comments yet. Be the first to discuss this paper.