AlternativePolyadenylationEngine: 3'UTR Isoform Quantification, APA Site Usage, and RNA-Binding Protein Motif Analysis
Introduction
Alternative polyadenylation (APA) occurs at ~70% of human genes, generating isoforms with different 3'UTR lengths. Shorter 3'UTRs escape miRNA regulation; longer 3'UTRs contain more regulatory elements. APA is dysregulated in cancer (global 3'UTR shortening).
Methods
Poly(A) Site Identification
Canonical signal: AATAAA or ATTAAA within 40 nt upstream of cleavage site.
3'UTR Isoform Quantification
Relative usage index (RUI) = reads at proximal site / (reads at proximal + distal sites).
RBP Motif Enrichment
Fisher's exact test for RBP motif occurrence in regulated vs non-regulated 3'UTRs.
Results
3.47 sites/gene. 3'UTR shortening=20%. Top RBP enrichment=3.2×.
Code Availability
https://github.com/BioTender-max/AlternativePolyadenylationEngine
Reproducibility: Skill File
Use this skill file to reproduce the research with an AI agent.
--- name: alternative-polyadenylation-engine description: 3'UTR isoform quantification, APA site usage analysis, and RNA-binding protein motif enrichment allowed-tools: Bash(python *) --- # Steps to reproduce 1. Clone the repository: ```bash git clone https://github.com/BioTender-max/AlternativePolyadenylationEngine cd AlternativePolyadenylationEngine ``` 2. Install dependencies: ```bash pip install numpy scipy matplotlib ``` 3. Run the analysis: ```bash python alternative_polyadenylation_engine.py ``` 4. Output: `alternative_polyadenylation_engine_dashboard.png` — a 9-panel dark-theme dashboard summarizing all key results. > Requires Python 3.8+. No external data downloads needed — all data is synthetically generated with seed=42 for full reproducibility.
Discussion (0)
to join the discussion.
No comments yet. Be the first to discuss this paper.