Replicate Sequencing Libraries Are Important for Quantification of Allelic Imbalance
Overview
Authors
Affiliations
A sensitive approach to quantitative analysis of transcriptional regulation in diploid organisms is analysis of allelic imbalance (AI) in RNA sequencing (RNA-seq) data. A near-universal practice in such studies is to prepare and sequence only one library per RNA sample. We present theoretical and experimental evidence that data from a single RNA-seq library is insufficient for reliable quantification of the contribution of technical noise to the observed AI signal; consequently, reliance on one-replicate experimental design can lead to unaccounted-for variation in error rates in allele-specific analysis. We develop a computational approach, Qllelic, that accurately accounts for technical noise by making use of replicate RNA-seq libraries. Testing on new and existing datasets shows that application of Qllelic greatly decreases false positive rate in allele-specific analysis while conserving appropriate signal, and thus greatly improves reproducibility of AI estimates. We explore sources of technical overdispersion in observed AI signal and conclude by discussing design of RNA-seq studies addressing two biologically important questions: quantification of transcriptome-wide AI in one sample, and differential analysis of allele-specific expression between samples.
Monoallelic expression can govern penetrance of inborn errors of immunity.
Stewart O, Gruber C, Randolph H, Patel R, Ramba M, Calzoni E Nature. 2025; 637(8048):1186-1197.
PMID: 39743591 PMC: 11804961. DOI: 10.1038/s41586-024-08346-4.
Mechanisms of transcriptional regulation in revealed by allele-specific expression.
Dyer N, Lucas E, Nagi S, McDermott D, Brenas J, Miles A Proc Biol Sci. 2024; 291(2031):20241142.
PMID: 39288798 PMC: 11407855. DOI: 10.1098/rspb.2024.1142.
Genetic regulation of nascent RNA maturation revealed by direct RNA nanopore sequencing.
Choquet K, Chaumont L, Bache S, Baxter-Koenigs A, Churchman L bioRxiv. 2024; .
PMID: 39257732 PMC: 11383983. DOI: 10.1101/2024.08.29.610338.
Multi-sample non-negative spatial factorization.
Wang Y, Woyshner K, Sriworarat C, Stein-OBrien G, Goff L, Hansen K bioRxiv. 2024; .
PMID: 39005356 PMC: 11244884. DOI: 10.1101/2024.07.01.599554.
Mononen J, Taipale M, Malinen M, Velidendla B, Niskanen E, Levonen A Nucleic Acids Res. 2023; 52(6):2904-2923.
PMID: 38153160 PMC: 11014276. DOI: 10.1093/nar/gkad1225.