Detecting Alu Insertions from High-throughput Sequencing Data
High-throughput sequencing technologies have made it possible to catalogue variation in personal human genomes. In this manuscript, we present alu-detect, a tool that combines read-pair and split-read information to detect novel Alus and their precise breakpoints directly from either whole-genome or whole-exome sequencing data, while also identifying insertions in the immediate vicinity of existing Alus. To set the parameters of our method, we simulate a faux reference, which allows us to compute the precision and recall of various parameter settings using real sequencing data. Applying our method to 100 bp paired Illumina data from seven individuals, including two trios, we detected on average 1519 novel Alus per sample. Based on the faux-reference simulation, we estimate that our method has 97% precision and 85% recall. We identify 808 novel Alus not previously described in other studies. We also demonstrate the use of alu-detect to study the local sequence and global location preferences for novel Alu insertions.
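The precision and recall estimates obtained from the faux-reference simulation can be illustrated with a small sketch. It is not the alu-detect implementation. It assumes that the faux reference yields a ground-truth set of insertion positions (for example, known Alus deliberately removed from the reference so that they look novel in the sample), and that a call counts as correct when it falls within a fixed distance of an unmatched truth site. The function name `precision_recall`, the `(chrom, pos)` tuple format, and the 50 bp `tolerance` are hypothetical choices made for illustration only.

```python
from bisect import bisect_left
from collections import defaultdict


def precision_recall(calls, truth, tolerance=50):
    """Estimate precision and recall of insertion calls against a truth set.

    calls, truth: iterables of (chrom, pos) tuples (assumed format).
    tolerance: maximum distance in bp for a call to match a truth site
               (hypothetical default, not taken from the paper).
    Each truth site can be matched by at most one call.
    """
    calls = list(calls)
    truth = list(truth)

    # Index truth positions per chromosome, sorted for binary search.
    truth_by_chrom = defaultdict(list)
    for chrom, pos in truth:
        truth_by_chrom[chrom].append(pos)
    for positions in truth_by_chrom.values():
        positions.sort()

    matched = set()  # (chrom, index) of truth sites already claimed by a call
    true_positives = 0
    for chrom, pos in sorted(calls):
        positions = truth_by_chrom.get(chrom, [])
        # First truth site at or beyond the left edge of the tolerance window.
        i = bisect_left(positions, pos - tolerance)
        while i < len(positions) and positions[i] <= pos + tolerance:
            if (chrom, i) not in matched:
                matched.add((chrom, i))
                true_positives += 1
                break
            i += 1

    precision = true_positives / len(calls) if calls else 0.0
    recall = true_positives / len(truth) if truth else 0.0
    return precision, recall


if __name__ == "__main__":
    # Toy data: three calls, four truth sites.
    example_calls = [("chr1", 100), ("chr1", 5000), ("chr2", 300)]
    example_truth = [("chr1", 120), ("chr2", 310), ("chr2", 9000), ("chr3", 50)]
    p, r = precision_recall(example_calls, example_truth)
    print(f"precision={p:.2f} recall={r:.2f}")
```

Running such a comparison once per candidate parameter setting would give the precision and recall trade-off from which a setting can be chosen. The matching here is greedy, which is adequate when truth sites are farther apart than the tolerance window.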