NUMT Confounding Biases Mitochondrial Heteroplasmy Calls in Favor of the Reference Allele
Overview
Authors
Affiliations
Homology between mitochondrial DNA (mtDNA) and nuclear DNA of mitochondrial origin (nuMTs) causes confounding when aligning short sequence reads to the reference human genome, as the true sequence origin cannot be determined. Using a systematic approach, we here report the impact of all potential mitochondrial variants on alignment accuracy and variant calling. A total of 49,707 possible mutations were introduced across the 16,569 bp reference mitochondrial genome (16,569 × 3 alternative alleles), one variant at-at-time. The resulting fragmentation and alignment to the entire reference genome (GRCh38) revealed preferential mapping of mutated mitochondrial fragments to nuclear loci, as variants increased loci similarity to nuMTs, for a total of 807, 362, and 41 variants at 333, 144, and 27 positions when using 100, 150, and 300 bp single-end fragments. We subsequently modeled these affected variants at 50% heteroplasmy and carried out variant calling, observing bias in the reported allele frequencies in favor of the reference allele. Four variants (chrM:6023A, chrM:4456T, chrM:5147A, and chrM:7521A) including a possible hypertension factor, chrM:4456T, caused 100% loss of coverage at the mutated position (with all 100 bp single-end fragments aligning to homologous, nuclear positions instead of chrM), rendering these variants undetectable when aligning to the entire reference genome. Furthermore, four mitochondrial variants reported to be pathogenic were found to cause significant loss of coverage and select haplogroup-defining SNPs were shown to exacerbate the loss of coverage caused by surrounding variants. Increased fragment length and use of paired-end reads both improved alignment accuracy.
Cheung K, Rollins L, Hammond J, Barton K, Ferguson J, Eyck H Genome Biol Evol. 2024; 16(11).
PMID: 39548850 PMC: 11606642. DOI: 10.1093/gbe/evae246.
The quality and detection limits of mitochondrial heteroplasmy by long read nanopore sequencing.
Slapnik B, Sket R, crepinsek K, Tesovnik T, Jenko Bizjan B, Kovac J Sci Rep. 2024; 14(1):26778.
PMID: 39501054 PMC: 11538439. DOI: 10.1038/s41598-024-78270-0.
Characterization of Nuclear Mitochondrial Insertions in Canine Genome Assemblies.
Schall P, Meadows J, Ramos-Almodovar F, Kidd J Genes (Basel). 2024; 15(10).
PMID: 39457442 PMC: 11507379. DOI: 10.3390/genes15101318.
Dorji J, Chamberlain A, Reich C, VanderJagt C, Nguyen T, Daetwyler H Genet Sel Evol. 2024; 56(1):62.
PMID: 39266998 PMC: 11391750. DOI: 10.1186/s12711-024-00931-5.
Zhou W, Karan K, Gu W, Klein H, Sturm G, De Jager P PLoS Biol. 2024; 22(8):e3002723.
PMID: 39172952 PMC: 11340991. DOI: 10.1371/journal.pbio.3002723.