» Articles » PMID: 35074858

Mitochondrial DNA Variation Across 56,434 Individuals in GnomAD

Abstract

Genomic databases of allele frequency are extremely helpful for evaluating clinical variants of unknown significance; however, until now, databases such as the Genome Aggregation Database (gnomAD) have focused on nuclear DNA and have ignored the mitochondrial genome (mtDNA). Here, we present a pipeline to call mtDNA variants that addresses three technical challenges: (1) detecting homoplasmic and heteroplasmic variants, present, respectively, in all or a fraction of mtDNA molecules; (2) circular mtDNA genome; and (3) misalignment of nuclear sequences of mitochondrial origin (NUMTs). We observed that mtDNA copy number per cell varied across gnomAD cohorts and influenced the fraction of NUMT-derived false-positive variant calls, which can account for the majority of putative heteroplasmies. To avoid false positives, we excluded contaminated samples, cell lines, and samples prone to NUMT misalignment due to few mtDNA copies. Furthermore, we report variants with heteroplasmy ≥10%. We applied this pipeline to 56,434 whole-genome sequences in the gnomAD v3.1 database that includes individuals of European (58%), African (25%), Latino (10%), and Asian (5%) ancestry. Our gnomAD v3.1 release contains population frequencies for 10,850 unique mtDNA variants at more than half of all mtDNA bases. Importantly, we report frequencies within each nuclear ancestral population and mitochondrial haplogroup. Homoplasmic variants account for most variant calls (98%) and unique variants (85%). We observed that 1/250 individuals carry a pathogenic mtDNA variant with heteroplasmy above 10%. These mtDNA population allele frequencies are freely accessible and will aid in diagnostic interpretation and research studies.

Citing Articles

Enhancing diagnostic outcomes in kidney genetic disorders: the KidGen national kidney genomics study protocol.

Mallawaarachchi A, McCarthy H, Forbes T, Jayasinghe K, Patel C, Alexander S BMC Nephrol. 2025; 26(1):51.

PMID: 39901087 PMC: 11792728. DOI: 10.1186/s12882-024-03926-y.


Sequencing and characterizing human mitochondrial genomes in the biobank-based genomic research paradigm.

Luo L, Wang M, Liu Y, Li J, Bu F, Yuan H Sci China Life Sci. 2025; .

PMID: 39843848 DOI: 10.1007/s11427-024-2736-7.


Mitochondrial DNA variant detection in over 6,500 rare disease families by the systematic analysis of exome and genome sequencing data resolves undiagnosed cases.

Stenton S, Laricchia K, Lake N, Chaluvadi S, Ganesh V, DiTroia S medRxiv. 2025; .

PMID: 39763565 PMC: 11703311. DOI: 10.1101/2024.12.22.24319370.


Constraint reveals the mitochondrial genome sites most important for health and disease.

Nature. 2024; .

PMID: 39663435 DOI: 10.1038/d41586-024-04039-0.


Hospital-wide access to genomic data advanced pediatric rare disease research and clinical outcomes.

French C, Andrews N, Beggs A, Boone P, Brownstein C, Chopra M NPJ Genom Med. 2024; 9(1):60.

PMID: 39622807 PMC: 11612168. DOI: 10.1038/s41525-024-00441-9.


References
1.
Grady J, Pickett S, Ng Y, Alston C, Blakely E, Hardy S . mtDNA heteroplasmy level and copy number indicate disease burden in m.3243A>G mitochondrial disease. EMBO Mol Med. 2018; 10(6). PMC: 5991564. DOI: 10.15252/emmm.201708262. View

2.
McCormick E, Lott M, Dulik M, Shen L, Attimonelli M, Vitale O . Specifications of the ACMG/AMP standards and guidelines for mitochondrial DNA variant interpretation. Hum Mutat. 2020; 41(12):2028-2057. PMC: 7717623. DOI: 10.1002/humu.24107. View

3.
Wei W, Pagnamenta A, Gleadall N, Sanchis-Juan A, Stephens J, Broxholme J . Nuclear-mitochondrial DNA segments resemble paternally inherited mitochondrial DNA in humans. Nat Commun. 2020; 11(1):1740. PMC: 7142097. DOI: 10.1038/s41467-020-15336-3. View

4.
Cann R, Stoneking M, Wilson A . Mitochondrial DNA and human evolution. Nature. 1987; 325(6099):31-6. DOI: 10.1038/325031a0. View

5.
Lek M, Karczewski K, Minikel E, Samocha K, Banks E, Fennell T . Analysis of protein-coding genetic variation in 60,706 humans. Nature. 2016; 536(7616):285-91. PMC: 5018207. DOI: 10.1038/nature19057. View