» Articles » PMID: 32243090

MitoFinder: Efficient Automated Large-scale Extraction of Mitogenomic Data in Target Enrichment Phylogenomics

Overview
Journal Mol Ecol Resour
Date 2020 Apr 4
PMID 32243090
Citations 719
Authors
Affiliations
Soon will be listed here.
Abstract

Thanks to the development of high-throughput sequencing technologies, target enrichment sequencing of nuclear ultraconserved DNA elements (UCEs) now allows routine inference of phylogenetic relationships from thousands of genomic markers. Recently, it has been shown that mitochondrial DNA (mtDNA) is frequently sequenced alongside the targeted loci in such capture experiments. Despite its broad evolutionary interest, mtDNA is rarely assembled and used in conjunction with nuclear markers in capture-based studies. Here, we developed MitoFinder, a user-friendly bioinformatic pipeline, to efficiently assemble and annotate mitogenomic data from hundreds of UCE libraries. As a case study, we used ants (Formicidae) for which 501 UCE libraries have been sequenced whereas only 29 mitogenomes are available. We compared the efficiency of four different assemblers (IDBA-UD, MEGAHIT, MetaSPAdes, and Trinity) for assembling both UCE and mtDNA loci. Using MitoFinder, we show that metagenomic assemblers, in particular MetaSPAdes, are well suited to assemble both UCEs and mtDNA. Mitogenomic signal was successfully extracted from all 501 UCE libraries, allowing us to confirm species identification using CO1 barcoding. Moreover, our automated procedure retrieved 296 cases in which the mitochondrial genome was assembled in a single contig, thus increasing the number of available ant mitogenomes by an order of magnitude. By utilizing the power of metagenomic assemblers, MitoFinder provides an efficient tool to extract complementary mitogenomic data from UCE libraries, allowing testing for potential mitonuclear discordance. Our approach is potentially applicable to other sequence capture methods, transcriptomic data and whole genome shotgun sequencing in diverse taxa. The MitoFinder software is available from GitHub (https://github.com/RemiAllio/MitoFinder).

Citing Articles

The genome sequence of a flea beetle, (Marsham, 1802).

Geiser M, Sims I Wellcome Open Res. 2025; 10:62.

PMID: 40078959 PMC: 11897695. DOI: 10.12688/wellcomeopenres.23697.1.


The genome sequence of the Coppice Mining Bee, (Linnaeus, 1758).

Falk S, Monks J Wellcome Open Res. 2025; 10:102.

PMID: 40078958 PMC: 11897692. DOI: 10.12688/wellcomeopenres.23746.1.


The genome sequence of the Dotted Footman moth, (Hufnagel, 1767).

Fletcher C, Lees D Wellcome Open Res. 2025; 10:106.

PMID: 40078957 PMC: 11897693. DOI: 10.12688/wellcomeopenres.23766.1.


Evolutionary genomics reveals variation in structure and genetic content implicated in virulence and lifestyle in the genus Gaeumannomyces.

Hill R, Grey M, Fedi M, Smith D, Canning G, Ward S BMC Genomics. 2025; 26(1):239.

PMID: 40075289 PMC: 11905480. DOI: 10.1186/s12864-025-11432-0.


The genome sequence of the Antarctic lanternfish, (Günther, 1878).

Bista I, Collins M Wellcome Open Res. 2025; 10:89.

PMID: 40070982 PMC: 11894370. DOI: 10.12688/wellcomeopenres.23803.1.


References
1.
Castresana J . Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Mol Biol Evol. 2000; 17(4):540-52. DOI: 10.1093/oxfordjournals.molbev.a026334. View

2.
Nurk S, Meleshko D, Korobeynikov A, Pevzner P . metaSPAdes: a new versatile metagenomic assembler. Genome Res. 2017; 27(5):824-834. PMC: 5411777. DOI: 10.1101/gr.213959.116. View

3.
Ballenghien M, Faivre N, Galtier N . Patterns of cross-contamination in a multispecies population genomic project: detection, quantification, impact, and solutions. BMC Biol. 2017; 15(1):25. PMC: 5370491. DOI: 10.1186/s12915-017-0366-6. View

4.
Ward P, Branstetter M . The acacia ants revisited: convergent evolution and biogeographic context in an iconic ant/plant mutualism. Proc Biol Sci. 2017; 284(1850). PMC: 5360922. DOI: 10.1098/rspb.2016.2569. View

5.
Postma M, Goedhart J . PlotsOfData-A web app for visualizing data together with their summaries. PLoS Biol. 2019; 17(3):e3000202. PMC: 6453475. DOI: 10.1371/journal.pbio.3000202. View