Mold, a Novel Software to Compile Accurate and Reliable DNA Diagnoses for Taxonomic Descriptions
Overview
Environmental Health
Molecular Biology
Affiliations
DNA data are increasingly being used for phylogenetic inference, and taxon delimitation and identification, but scarcely for the formal description of taxa, despite their undisputable merits in taxonomy. The uncertainty regarding the robustness of DNA diagnoses, however, remains a major impediment to their use. We have developed a new program, mold, that identifies diagnostic nucleotide combinations (DNCs) in DNA sequence alignments for selected taxa, which can be used to provide formal diagnoses of these taxa. To test the robustness of DNA diagnoses, we carry out iterated haplotype subsampling for selected query species in published DNA data sets of varying complexity. We quantify the reliability of diagnosis by diagnosing each query subsample and then checking if this diagnosis remains valid against the entire data set. We demonstrate that widely used types of diagnostic DNA characters are often absent for a query taxon or are not sufficiently reliable. We thus propose a new type of DNA diagnosis, termed "redundant DNC" (or rDNC), which takes into account unsampled genetic diversity, and constitutes a much more reliable descriptor of a taxon. mold successfully retrieves rDNCs for all but two species in the analysed data sets, even in those comprising hundreds of species. mold shows unparalleled efficiency in large DNA data sets and is the only available software capable of compiling DNA diagnoses that suit predefined criteria of reliability.
Vences M, Patmanidis S, Schmidt J, Matschiner M, Miralles A, Renner S Bioinform Adv. 2024; 4(1):vbae083.
PMID: 38895561 PMC: 11184345. DOI: 10.1093/bioadv/vbae083.
DNA Barcode-Based Species Diagnosis with MolD.
Fedosov A, Puillandre N, Fischell F, Patmanidis S, Miralles A, Vences M Methods Mol Biol. 2024; 2744:297-311.
PMID: 38683327 DOI: 10.1007/978-1-0716-3581-0_19.
iTaxoTools 1.0: Improved DNA Barcode Exploration with TaxI2.
Vences M, Patmanidis S, Fedosov A, Miralles A, Puillandre N Methods Mol Biol. 2024; 2744:281-296.
PMID: 38683326 DOI: 10.1007/978-1-0716-3581-0_18.
DNA Barcodes in Taxonomic Descriptions.
Brower A, DeSalle R Methods Mol Biol. 2024; 2744:105-115.
PMID: 38683313 DOI: 10.1007/978-1-0716-3581-0_5.
Rheindt F, Bouchard P, Pyle R, Welter-Schultes F, Aescht E, Ahyong S PLoS Biol. 2023; 21(8):e3002251.
PMID: 37607211 PMC: 10443861. DOI: 10.1371/journal.pbio.3002251.