
Phylogenomics of Eukaryotes: Impact of Missing Data on Large Alignments

Overview
Journal: Mol Biol Evol
Specialty: Biology
Date: 2004 Jun 4
PMID: 15175415
Citations: 119
Abstract

Resolving the relationships between Metazoa and other eukaryotic groups as well as between metazoan phyla is central to the understanding of the origin and evolution of animals. The current view is based on limited data sets, either a single gene with many species (e.g., ribosomal RNA) or many genes but with only a few species. Because a reliable phylogenetic inference simultaneously requires numerous genes and numerous species, we assembled a very large data set containing 129 orthologous proteins (approximately 30,000 aligned amino acid positions) for 36 eukaryotic species. Included in the alignments are data from the choanoflagellate Monosiga ovata, obtained through the sequencing of about 1,000 cDNAs. We provide conclusive support for choanoflagellates as the closest relative of animals and for fungi as the second closest. The monophyly of Plantae and chromalveolates was recovered but without strong statistical support. Within animals, in contrast to the monophyly of Coelomata observed in several recent large-scale analyses, we recovered a paraphyletic Coelomata, with nematodes and platyhelminths nested within. To include a diverse sample of organisms, data from EST projects were used for several species, resulting in a large amount of missing data in our alignment (about 25%). By using different approaches, we verify that the inferred phylogeny is not sensitive to these missing data. Therefore, this large data set provides a reliable phylogenetic framework for studying eukaryotic and animal evolution and will be easily extendable when large amounts of sequence information become available from a broader taxonomic range.

Citing Articles

BAD2matrix: Phylogenomic matrix concatenation, indel coding, and more.

Salinas N, Eshel G, Coruzzi G, DeSalle R, Tessler M, Little D. Appl Plant Sci. 2024; 12(6):e11604.

PMID: 39628543 PMC: 11610412. DOI: 10.1002/aps3.11604.


Data-driven guidelines for phylogenomic analyses using SNP data.

Suissa J, De La Cerda G, Graber L, Jelley C, Wickell D, Phillips H. Appl Plant Sci. 2024; 12(6):e11611.

PMID: 39628540 PMC: 11610416. DOI: 10.1002/aps3.11611.


Evolutionary Insights into the Relationship of Frogs, Salamanders, and Caecilians and Their Adaptive Traits, with an Emphasis on Salamander Regeneration and Longevity.

Lu B. Animals (Basel). 2023; 13(22).

PMID: 38003067 PMC: 10668855. DOI: 10.3390/ani13223449.


Redefining Possible: Combining Phylogenomic and Supersparse Data in Frogs.

Portik D, Streicher J, Blackburn D, Moen D, Hutter C, Wiens J. Mol Biol Evol. 2023; 40(5).

PMID: 37140129 PMC: 10202597. DOI: 10.1093/molbev/msad109.


First putative occurrence in the fossil record of choanoflagellates, the sister group of Metazoa.

Fonseca C, Mendonca Filho J, Reolid M, Duarte L, de Oliveira A, Souza J. Sci Rep. 2023; 13(1):1242.

PMID: 36690681 PMC: 9870899. DOI: 10.1038/s41598-022-26972-8.