Freeing Phylogenies from Artifacts of Alignment
Widely used methods for phylogenetic inference, both those that require alignments and those that produce them, share certain weaknesses. These weaknesses are discussed, and a method that lacks them is introduced. For each pair of sequences in the data set, the method uses both insertion-deletion and amino acid replacement information to estimate a pairwise evolutionary distance. It can also accommodate regional heterogeneity of replacement rates. Because a likelihood framework is adopted, the standard deviation of each pairwise distance can be estimated. The distance matrix and standard error estimates are used to infer a phylogenetic tree. As an example, the method is applied to 10 widely diverged sequences of the second largest RNA polymerase subunit. A pseudo-bootstrap technique is devised to assess the validity of the inferred phylogenetic tree.
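The abstract does not spell out how the pseudo-bootstrap works, so the following is only a minimal sketch of one plausible reading: each pairwise distance is redrawn from a normal distribution centred on its estimate, with the estimated standard error as its spread, and the tree is re-inferred from each perturbed matrix. To stay short, the sketch assumes four taxa and substitutes the four-point condition for the tree inference step. The function names `quartet_topology` and `pseudo_bootstrap` and all numbers are hypothetical and chosen for illustration only.

```python
import random
from collections import Counter


def quartet_topology(d):
    """Pick the split favoured by the four-point condition for taxa 0..3.

    `d` maps frozenset({i, j}) to a distance. The favoured split is the
    pairing whose two within-pair distances have the smallest sum.
    """
    sums = {
        "01|23": d[frozenset((0, 1))] + d[frozenset((2, 3))],
        "02|13": d[frozenset((0, 2))] + d[frozenset((1, 3))],
        "03|12": d[frozenset((0, 3))] + d[frozenset((1, 2))],
    }
    return min(sums, key=sums.get)


def pseudo_bootstrap(dist, se, replicates=1000, seed=0):
    """Estimate split support by perturbing distances with their standard errors.

    Assumption: each distance is resampled independently from a normal
    distribution; negative draws are truncated to zero.
    """
    rng = random.Random(seed)
    counts = Counter()
    for _ in range(replicates):
        resampled = {
            pair: max(0.0, rng.gauss(dist[pair], se[pair])) for pair in dist
        }
        counts[quartet_topology(resampled)] += 1
    return {split: n / replicates for split, n in counts.items()}


# Hypothetical distances and standard errors (not taken from the paper).
dist = {
    frozenset((0, 1)): 0.30, frozenset((2, 3)): 0.40,
    frozenset((0, 2)): 0.90, frozenset((1, 3)): 1.00,
    frozenset((0, 3)): 0.95, frozenset((1, 2)): 0.95,
}
se = {pair: 0.05 for pair in dist}

support = pseudo_bootstrap(dist, se, replicates=500, seed=1)
print(support)
```

The fraction of replicates that recover a split serves as a rough support value. With more than four taxa, the four-point step would be replaced by whatever distance-based tree method is in use, ideally one that weights each distance by its standard error.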