Read-SpaM: Assembly-free and Alignment-free Comparison of Bacterial Genomes with Low Sequencing Coverage
Overview
Authors
Affiliations
Background: In many fields of biomedical research, it is important to estimate phylogenetic distances between taxa based on low-coverage sequencing reads. Major applications are, for example, phylogeny reconstruction, species identification from small sequencing samples, or bacterial strain typing in medical diagnostics.
Results: We adapted our previously developed software program Filtered Spaced-Word Matches (FSWM) for alignment-free phylogeny reconstruction to take unassembled reads as input; we call this implementation Read-SpaM.
Conclusions: Test runs on simulated reads from semi-artificial and real-world bacterial genomes show that our approach can estimate phylogenetic distances with high accuracy, even for large evolutionary distances and for very low sequencing coverage.
An alignment-free method for phylogeny estimation using maximum likelihood.
Zahin T, Abrar M, Jewel M, Tasnim T, Bayzid M, Rahman A BMC Bioinformatics. 2025; 26(1):77.
PMID: 40055594 PMC: 11887328. DOI: 10.1186/s12859-025-06080-w.
Chow C, Ghosh S, Hadarovich A, Toth-Petroczy A Proc Natl Acad Sci U S A. 2024; 121(42):e2401622121.
PMID: 39383002 PMC: 11494347. DOI: 10.1073/pnas.2401622121.
CONSULT-II: accurate taxonomic identification and profiling using locality-sensitive hashing.
Sapci A, Rachtman E, Mirarab S Bioinformatics. 2024; 40(4).
PMID: 38492564 PMC: 10985673. DOI: 10.1093/bioinformatics/btae150.
App-SpaM: phylogenetic placement of short reads without sequence alignment.
Blanke M, Morgenstern B Bioinform Adv. 2023; 1(1):vbab027.
PMID: 36700102 PMC: 9710606. DOI: 10.1093/bioadv/vbab027.
Rachtman E, Sarmashghi S, Bafna V, Mirarab S Cell Syst. 2022; 13(10):817-829.e3.
PMID: 36265468 PMC: 9589918. DOI: 10.1016/j.cels.2022.06.007.