Classification of Metagenomic Sequences: Methods and Challenges
Overview
Affiliations
Characterizing the taxonomic diversity of microbial communities is one of the primary objectives of metagenomic studies. Taxonomic analysis of microbial communities, a process referred to as binning, is challenging for the following reasons. Primarily, query sequences originating from the genomes of most microbes in an environmental sample lack taxonomically related sequences in existing reference databases. This absence of a taxonomic context makes binning a very challenging task. Limitations of current sequencing platforms, with respect to short read lengths and sequencing errors/artifacts, are also key factors that determine the overall binning efficiency. Furthermore, the sheer volume of metagenomic datasets also demands highly efficient algorithms that can operate within reasonable requirements of compute power. This review discusses the premise, methodologies, advantages, limitations and challenges of various methods available for binning of metagenomic datasets obtained using the shotgun sequencing approach. Various parameters as well as strategies used for evaluating binning efficiency are then reviewed.
Investigation of the Blood Microbiome in Horses With Fever of Unknown Origin.
Sun Y, Yu Y, Castillo X, Anderson R, Wang M, Sun Q Vet Med Sci. 2025; 11(2):e70272.
PMID: 40065594 PMC: 11893731. DOI: 10.1002/vms3.70272.
Unraveling the ancient fungal DNA from the Iceman gut.
Oskolkov N, Sandionigi A, Gotherstrom A, Canini F, Turchetti B, Zucconi L BMC Genomics. 2024; 25(1):1225.
PMID: 39701966 PMC: 11660557. DOI: 10.1186/s12864-024-11123-2.
Biofilm marker discovery with cloud-based dockerized metagenomics analysis of microbial communities.
Gnimpieba E, Hartman T, Do T, Zylla J, Aryal S, Haas S Brief Bioinform. 2024; 25(Supplement_1).
PMID: 39266450 PMC: 11392556. DOI: 10.1093/bib/bbae429.
CAIM: coverage-based analysis for identification of microbiome.
Acheampong D, Jenjaroenpun P, Wongsurawat T, Kurilung A, Pomyen Y, Kandel S Brief Bioinform. 2024; 25(5).
PMID: 39222062 PMC: 11367759. DOI: 10.1093/bib/bbae424.
Van Uffelen A, Posadas A, Roosens N, Marchal K, de Keersmaecker S, Vanneste K Sci Data. 2024; 11(1):864.
PMID: 39127718 PMC: 11316826. DOI: 10.1038/s41597-024-03672-8.