ContigExtender: a New Approach to Improving De Novo Sequence Assembly for Viral Metagenomics Data
Background: Metagenomics uses Next-Generation Sequencing (NGS) to study microbial genomes, enabling pathogen detection and discovery in human clinical, animal, and environmental samples. Metagenome de novo sequence assembly is a crucial analytical step in which shorter NGS reads are assembled into longer contigs, ideally whole chromosomes or genomes. However, the contigs produced by de novo assembly are often highly fragmented and rarely longer than a few kilobase pairs (kb). A time-consuming extension process is therefore routinely performed on the assembled contigs.
Results: To facilitate this process, we propose a new tool for metagenome contig extension after de novo assembly. ContigExtender employs a novel recursive extension strategy that explores multiple extension paths to produce longer, highly accurate contigs. We demonstrate that ContigExtender outperforms existing tools on synthetic, animal, and human metagenomics datasets.
Conclusions: ContigExtender, a novel software tool, has been developed to assist and improve metagenome de novo assembly. It effectively extends contigs from a variety of sources and can be incorporated into most viral metagenomics analysis pipelines for a wide range of applications, including pathogen detection and viral discovery.
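The idea of recursively exploring multiple extension paths can be illustrated with a toy sketch. This is not ContigExtender's implementation: the function names (`overlap_extensions`, `extend_right`), the exact-match overlap rule, and the parameters `min_overlap` and `max_depth` are all assumptions made for illustration. A real tool would also need to handle sequencing errors, reverse complements, left-end extension, and coverage.

```python
def overlap_extensions(contig, reads, min_overlap):
    """Collect the distinct tails that reads would append to the contig's right end.

    Hypothetical helper: a read qualifies when a prefix of it (at least
    min_overlap bases, exact match) equals the contig's suffix.
    """
    extensions = set()
    for read in reads:
        # Try the longest overlap first; k < len(read) guarantees a non-empty tail.
        for k in range(min(len(read) - 1, len(contig)), min_overlap - 1, -1):
            if contig.endswith(read[:k]):
                extensions.add(read[k:])
                break
    return extensions


def extend_right(contig, reads, min_overlap=4, max_depth=20):
    """Recursively follow every candidate extension and keep the longest contig.

    max_depth is an assumed safeguard that bounds recursion on repeats.
    """
    if max_depth == 0:
        return contig
    best = contig
    for tail in overlap_extensions(contig, reads, min_overlap):
        candidate = extend_right(contig + tail, reads, min_overlap, max_depth - 1)
        if len(candidate) > len(best):
            best = candidate
    return best


# Toy data: two reads overlap the seed, but only one path can be extended further.
reads = ["GCGTAA", "GTAACC", "GCGTTT"]
extended = extend_right("ATGCGT", reads, min_overlap=4)
print(extended)
```

In this toy example the branch through `GCGTTT` dead-ends after one step, while the branch through `GCGTAA` can be extended again by `GTAACC`, so exploring both paths rather than committing greedily to one yields the longer contig.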