» Articles » PMID: 33706720

ContigExtender: a New Approach to Improving De Novo Sequence Assembly for Viral Metagenomics Data

Overview
Publisher Biomed Central
Specialty Biology
Date 2021 Mar 12
PMID 33706720
Citations 7
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Metagenomics is the study of microbial genomes for pathogen detection and discovery in human clinical, animal, and environmental samples via Next-Generation Sequencing (NGS). Metagenome de novo sequence assembly is a crucial analytical step in which longer contigs, ideally whole chromosomes/genomes, are formed from shorter NGS reads. However, the contigs generated from the de novo assembly are often very fragmented and rarely longer than a few kilo base pairs (kb). Therefore, a time-consuming extension process is routinely performed on the de novo assembled contigs.

Results: To facilitate this process, we propose a new tool for metagenome contig extension after de novo assembly. ContigExtender employs a novel recursive extending strategy that explores multiple extending paths to achieve highly accurate longer contigs. We demonstrate that ContigExtender outperforms existing tools in synthetic, animal, and human metagenomics datasets.

Conclusions: A novel software tool ContigExtender has been developed to assist and enhance the performance of metagenome de novo assembly. ContigExtender effectively extends contigs from a variety of sources and can be incorporated in most viral metagenomics analysis pipelines for a wide variety of applications, including pathogen detection and viral discovery.

Citing Articles

Virseqimprover: an integrated pipeline for viral contig error correction, extension, and annotation.

Song H, Tithi S, Brown C, Aylward F, Jensen R, Zhang L PeerJ. 2025; 13():e18515.

PMID: 39807156 PMC: 11727651. DOI: 10.7717/peerj.18515.


Sentinel Surveillance reveals phylogenetic diversity and detection of linear plasmids harboring and among enterococci collected in the United States.

Kent A, Spicer L, Campbell D, Breaker E, McAllister G, Ewing T Antimicrob Agents Chemother. 2024; 68(11):e0059124.

PMID: 39404260 PMC: 11539240. DOI: 10.1128/aac.00591-24.


COBRA improves the completeness and contiguity of viral genomes assembled from metagenomes.

Chen L, Banfield J Nat Microbiol. 2024; 9(3):737-750.

PMID: 38321183 PMC: 10914622. DOI: 10.1038/s41564-023-01598-2.


Exploring the Archaeal Virosphere by Metagenomics.

Zhou Y, Wang Y, Prangishvili D, Krupovic M Methods Mol Biol. 2023; 2732:1-22.

PMID: 38060114 DOI: 10.1007/978-1-0716-3515-5_1.


Highly divergent CRESS DNA and picorna-like viruses associated with bleached thalli of the green seaweed .

van der Loos L, De Coninck L, Zell R, Lequime S, Willems A, De Clerck O Microbiol Spectr. 2023; :e0025523.

PMID: 37724866 PMC: 10581178. DOI: 10.1128/spectrum.00255-23.


References
1.
Clarke E, Taylor L, Zhao C, Connell A, Lee J, Fett B . Sunbeam: an extensible pipeline for analyzing metagenomic sequencing experiments. Microbiome. 2019; 7(1):46. PMC: 6429786. DOI: 10.1186/s40168-019-0658-x. View

2.
Phan T, da Costa A, Zhang W, Pothier P, Ambert-Balay K, Deng X . A new gyrovirus in human feces. Virus Genes. 2015; 51(1):132-5. PMC: 4519424. DOI: 10.1007/s11262-015-1210-0. View

3.
Kapusinszky B, Ardeshir A, Mulvaney U, Deng X, Delwart E . Case-Control Comparison of Enteric Viromes in Captive Rhesus Macaques with Acute or Idiopathic Chronic Diarrhea. J Virol. 2017; 91(18). PMC: 5571273. DOI: 10.1128/JVI.00952-17. View

4.
Afiahayati , Sato K, Sakakibara Y . An extended genovo metagenomic assembler by incorporating paired-end information. PeerJ. 2013; 1:e196. PMC: 3817583. DOI: 10.7717/peerj.196. View

5.
Yang X, Charlebois P, Gnerre S, Coole M, Lennon N, Levin J . De novo assembly of highly diverse viral populations. BMC Genomics. 2012; 13:475. PMC: 3469330. DOI: 10.1186/1471-2164-13-475. View