» Articles » PMID: 10508846

CAP3: A DNA Sequence Assembly Program

Overview
Journal Genome Res
Specialty Genetics
Date 1999 Oct 6
PMID 10508846
Citations 2197
Authors
Affiliations
Soon will be listed here.
Abstract

We describe the third generation of the CAP sequence assembly program. The CAP3 program includes a number of improvements and new features. The program has a capability to clip 5' and 3' low-quality regions of reads. It uses base quality values in computation of overlaps between reads, construction of multiple sequence alignments of reads, and generation of consensus sequences. The program also uses forward-reverse constraints to correct assembly errors and link contigs. Results of CAP3 on four BAC data sets are presented. The performance of CAP3 was compared with that of PHRAP on a number of BAC data sets. PHRAP often produces longer contigs than CAP3 whereas CAP3 often produces fewer errors in consensus sequences than PHRAP. It is easier to construct scaffolds with CAP3 than with PHRAP on low-pass data with forward-reverse constraints.

Citing Articles

Comparative analysis of the mitochondrial genomes of the soft-shelled turtles Palea steindachneri and Pelodiscus axenaria and phylogenetic implications for Trionychia.

Chen C, Ji L, Huang G, Liu X, Chen H, Wang Y Sci Rep. 2025; 15(1):7138.

PMID: 40021811 PMC: 11871352. DOI: 10.1038/s41598-025-90985-2.


Treatment of avian malaria in captive African penguins () by the combination of atovaquone and proguanil hydrochloride.

Samarelli R, Pugliese N, Saleh M, Prioletti M, Cordon R, Cavicchio P Int J Vet Sci Med. 2025; 13(1):1-8.

PMID: 40007641 PMC: 11852231. DOI: 10.1080/23144599.2025.2460919.


Building resource-efficient community databases using open-source software.

Jung S, Cheng C, Lee T, Buble K, Humann J, Zheng P Database (Oxford). 2025; 2025.

PMID: 39937662 PMC: 11833237. DOI: 10.1093/database/baaf005.


Accurate assembly of full-length consensus for viral quasispecies.

Tian J, Gao Z, Li M, Bao E, Zhao J BMC Bioinformatics. 2025; 26(1):36.

PMID: 39893441 PMC: 11787740. DOI: 10.1186/s12859-025-06045-z.


Frameshift variation in the HMG-CoA reductase gene and unresponsiveness to cholesterol-lowering drugs in type 2 diabetes mellitus patients.

Khaleqsefat E, Rasul K, Kheder R, Baban S, Baban J Sci Rep. 2025; 15(1):288.

PMID: 39747109 PMC: 11695833. DOI: 10.1038/s41598-024-75461-7.


References
1.
Staden R . A new computer method for the storage and manipulation of DNA gel reading data. Nucleic Acids Res. 1980; 8(16):3673-94. PMC: 324183. DOI: 10.1093/nar/8.16.3673. View

2.
Huang X . Fast comparison of a DNA sequence with a protein sequence database. Microb Comp Genomics. 1996; 1(4):281-91. DOI: 10.1089/mcg.1996.1.281. View

3.
Peltola H, Soderlund H, Ukkonen E . SEQAID: a DNA sequence assembling program based on a mathematical model. Nucleic Acids Res. 1984; 12(1 Pt 1):307-21. PMC: 321006. DOI: 10.1093/nar/12.1part1.307. View

4.
Pearson W, Lipman D . Improved tools for biological sequence comparison. Proc Natl Acad Sci U S A. 1988; 85(8):2444-8. PMC: 280013. DOI: 10.1073/pnas.85.8.2444. View

5.
Myers E, Miller W . Optimal alignments in linear space. Comput Appl Biosci. 1988; 4(1):11-7. DOI: 10.1093/bioinformatics/4.1.11. View