FastUniq: a Fast De Novo Duplicates Removal Tool for Paired Short Reads

Overview
Journal PLoS One
Date 2013 Jan 4
PMID 23284954
Citations 282
Abstract

The presence of duplicates introduced by PCR amplification is a major issue in paired short reads from next-generation sequencing platforms. These duplicates might seriously affect research applications, such as scaffolding in whole-genome sequencing and the discovery of large-scale genome variations, and are usually removed. We present FastUniq, a fast de novo tool for removing duplicates in paired short reads. FastUniq identifies duplicates by comparing sequences between read pairs and does not require a complete genome sequence as a prerequisite. FastUniq can handle reads of different lengths simultaneously, and its running time increases linearly with the number of reads, at an average rate of 87 million reads per 10 minutes. FastUniq is freely available at http://sourceforge.net/projects/fastuniq/.
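
To make the idea of reference-free duplicate removal concrete, the following is a minimal Python sketch that treats two read pairs as duplicates only when both mates have identical sequences. It is an illustration under stated assumptions, not FastUniq's actual implementation: the function name `remove_duplicate_pairs`, the in-memory set of seen pairs, and the exact-match criterion are all hypothetical choices, and the abstract does not specify how FastUniq compares reads of different lengths, so that case is simply treated as "not identical" here.

```python
from typing import Iterable, Iterator, Tuple

# A read pair is modeled as (mate 1 sequence, mate 2 sequence).
# Quality strings and read names are omitted to keep the sketch short.
ReadPair = Tuple[str, str]


def remove_duplicate_pairs(pairs: Iterable[ReadPair]) -> Iterator[ReadPair]:
    """Yield the first occurrence of each distinct read pair.

    Assumption: a duplicate is a pair whose two mate sequences are both
    exactly identical to those of an earlier pair. No reference genome
    or alignment is used, only the sequences themselves.
    """
    seen = set()
    for mate1, mate2 in pairs:
        key = (mate1, mate2)
        if key not in seen:
            seen.add(key)
            yield (mate1, mate2)


# Toy input (hypothetical sequences, not real data).
pairs = [
    ("ACGT", "TTGA"),
    ("ACGT", "TTGA"),    # identical to the first pair: dropped
    ("ACGT", "TTGC"),    # same mate 1, different mate 2: kept
    ("ACGTAC", "TTGA"),  # different read length: kept under exact matching
]
unique_pairs = list(remove_duplicate_pairs(pairs))
```

Comparing the two mates together, rather than each read alone, is what distinguishes paired duplicate removal: the third toy pair shares its first mate with the first pair but is still kept. Holding every pair in a set costs memory proportional to the number of distinct pairs, so a sketch like this would only suit small inputs.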

Citing Articles

The six whole mitochondrial genomes for the species: features, evolution and phylogeny.

Xie S, Ma X, Wu H, Zang R, Li H, Liu M IMA Fungus. 2025; 16:e140572.

PMID: 40059983 PMC: 11889515. DOI: 10.3897/imafungus.16.140572.


methylGrapher: genome-graph-based processing of DNA methylation data from whole genome bisulfite sequencing.

Zhang W, Macias-Velasco J, Zhuo X, Belter Jr E, Tomlinson C, Garza J Nucleic Acids Res. 2025; 53(3).

PMID: 39868538 PMC: 11770346. DOI: 10.1093/nar/gkaf028.


Eukaryotic composition across seasons and social groups in the gut microbiota of wild baboons.

Chege M, Ferretti P, Webb S, Macharia R, Obiero G, Kamau J bioRxiv. 2025.

PMID: 39763902 PMC: 11702614. DOI: 10.1101/2024.12.17.628920.


Long non-coding RNAs direct the SWI/SNF complex to cell type-specific enhancers.

Oo J, Warwick T, Palfi K, Lam F, McNicoll F, Prieto-Garcia C Nat Commun. 2025; 16(1):131.

PMID: 39747144 PMC: 11695977. DOI: 10.1038/s41467-024-55539-6.


The global RNA-binding protein RbpB is a regulator of polysaccharide utilization in Bacteroides thetaiotaomicron.

Ruttiger A, Ryan D, Spiga L, Lamm-Schmidt V, Prezza G, Reichardt S Nat Commun. 2025; 16(1):208.

PMID: 39747016 PMC: 11697453. DOI: 10.1038/s41467-024-55383-8.

