» Articles » PMID: 30423086

Fastp: an Ultra-fast All-in-one FASTQ Preprocessor

Overview
Journal Bioinformatics
Specialty Biology
Date 2018 Nov 14
PMID 30423086
Citations 8465
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: Quality control and preprocessing of FASTQ files are essential to providing clean data for downstream analysis. Traditionally, a different tool is used for each operation, such as quality control, adapter trimming and quality filtering. These tools are often insufficiently fast as most are developed using high-level programming languages (e.g. Python and Java) and provide limited multi-threading support. Reading and loading data multiple times also renders preprocessing slow and I/O inefficient.

Results: We developed fastp as an ultra-fast FASTQ preprocessor with useful quality control and data-filtering features. It can perform quality control, adapter trimming, quality filtering, per-read quality pruning and many other operations with a single scan of the FASTQ data. This tool is developed in C++ and has multi-threading support. Based on our evaluation, fastp is 2-5 times faster than other FASTQ preprocessing tools such as Trimmomatic or Cutadapt despite performing far more operations than similar tools.

Availability And Implementation: The open-source code and corresponding instructions are available at https://github.com/OpenGene/fastp.

Citing Articles

Chromosome-level genome assembly of a critically endangered species Leuciscus chuanchicus.

Wang Q, Zhou Q, Liu H, Li J, Jiang Y Sci Data. 2025; 12(1):441.

PMID: 40089515 DOI: 10.1038/s41597-025-04787-2.


Axonal RNA localization is essential for long-term memory.

de Queiroz B, Laghrissi H, Rajeev S, Blot L, De Graeve F, Dehecq M Nat Commun. 2025; 16(1):2560.

PMID: 40089499 DOI: 10.1038/s41467-025-57651-7.


Single-cell and chromatin accessibility profiling reveals regulatory programs of pathogenic Th2 cells in allergic asthma.

Khan M, Alteneder M, Reiter W, Krausgruber T, Dobnikar L, Madern M Nat Commun. 2025; 16(1):2565.

PMID: 40089475 DOI: 10.1038/s41467-025-57590-3.


Diversity and functional features of the root-associated bacteriome are dependent on grapevine susceptibility to Plasmopara viticola.

Duret M, Wallner A, Besaury L, Aziz A Environ Microbiome. 2025; 20(1):30.

PMID: 40087775 DOI: 10.1186/s40793-025-00690-w.


NLRP4 unlocks an NK/macrophages-centered ecosystem to suppress non-small cell lung cancer.

Meng Z, Li J, Wang H, Cao Z, Lu W, Niu X Biomark Res. 2025; 13(1):44.

PMID: 40087771 DOI: 10.1186/s40364-025-00756-4.


References
1.
Smith T, Heger A, Sudbery I . UMI-tools: modeling sequencing errors in Unique Molecular Identifiers to improve quantification accuracy. Genome Res. 2017; 27(3):491-499. PMC: 5340976. DOI: 10.1101/gr.209601.116. View

2.
Langmead B, Salzberg S . Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012; 9(4):357-9. PMC: 3322381. DOI: 10.1038/nmeth.1923. View

3.
Chiang C, Layer R, Faust G, Lindberg M, Rose D, Garrison E . SpeedSeq: ultra-fast personal genome analysis and interpretation. Nat Methods. 2015; 12(10):966-8. PMC: 4589466. DOI: 10.1038/nmeth.3505. View

4.
Newman A, Lovejoy A, Klass D, Kurtz D, Chabon J, Scherer F . Integrated digital error suppression for improved detection of circulating tumor DNA. Nat Biotechnol. 2016; 34(5):547-555. PMC: 4907374. DOI: 10.1038/nbt.3520. View

5.
Bolger A, Lohse M, Usadel B . Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014; 30(15):2114-20. PMC: 4103590. DOI: 10.1093/bioinformatics/btu170. View