FLASH: Fast Length Adjustment of Short Reads to Improve Genome Assemblies
Motivation: Next-generation sequencing technologies generate very large numbers of short reads. Even with very deep genome coverage, short read lengths cause problems in de novo assemblies. The use of paired-end libraries with a fragment size shorter than twice the read length provides an opportunity to generate much longer reads by overlapping and merging read pairs before assembling a genome.
Results: We present FLASH, a fast computational tool to extend the length of short reads by overlapping paired-end reads from fragment libraries that are sufficiently short. We tested the correctness of the tool on one million simulated read pairs, and we then applied it as a pre-processor for genome assemblies of Illumina reads from the bacterium Staphylococcus aureus and human chromosome 14. FLASH correctly extended and merged reads >99% of the time on simulated reads with an error rate of <1%. With adequately set parameters, FLASH correctly merged reads over 90% of the time even when the reads contained up to 5% errors. When FLASH was used to extend reads prior to assembly, the resulting assemblies had substantially greater N50 lengths for both contigs and scaffolds.
Availability and implementation: The FLASH system is implemented in C and is freely available as open-source code at http://www.cbcb.umd.edu/software/flash.
Contact: t.magoc@gmail.com.
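To make the merging idea in the abstract concrete, the following is a minimal sketch of overlapping one read pair. It is not FLASH's C implementation. The function names, the `min_overlap` and `max_mismatch_ratio` parameters and their defaults are assumptions chosen for illustration, and base quality scores are ignored. The sketch reverse-complements the second read, scores every candidate overlap by its mismatch ratio, and joins the reads at the best-scoring overlap.

```python
# Illustrative sketch only: names, parameters, and defaults are hypothetical,
# not taken from FLASH.
COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def merge_pair(read1, read2, min_overlap=10, max_mismatch_ratio=0.25):
    """Merge a paired-end read pair into one longer read.

    Returns the merged sequence, or None if no overlap of at least
    min_overlap bases has a mismatch ratio within max_mismatch_ratio.
    """
    # Put read 2 on the same strand as read 1.
    mate = reverse_complement(read2)

    best = None  # (mismatch ratio, overlap length)
    for overlap in range(min_overlap, min(len(read1), len(mate)) + 1):
        # Compare the end of read 1 with the start of the reoriented read 2.
        tail = read1[-overlap:]
        head = mate[:overlap]
        mismatches = sum(a != b for a, b in zip(tail, head))
        ratio = mismatches / overlap
        if ratio > max_mismatch_ratio:
            continue
        # Prefer the lowest mismatch ratio; break ties with the longer overlap.
        if best is None or ratio < best[0] or (ratio == best[0] and overlap > best[1]):
            best = (ratio, overlap)

    if best is None:
        return None
    # Simplification: where the reads disagree, read 1's base is kept.
    return read1 + mate[best[1]:]


# A 30 bp fragment sequenced with 20 bp reads from each end overlaps by 10 bp.
fragment = "ACGTTGCAAGGCTTACCGATGGATCCTAGA"
read1 = fragment[:20]
read2 = reverse_complement(fragment[-20:])
merged = merge_pair(read1, read2)
```

In this example the fragment (30 bp) is shorter than twice the read length (2 x 20 bp), so the two reads share 2 x 20 - 30 = 10 bases and `merged` reconstructs the full fragment. A pair with no acceptable overlap yields `None` and would be passed to the assembler unmerged.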