» Articles » PMID: 20003500

BLAST+: Architecture and Applications

Overview
Publisher Biomed Central
Specialty Biology
Date 2009 Dec 17
PMID 20003500
Citations 8984
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Sequence similarity searching is a very important bioinformatics task. While Basic Local Alignment Search Tool (BLAST) outperforms exact methods through its use of heuristics, the speed of the current BLAST software is suboptimal for very long queries or database sequences. There are also some shortcomings in the user-interface of the current command-line applications.

Results: We describe features and improvements of rewritten BLAST software and introduce new command-line applications. Long query sequences are broken into chunks for processing, in some cases leading to dramatically shorter run times. For long database sequences, it is possible to retrieve only the relevant parts of the sequence, reducing CPU time and memory usage for searches of short queries against databases of contigs or chromosomes. The program can now retrieve masking information for database sequences from the BLAST databases. A new modular software library can now access subject sequence data from arbitrary data sources. We introduce several new features, including strategy files that allow a user to save and reuse their favorite set of options. The strategy files can be uploaded to and downloaded from the NCBI BLAST web site.

Conclusion: The new BLAST command-line applications, compared to the current BLAST tools, demonstrate substantial speed improvements for long queries as well as chromosome length database sequences. We have also improved the user interface of the command-line applications.

Citing Articles

A near-complete genome assembly of Fragaria iinumae.

Du H, He Y, Chen M, Zheng X, Gui D, Tang J BMC Genomics. 2025; 26(1):253.

PMID: 40087556 DOI: 10.1186/s12864-025-11440-0.


Chromosome-level haplotype-resolved genome assembly of bread wheat's wild relative Aegilops mutica.

Grewal S, Yang C, Krasheninnikova K, Collins J, Wood J, Ashling S Sci Data. 2025; 12(1):438.

PMID: 40082453 PMC: 11906796. DOI: 10.1038/s41597-025-04737-y.


A secure visualization platform for pathogenic genome analysis with an accurate reference database.

Fan G, Guo C, Zhang Q, Liu D, Sun Q, Cui Z Biosaf Health. 2025; 6(4):235-243.

PMID: 40078665 PMC: 11894998. DOI: 10.1016/j.bsheal.2024.07.003.


Bacterial Communities and Resistance and Virulence Genes in Hospital and Community Wastewater: Metagenomic Analysis.

Velazquez-Meza M, Galarde-Lopez M, Cornejo-Juarez P, Bobadilla-Del-Valle M, Godoy-Lozano E, Aguilar-Vera E Int J Mol Sci. 2025; 26(5).

PMID: 40076673 PMC: 11900532. DOI: 10.3390/ijms26052051.


A telomere-to-telomere phased genome of an octoploid strawberry reveals a receptor kinase conferring anthracnose resistance.

Han H, Salinas N, Barbey C, Jang Y, Fan Z, Verma S Gigascience. 2025; 14.

PMID: 40072904 PMC: 11899574. DOI: 10.1093/gigascience/giaf005.


References
1.
Morgulis A, Coulouris G, Raytselis Y, Madden T, Agarwala R, Schaffer A . Database indexing for production MegaBLAST searches. Bioinformatics. 2008; 24(16):1757-64. PMC: 2696921. DOI: 10.1093/bioinformatics/btn322. View

2.
Zhang Z, Schaffer A, Miller W, Madden T, Lipman D, Koonin E . Protein sequence similarity searches using patterns as seeds. Nucleic Acids Res. 1998; 26(17):3986-90. PMC: 147803. DOI: 10.1093/nar/26.17.3986. View

3.
Schaffer A, Wolf Y, Ponting C, Koonin E, Aravind L, Altschul S . IMPALA: matching a protein sequence against a collection of PSI-BLAST-constructed position-specific score matrices. Bioinformatics. 2000; 15(12):1000-11. DOI: 10.1093/bioinformatics/15.12.1000. View

4.
Schaffer A, Aravind L, Madden T, Shavirin S, Spouge J, Wolf Y . Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements. Nucleic Acids Res. 2001; 29(14):2994-3005. PMC: 55814. DOI: 10.1093/nar/29.14.2994. View

5.
Kent W . BLAT--the BLAST-like alignment tool. Genome Res. 2002; 12(4):656-64. PMC: 187518. DOI: 10.1101/gr.229202. View