» Articles » PMID: 35772072

Proposal of Smith-Waterman Algorithm on FPGA to Accelerate the Forward and Backtracking Steps

Overview
Journal PLoS One
Date 2022 Jun 30
PMID 35772072
Authors
Affiliations
Soon will be listed here.
Abstract

In bioinformatics, alignment is an essential technique for finding similarities between biological sequences. Usually, the alignment is performed with the Smith-Waterman (SW) algorithm, a well-known sequence alignment technique of high-level precision based on dynamic programming. However, given the massive data volume in biological databases and their continuous exponential increase, high-speed data processing is necessary. Therefore, this work proposes a parallel hardware design for the SW algorithm with a systolic array structure to accelerate the forward and backtracking steps. For this purpose, the architecture calculates and stores the paths in the forward stage for pre-organizing the alignment, which reduces the complexity of the backtracking stage. The backtracking starts from the maximum score position in the matrix and generates the optimal SW sequence alignment path. The architecture was validated on Field-Programmable Gate Array (FPGA), and synthesis analyses have shown that the proposed design reaches up to 79.5 Giga Cell Updates per Second (GCPUS).

Citing Articles

CUDASW++4.0: ultra-fast GPU-based Smith-Waterman protein sequence database search.

Schmidt B, Kallenborn F, Chacon A, Hundt C BMC Bioinformatics. 2024; 25(1):342.

PMID: 39488701 PMC: 11531700. DOI: 10.1186/s12859-024-05965-6.

References
1.
Liu Y, Maskell D, Schmidt B . CUDASW++: optimizing Smith-Waterman sequence database searches for CUDA-enabled graphics processing units. BMC Res Notes. 2009; 2:73. PMC: 2694204. DOI: 10.1186/1756-0500-2-73. View

2.
Masseroli M, Canakoglu A, Pinoli P, Kaitoua A, Gulino A, Horlova O . Processing of big heterogeneous genomic datasets for tertiary analysis of Next Generation Sequencing data. Bioinformatics. 2018; 35(5):729-736. DOI: 10.1093/bioinformatics/bty688. View

3.
Zhou P, Shi Z . SARS-CoV-2 spillover events. Science. 2021; 371(6525):120-122. DOI: 10.1126/science.abf6097. View

4.
Miller D, Martin M, Harel N, Tirosh O, Kustin T, Meir M . Full genome viral sequences inform patterns of SARS-CoV-2 spread into and within Israel. Nat Commun. 2020; 11(1):5518. PMC: 7606475. DOI: 10.1038/s41467-020-19248-0. View

5.
Courneya J, Mayo A . High-performance computing service for bioinformatics and data science. J Med Libr Assoc. 2018; 106(4):494-495. PMC: 6148605. DOI: 10.5195/jmla.2018.512. View