» Articles » PMID: 35402983

Pairwise Heuristic Sequence Alignment Algorithm Based on Deep Reinforcement Learning

Overview
Publisher IEEE
Date 2022 Apr 11
PMID 35402983
Authors
Affiliations
Soon will be listed here.
Abstract

Various methods have been developed to analyze the association between organisms and their genomic sequences. Among them, sequence alignment is the most frequently used method for comparative analysis of biological genomes. We intend to propose a novel pairwise sequence alignment method using deep reinforcement learning to break out the old pairwise alignment algorithms. We defined the environment and agent to enable reinforcement learning in the sequence alignment system. This novel method, named DQNalign, can immediately determine the next direction by observing the subsequences within the moving window. DQNalign shows superiority in the dissimilar sequence pairs that have low identity values. And theoretically, we confirm that DQNalign has a low dimension for the sequence length in view of the complexity. This research shows the application method of deep reinforcement learning to the sequence alignment system and how deep reinforcement learning can improve the conventional sequence alignment method.

Citing Articles

Deep reinforcement learning-based pairwise DNA sequence alignment method compatible with embedded edge devices.

Lall A, Tallur S Sci Rep. 2023; 13(1):2773.

PMID: 36797269 PMC: 9935504. DOI: 10.1038/s41598-023-29277-6.


learnMSA: learning and aligning large protein families.

Becker F, Stanke M Gigascience. 2022; 11.

PMID: 36399060 PMC: 9673500. DOI: 10.1093/gigascience/giac104.


Heuristic Pairwise Alignment in Database Environments.

Liptak P, Kiss A, Szalai-Gindl J Genes (Basel). 2022; 13(11).

PMID: 36360242 PMC: 9690874. DOI: 10.3390/genes13112005.


Local Alignment of DNA Sequence Based on Deep Reinforcement Learning.

Song Y, Cho D IEEE Open J Eng Med Biol. 2022; 2:170-178.

PMID: 35402982 PMC: 8975175. DOI: 10.1109/OJEMB.2021.3076156.

References
1.
Chao K, Pearson W, Miller W . Aligning two sequences within a specified diagonal band. Comput Appl Biosci. 1992; 8(5):481-7. DOI: 10.1093/bioinformatics/8.5.481. View

2.
Tang J, Hua K, Chen M, Zhang R, Xie X . A novel k-word relative measure for sequence comparison. Comput Biol Chem. 2014; 53PB:331-338. DOI: 10.1016/j.compbiolchem.2014.10.007. View

3.
Hayashi T, Makino K, Ohnishi M, Kurokawa K, Ishii K, Yokoyama K . Complete genome sequence of enterohemorrhagic Escherichia coli O157:H7 and genomic comparison with a laboratory strain K-12. DNA Res. 2001; 8(1):11-22. DOI: 10.1093/dnares/8.1.11. View

4.
NEEDLEMAN S, Wunsch C . A general method applicable to the search for similarities in the amino acid sequence of two proteins. J Mol Biol. 1970; 48(3):443-53. DOI: 10.1016/0022-2836(70)90057-4. View

5.
Wolfsheimer S, Burghardt B, Hartmann A . Local sequence alignments statistics: deviations from Gumbel statistics in the rare-event tail. Algorithms Mol Biol. 2007; 2:9. PMC: 1945026. DOI: 10.1186/1748-7188-2-9. View