» Articles » PMID: 37508462

Transformer Architecture and Attention Mechanisms in Genome Data Analysis: A Comprehensive Review

Overview
Journal Biology (Basel)
Publisher MDPI
Specialty Biology
Date 2023 Jul 29
PMID 37508462
Authors
Affiliations
Soon will be listed here.
Abstract

The emergence and rapid development of deep learning, specifically transformer-based architectures and attention mechanisms, have had transformative implications across several domains, including bioinformatics and genome data analysis. The analogous nature of genome sequences to language texts has enabled the application of techniques that have exhibited success in fields ranging from natural language processing to genomic data. This review provides a comprehensive analysis of the most recent advancements in the application of transformer architectures and attention mechanisms to genome and transcriptome data. The focus of this review is on the critical evaluation of these techniques, discussing their advantages and limitations in the context of genome data analysis. With the swift pace of development in deep learning methodologies, it becomes vital to continually assess and reflect on the current standing and future direction of the research. Therefore, this review aims to serve as a timely resource for both seasoned researchers and newcomers, offering a panoramic view of the recent advancements and elucidating the state-of-the-art applications in the field. Furthermore, this review paper serves to highlight potential areas of future investigation by critically evaluating studies from 2019 to 2023, thereby acting as a stepping-stone for further research endeavors.

Citing Articles

Applications of Artificial Intelligence, Deep Learning, and Machine Learning to Support the Analysis of Microscopic Images of Cells and Tissues.

Ali M, Benfante V, Basirinia G, Alongi P, Sperandeo A, Quattrocchi A J Imaging. 2025; 11(2).

PMID: 39997561 PMC: 11856378. DOI: 10.3390/jimaging11020059.


Sudden Fall Detection of Human Body Using Transformer Model.

Kibet D, So M, Kang H, Han Y, Shin J Sensors (Basel). 2025; 24(24.

PMID: 39771788 PMC: 11679820. DOI: 10.3390/s24248051.


Deep learning-based metabolomics data study of prostate cancer.

Sun L, Fan X, Zhao Y, Zhang Q, Jiang M BMC Bioinformatics. 2024; 25(1):391.

PMID: 39725937 PMC: 11674358. DOI: 10.1186/s12859-024-06016-w.


Opportunities, challenges and future perspectives of using bioinformatics and artificial intelligence techniques on tropical disease identification using omics data.

Vidanagamachchi S, Waidyarathna K Front Digit Health. 2024; 6:1471200.

PMID: 39654982 PMC: 11625773. DOI: 10.3389/fdgth.2024.1471200.


CalTrig: A GUI-based Machine Learning Approach for Decoding Neuronal Calcium Transients in Freely Moving Rodents.

Lange M, Chen Y, Fu H, Korada A, Guo C, Ma Y bioRxiv. 2024; .

PMID: 39372793 PMC: 11451592. DOI: 10.1101/2024.09.30.615860.


References
1.
Raad J, Bugnon L, Milone D, Stegmayer G . miRe2e: a full end-to-end deep model based on transformers for prediction of pre-miRNAs. Bioinformatics. 2021; 38(5):1191-1197. DOI: 10.1093/bioinformatics/btab823. View

2.
Manica M, Oskooei A, Born J, Subramanian V, Saez-Rodriguez J, Rodriguez Martinez M . Toward Explainable Anticancer Compound Sensitivity Prediction via Multimodal Attention-Based Convolutional Encoders. Mol Pharm. 2019; 16(12):4797-4806. DOI: 10.1021/acs.molpharmaceut.9b00520. View

3.
Xiao L, Wan Y, Jiang Z . AttCRISPR: a spacetime interpretable model for prediction of sgRNA on-target activity. BMC Bioinformatics. 2021; 22(1):589. PMC: 8667445. DOI: 10.1186/s12859-021-04509-6. View

4.
Du Z, Xiao X, Uversky V . DeepA-RBPBS: A hybrid convolution and recurrent neural network combined with attention mechanism for predicting RBP binding site. J Biomol Struct Dyn. 2020; 40(9):4250-4258. DOI: 10.1080/07391102.2020.1854861. View

5.
Dominic N, Cenggoro T, Budiarto A, Pardamean B . Deep polygenic neural network for predicting and identifying yield-associated genes in Indonesian rice accessions. Sci Rep. 2022; 12(1):13823. PMC: 9378700. DOI: 10.1038/s41598-022-16075-9. View