» Articles » PMID: 35225328

BERT6mA: Prediction of DNA N6-methyladenine Site Using Deep Learning-based Approaches

Overview
Journal Brief Bioinform
Specialty Biology
Date 2022 Feb 28
PMID 35225328
Authors
Affiliations
Soon will be listed here.
Abstract

N6-methyladenine (6mA) is associated with important roles in DNA replication, DNA repair, transcription, regulation of gene expression. Several experimental methods were used to identify DNA modifications. However, these experimental methods are costly and time-consuming. To detect the 6mA and complement these shortcomings of experimental methods, we proposed a novel, deep leaning approach called BERT6mA. To compare the BERT6mA with other deep learning approaches, we used the benchmark datasets including 11 species. The BERT6mA presented the highest AUCs in eight species in independent tests. Furthermore, BERT6mA showed higher and comparable performance with the state-of-the-art models while the BERT6mA showed poor performances in a few species with a small sample size. To overcome this issue, pretraining and fine-tuning between two species were applied to the BERT6mA. The pretrained and fine-tuned models on specific species presented higher performances than other models even for the species with a small sample size. In addition to the prediction, we analyzed the attention weights generated by BERT6mA to reveal how the BERT6mA model extracts critical features responsible for the 6mA prediction. To facilitate biological sciences, the BERT6mA online web server and its source codes are freely accessible at https://github.com/kuratahiroyuki/BERT6mA.git, respectively.

Citing Articles

Deep5mC: Predicting 5-methylcytosine (5mC) methylation status using a deep learning transformer approach.

Kinnear E, Derbel H, Zhao Z, Liu Q Comput Struct Biotechnol J. 2025; 27:631-638.

PMID: 40041569 PMC: 11879672. DOI: 10.1016/j.csbj.2025.02.007.


RNA sequence analysis landscape: A comprehensive review of task types, databases, datasets, word embedding methods, and language models.

Asim M, Ibrahim M, Asif T, Dengel A Heliyon. 2025; 11(2):e41488.

PMID: 39897847 PMC: 11783440. DOI: 10.1016/j.heliyon.2024.e41488.


iResNetDM: An interpretable deep learning approach for four types of DNA methylation modification prediction.

Yang Z, Shao W, Matsuda Y, Song L Comput Struct Biotechnol J. 2024; 23:4214-4221.

PMID: 39650332 PMC: 11621598. DOI: 10.1016/j.csbj.2024.11.006.


PSATF-6mA: an integrated learning fusion feature-encoded DNA-6 mA methylcytosine modification site recognition model based on attentional mechanisms.

Kang Y, Wang H, Qin Y, Liu G, Yu Y, Zhang Y Front Genet. 2024; 15:1498884.

PMID: 39600317 PMC: 11588721. DOI: 10.3389/fgene.2024.1498884.


MFPSP: Identification of fungal species-specific phosphorylation site using offspring competition-based genetic algorithm.

Wang C, Zou Q PLoS Comput Biol. 2024; 20(11):e1012607.

PMID: 39556608 PMC: 11611262. DOI: 10.1371/journal.pcbi.1012607.


References
1.
Boulias K, Greer E . Detection of DNA Methylation in Genomic DNA by UHPLC-MS/MS. Methods Mol Biol. 2020; 2198:79-90. PMC: 8281577. DOI: 10.1007/978-1-0716-0876-0_7. View

2.
Huang Q, Zhou W, Guo F, Xu L, Zhang L . 6mA-Pred: identifying DNA N6-methyladenine sites based on deep learning. PeerJ. 2021; 9:e10813. PMC: 7866889. DOI: 10.7717/peerj.10813. View

3.
Li Z, Jiang H, Kong L, Chen Y, Lang K, Fan X . Deep6mA: A deep learning framework for exploring similar patterns in DNA N6-methyladenine sites across different species. PLoS Comput Biol. 2021; 17(2):e1008767. PMC: 7924747. DOI: 10.1371/journal.pcbi.1008767. View

4.
Charoenkwan P, Nantasenamat C, Hasan M, Manavalan B, Shoombuatong W . BERT4Bitter: a bidirectional encoder representations from transformers (BERT)-based model for improving the prediction of bitter peptides. Bioinformatics. 2021; 37(17):2556-2562. DOI: 10.1093/bioinformatics/btab133. View

5.
Flusberg B, Webster D, Lee J, Travers K, Olivares E, Clark T . Direct detection of DNA methylation during single-molecule, real-time sequencing. Nat Methods. 2010; 7(6):461-5. PMC: 2879396. DOI: 10.1038/nmeth.1459. View