Biomedical Named Entity Recognition Using Improved Green Anaconda-assisted Bi-GRU-based Hierarchical ResNet Model

Overview

Journal BMC Bioinformatics

Publisher Biomed Central

Date 2025 Jan 30

PMID 39885428

Authors

Ram Chandra Bhushan

Rakesh Kumar Donthi

Yojitha Chilukuri

Ulligaddala Srinivasarao

Polisetty Swetha

Affiliations

Soon will be listed here.

Abstract

Background: Biomedical text mining is a technique that extracts essential information from scientific articles using named entity recognition (NER). Traditional NER methods rely on dictionaries, rules, or curated corpora, which may not always be accessible. To overcome these challenges, deep learning (DL) methods have emerged. However, DL-based NER methods may need help identifying long-distance relationships within text and require significant annotated datasets.

Results: This research has proposed a novel model to address the challenges in natural language processing. The Improved Green anaconda-assisted Bi-GRU based Hierarchical ResNet BNER model (IGa-BiHR BNERM) is the model. IGa-BiHR BNERM model has shown promising results in accurately identifying named entities. The MACCROBAT dataset was obtained from Kaggle and underwent several pre-processing steps such as Stop Word Filtering, WordNet processing, Removal of non-alphanumeric characters, stemming Segmentation, and Tokenization, which is standardized and improves its quality. The pre-processed text was fed into a feature extraction model like the Robustly Optimized BERT -Whole Word Masking model. This model provides word embeddings with semantic information. Then, the BNER process utilized an Improved Green Anaconda-assisted Bi-GRU-based Hierarchical ResNet BNER model (IGa-BiHR BNERM).

Conclusion: To improve the training phase of the IGa-BiHR BNERM, the Improved Green Anaconda Optimization technique was used to select optimal weight parameter coefficients for training the model parameters. After the model was tested using the MACCROBAT dataset, it outperformed previous models with a tremendous accuracy rate of 99.11%. This model effectively and accurately identifies biomedical names within the text, significantly advancing this field.

References

Tian Y, Shen W, Song Y, Xia F, He M, Li K . Improving biomedical named entity recognition with syntactic information. BMC Bioinformatics. 2020; 21(1):539. PMC: 7687711. DOI: 10.1186/s12859-020-03834-6. View

Wang Y, Yu Q, Tian Y, Ren S, Liu L, Wei C . Unraveling the impact of nitric oxide, almitrine, and their combination in COVID-19 (at the edge of sepsis) patients: a systematic review. Front Pharmacol. 2024; 14:1172447. PMC: 10839063. DOI: 10.3389/fphar.2023.1172447. View

Cao L, Wu C, Luo G, Guo C, Zheng A . Online biomedical named entities recognition by data and knowledge-driven model. Artif Intell Med. 2024; 150:102813. DOI: 10.1016/j.artmed.2024.102813. View

Islamaj Dogan R, Leaman R, Lu Z . NCBI disease corpus: a resource for disease name recognition and concept normalization. J Biomed Inform. 2014; 47:1-10. PMC: 3951655. DOI: 10.1016/j.jbi.2013.12.006. View

Chen P, Zhang M, Yu X, Li S . Named entity recognition of Chinese electronic medical records based on a hybrid neural network and medical MC-BERT. BMC Med Inform Decis Mak. 2022; 22(1):315. PMC: 9714133. DOI: 10.1186/s12911-022-02059-2. View

Gridach M . Character-level neural network for biomedical named entity recognition. J Biomed Inform. 2017; 70:85-91. DOI: 10.1016/j.jbi.2017.05.002. View

Chen X, Ouyang C, Liu Y, Bu Y . Improving the Named Entity Recognition of Chinese Electronic Medical Records by Combining Domain Dictionary and Rules. Int J Environ Res Public Health. 2020; 17(8). PMC: 7215438. DOI: 10.3390/ijerph17082687. View

Chai Z, Jin H, Shi S, Zhan S, Zhuo L, Yang Y . Hierarchical shared transfer learning for biomedical named entity recognition. BMC Bioinformatics. 2022; 23(1):8. PMC: 8729142. DOI: 10.1186/s12859-021-04551-4. View

Wei J, Hu T, Dai J, Wang Z, Han P, Huang W . Research on named entity recognition of adverse drug reactions based on NLP and deep learning. Front Pharmacol. 2023; 14:1121796. PMC: 10270322. DOI: 10.3389/fphar.2023.1121796. View

Luo X, Qin F, Xiao F, Cai G . BISC: accurate inference of transcriptional bursting kinetics from single-cell transcriptomic data. Brief Bioinform. 2022; 23(6). DOI: 10.1093/bib/bbac464. View

Dehghani M, Trojovsky P, Malik O . Green Anaconda Optimization: A New Bio-Inspired Metaheuristic Algorithm for Solving Optimization Problems. Biomimetics (Basel). 2023; 8(1). PMC: 10046581. DOI: 10.3390/biomimetics8010121. View

10.

Kosprdic M, Prodanovic N, Ljajic A, Basaragin B, Milosevic N . From zero to hero: Harnessing transformers for biomedical named entity recognition in zero- and few-shot contexts. Artif Intell Med. 2024; 156:102970. DOI: 10.1016/j.artmed.2024.102970. View

11.

Fabregat H, Duque A, Martinez-Romo J, Araujo L . Negation-based transfer learning for improving biomedical Named Entity Recognition and Relation Extraction. J Biomed Inform. 2023; 138:104279. DOI: 10.1016/j.jbi.2022.104279. View

12.

Li J, Sun Y, Johnson R, Sciaky D, Wei C, Leaman R . BioCreative V CDR task corpus: a resource for chemical disease relation extraction. Database (Oxford). 2016; 2016. PMC: 4860626. DOI: 10.1093/database/baw068. View

13.

Luo L, Lai P, Wei C, Arighi C, Lu Z . BioRED: a rich biomedical relation extraction dataset. Brief Bioinform. 2022; 23(5). PMC: 9487702. DOI: 10.1093/bib/bbac282. View

14.

Alamro H, Gojobori T, Essack M, Gao X . BioBBC: a multi-feature model that enhances the detection of biomedical entities. Sci Rep. 2024; 14(1):7697. PMC: 10987643. DOI: 10.1038/s41598-024-58334-x. View

15.

Ramachandran R, Arutchelvan K . Named entity recognition on bio-medical literature documents using hybrid based approach. J Ambient Intell Humaniz Comput. 2021; :1-10. PMC: 7947151. DOI: 10.1007/s12652-021-03078-z. View

16.

Sung M, Jeong M, Choi Y, Kim D, Lee J, Kang J . BERN2: an advanced neural biomedical named entity recognition and normalization tool. Bioinformatics. 2022; 38(20):4837-4839. PMC: 9563680. DOI: 10.1093/bioinformatics/btac598. View

17.

Xu Q, Jiang H, Zhang X, Li J, Chen L . Multiscale Convolutional Neural Network Based on Channel Space Attention for Gearbox Compound Fault Diagnosis. Sensors (Basel). 2023; 23(8). PMC: 10141628. DOI: 10.3390/s23083827. View

18.

Guan Z, Zhou X . A prefix and attention map discrimination fusion guided attention for biomedical named entity recognition. BMC Bioinformatics. 2023; 24(1):42. PMC: 9907889. DOI: 10.1186/s12859-023-05172-9. View