Development and Evaluation of Novel Ophthalmology Domain-specific Neural Word Embeddings to Predict Visual Prognosis
Overview
Affiliations
Objective: To develop and evaluate novel word embeddings (WEs) specific to ophthalmology, using text corpora from published literature and electronic health records (EHR).
Materials And Methods: We trained ophthalmology-specific WEs using 121,740 PubMed abstracts and 89,282 EHR notes using word2vec continuous bag-of-words architecture. PubMed and EHR WEs were compared to general domain GloVe WEs and general biomedical domain BioWordVec embeddings using a novel ophthalmology-domain-specific 200-question analogy test and prediction of prognosis in 5547 low vision patients using EHR notes as inputs to a deep learning model.
Results: We found that many words representing important ophthalmic concepts in the EHR were missing from the general domain GloVe vocabulary, but covered in the ophthalmology abstract corpus. On ophthalmology analogy testing, PubMed WEs scored 95.0 %, outperforming EHR (86.0 %) and GloVe (91.0 %) but less than BioWordVec (99.5 %). On predicting low vision prognosis, PubMed and EHR WEs resulted in similar AUROC (0.830; 0.826), outperforming GloVe (0.778) and BioWordVec (0.784).
Conclusion: We found that using ophthalmology domain-specific WEs improved performance in ophthalmology-related clinical prediction compared to general WEs. Deep learning models using clinical notes as inputs can predict the prognosis of visually impaired patients. This work provides a framework to improve predictive models using domain-specific WEs.
Visual acuity prediction on real-life patient data using a machine learning based multistage system.
Schlosser T, Beuth F, Meyer T, Kumar A, Stolze G, Furashova O Sci Rep. 2024; 14(1):5532.
PMID: 38448469 PMC: 10917755. DOI: 10.1038/s41598-024-54482-2.
Use of artificial intelligence in forecasting glaucoma progression.
Thakur S, Dinh L, Lavanya R, Quek T, Liu Y, Cheng C Taiwan J Ophthalmol. 2023; 13(2):168-183.
PMID: 37484617 PMC: 10361424. DOI: 10.4103/tjo.TJO-D-23-00022.
Jalamangala Shivananjaiah S, Kumari S, Majid I, Wang S Front Med (Lausanne). 2023; 10:1157016.
PMID: 37122330 PMC: 10133544. DOI: 10.3389/fmed.2023.1157016.
Impact of word embedding models on text analytics in deep learning environment: a review.
Asudani D, Nagwani N, Singh P Artif Intell Rev. 2023; :1-81.
PMID: 36844886 PMC: 9944441. DOI: 10.1007/s10462-023-10419-1.
Wang S, Tseng B, Hernandez-Boussard T Ophthalmol Sci. 2022; 2(2):100127.
PMID: 36249690 PMC: 9559076. DOI: 10.1016/j.xops.2022.100127.