» Articles » PMID: 39624040

ILDIM-MFAM: Interstitial Lung Disease Identification Model with Multi-modal Fusion Attention Mechanism

Overview
Specialty General Medicine
Date 2024 Dec 3
PMID 39624040
Authors
Affiliations
Soon will be listed here.
Abstract

This study aims to address the potential and challenges of multimodal medical information in the diagnosis of interstitial lung disease (ILD) by developing an ILD identification model (ILDIM) based on the multimodal fusion attention mechanism (MFAM) to improve the accuracy and reliability of ILD. Large-scale multimodal medical information data, including chest CT image slices, physiological indicator time series data, and patient history text information were collected. These data are professionally cleaned and normalized to ensure data quality and consistency. Convolutional Neural Network (CNN) is used to extract CT image features, Bidirectional Long Short-Term Memory Network (Bi-LSTM) model is used to learn temporal physiological metrics data under long-term dependency, and Self-Attention Mechanism is used to encode textual semantic information in patient's self-reporting and medical prescriptions. In addition, the multimodal perception mechanism uses a Transformer-based model to improve the diagnostic performance of ILD by learning the importance weights of each modality's data to optimally fuse the different modalities. Finally, the ablation test and comparison results show that the model performs well in terms of comprehensive performance. By combining multimodal data sources, the model not only improved the Precision, Recall and F1 score, but also significantly increased the AUC value. This suggests that the combined use of different modal information can provide a more comprehensive assessment of a patient's health status, thereby improving the diagnostic comprehensiveness and accuracy of ILD. This study also considered the computational complexity of the model, and the results show that ILDIM-MFAM has a relatively low number of model parameters and computational complexity, which is very favorable for practical deployment and operational efficiency.

References
1.
Zhou H, Yu Y, Wang C, Zhang S, Gao Y, Pan J . A transformer-based representation-learning model with unified processing of multimodal input for clinical diagnostics. Nat Biomed Eng. 2023; 7(6):743-755. DOI: 10.1038/s41551-023-01045-x. View

2.
Zhong B, Luo S . Identifying the link between serum VEGF and KL-6 concentrations: a correlation analysis for idiopathic pulmonary fibrosis interstitial lung disease progression. Front Med (Lausanne). 2023; 10:1282757. PMC: 10722179. DOI: 10.3389/fmed.2023.1282757. View

3.
Chaudhuri N, Spencer L, Greaves M, Bishop P, Chaturvedi A, Leonard C . A Review of the Multidisciplinary Diagnosis of Interstitial Lung Diseases: A Retrospective Analysis in a Single UK Specialist Centre. J Clin Med. 2016; 5(8). PMC: 4999786. DOI: 10.3390/jcm5080066. View

4.
Depeursinge A, Racoceanu D, Iavindrasana J, Cohen G, Platon A, Poletti P . Fusing visual and clinical information for lung tissue classification in high-resolution computed tomography. Artif Intell Med. 2010; 50(1):13-21. DOI: 10.1016/j.artmed.2010.04.006. View

5.
Ryerson C, Vittinghoff E, Ley B, Lee J, Mooney J, Jones K . Predicting survival across chronic interstitial lung disease: the ILD-GAP model. Chest. 2013; 145(4):723-728. DOI: 10.1378/chest.13-1474. View