» Articles » PMID: 31510656

Deep Learning with Multimodal Representation for Pancancer Prognosis Prediction

Overview
Journal Bioinformatics
Specialty Biology
Date 2019 Sep 13
PMID 31510656
Citations 101
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: Estimating the future course of patients with cancer lesions is invaluable to physicians; however, current clinical methods fail to effectively use the vast amount of multimodal data that is available for cancer patients. To tackle this problem, we constructed a multimodal neural network-based model to predict the survival of patients for 20 different cancer types using clinical data, mRNA expression data, microRNA expression data and histopathology whole slide images (WSIs). We developed an unsupervised encoder to compress these four data modalities into a single feature vector for each patient, handling missing data through a resilient, multimodal dropout method. Encoding methods were tailored to each data type-using deep highway networks to extract features from clinical and genomic data, and convolutional neural networks to extract features from WSIs.

Results: We used pancancer data to train these feature encodings and predict single cancer and pancancer overall survival, achieving a C-index of 0.78 overall. This work shows that it is possible to build a pancancer model for prognosis that also predicts prognosis in single cancer sites. Furthermore, our model handles multiple data modalities, efficiently analyzes WSIs and represents patient multimodal data flexibly into an unsupervised, informative representation. We thus present a powerful automated tool to accurately determine prognosis, a key step towards personalized treatment for cancer patients.

Availability And Implementation: https://github.com/gevaertlab/MultimodalPrognosis.

Citing Articles

Using mathematical modelling and AI to improve delivery and efficacy of therapies in cancer.

Harkos C, Hadjigeorgiou A, Voutouri C, Kumar A, Stylianopoulos T, Jain R Nat Rev Cancer. 2025; .

PMID: 39972158 DOI: 10.1038/s41568-025-00796-w.


MMnc: multi-modal interpretable representation for non-coding RNA classification and class annotation.

Creux C, Zehraoui F, Radvanyi F, Tahi F Bioinformatics. 2025; 41(3).

PMID: 39891346 PMC: 11890286. DOI: 10.1093/bioinformatics/btaf051.


Decoding pan-cancer treatment outcomes using multimodal real-world data and explainable artificial intelligence.

Keyl J, Keyl P, Montavon G, Hosch R, Brehmer A, Mochmann L Nat Cancer. 2025; 6(2):307-322.

PMID: 39885364 PMC: 11864985. DOI: 10.1038/s43018-024-00891-1.


Utility of Artificial Intelligence for Decision Making in Thoracic Multidisciplinary Tumor Boards.

Zabaleta J, Aguinagalde B, Lopez I, Fernandez-Monge A, Lizarbe J, Mainer M J Clin Med. 2025; 14(2).

PMID: 39860405 PMC: 11765867. DOI: 10.3390/jcm14020399.


Two-Step Transfer Learning Improves Deep Learning-Based Drug Response Prediction in Small Datasets: A Case Study of Glioblastoma.

Ju J, Ntafoulis I, Klein M, Jt Reinders M, Lamfers M, Stubbs A Bioinform Biol Insights. 2025; 19:11779322241301507.

PMID: 39763506 PMC: 11700395. DOI: 10.1177/11779322241301507.


References
1.
Christinat Y, Krek W . Integrated genomic analysis identifies subclasses and prognosis signatures of kidney cancer. Oncotarget. 2015; 6(12):10521-31. PMC: 4496372. DOI: 10.18632/oncotarget.3294. View

2.
Daemen A, Gevaert O, De Bie T, Debucquoy A, Machiels J, De Moor B . Integrating microarray and proteomics data to predict the response on cetuximab in patients with rectal cancer. Pac Symp Biocomput. 2008; :166-77. View

3.
Gevaert O, Villalobos V, Sikic B, Plevritis S . Identification of ovarian cancer driver genes by using module network integration of multi-omics data. Interface Focus. 2014; 3(4):20130013. PMC: 3915833. DOI: 10.1098/rsfs.2013.0013. View

4.
Gevaert O, De Smet F, Timmerman D, Moreau Y, De Moor B . Predicting the prognosis of breast cancer by integrating clinical and microarray data with Bayesian networks. Bioinformatics. 2006; 22(14):e184-90. DOI: 10.1093/bioinformatics/btl230. View

5.
Daemen A, Gevaert O, Ojeda F, Debucquoy A, Suykens J, Sempoux C . A kernel-based integration of genome-wide data for clinical decision support. Genome Med. 2009; 1(4):39. PMC: 2684660. DOI: 10.1186/gm39. View