» Articles » PMID: 34377946

A Two-stage Approach for Combining Gene Expression and Mutation with Clinical Data Improves Survival Prediction in Myelodysplastic Syndromes and Ovarian Cancer

Overview
Date 2021 Aug 11
PMID 34377946
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: Many traditional clinical prognostic factors have been known for cancer for years, but usually provide poor survival prediction. Genomic information is more easily available now which offers opportunities to build more accurate prognostic models. The challenge is how to integrate them to improve survival prediction. The common approach of jointly analyzing all type of covariates directly in one single model may not improve the prediction due to increased model complexity and cannot be easily applied to different datasets.

Results: We proposed a two-stage procedure to better combine different sources of information for survival prediction, and applied the two-stage procedure in two cancer datasets: myelodysplastic syndromes (MDS) and ovarian cancer. Our analysis suggests that the prediction performance of different data types are very different, and combining clinical, gene expression and mutation data using the two-stage procedure improves survival prediction in terms of improved concordance index and reduced prediction error.

Availability And Implementation: The two-stage procedure can be implemented in BhGLM package which is freely available at http://www.ssg.uab.edu/bhglm/.

Contact: nyi@uab.edu.

References
1.
Corey S, Minden M, Barber D, Kantarjian H, Wang J, Schimmer A . Myelodysplastic syndromes: the complexity of stem-cell diseases. Nat Rev Cancer. 2007; 7(2):118-29. DOI: 10.1038/nrc2047. View

2.
Beekman R, Valkhof M, Erkeland S, Taskesen E, Rockova V, Peeters J . Retroviral integration mutagenesis in mice and comparative analysis in human AML identify reduced PTP4A3 expression as a prognostic indicator. PLoS One. 2011; 6(10):e26537. PMC: 3197662. DOI: 10.1371/journal.pone.0026537. View

3.
Tibshirani R, Efron B . Pre-validation and inference in microarrays. Stat Appl Genet Mol Biol. 2006; 1:Article1. DOI: 10.2202/1544-6115.1000. View

4.
Zhang J, Liu X, Datta A, Govindarajan K, Tam W, Han J . RCP is a human breast cancer-promoting gene with Ras-activating function. J Clin Invest. 2009; 119(8):2171-83. PMC: 2719918. DOI: 10.1172/JCI37622. View

5.
Partheen K, Levan K, Osterberg L, Horvath G . Expression analysis of stage III serous ovarian adenocarcinoma distinguishes a sub-group of survivors. Eur J Cancer. 2006; 42(16):2846-54. DOI: 10.1016/j.ejca.2006.06.026. View