» Articles » PMID: 20834038

Semi-supervised Recursively Partitioned Mixture Models for Identifying Cancer Subtypes

Overview
Journal Bioinformatics
Specialty Biology
Date 2010 Sep 14
PMID 20834038
Citations 28
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: Patients with identical cancer diagnoses often progress differently. The disparity we see in disease progression and treatment response can be attributed to the idea that two histologically similar cancers may be completely different diseases on the molecular level. Methods for identifying cancer subtypes associated with patient survival have the capacity to be powerful instruments for understanding the biochemical processes that underlie disease progression as well as providing an initial step toward more personalized therapy for cancer patients. We propose a method called semi-supervised recursively partitioned mixture models (SS-RPMM) that utilizes array-based genetic and patient-level clinical data for finding cancer subtypes that are associated with patient survival.

Results: In the proposed SS-RPMM, cancer subtypes are identified using a selected subset of genes that are associated with survival time. Since survival information is used in the gene selection step, this method is semi-supervised. Unlike other semi-supervised clustering classification methods, SS-RPMM does not require specification of the number of cancer subtypes, which is often unknown. In a simulation study, our proposed method compared favorably with other competing semi-supervised methods, including: semi-supervised clustering and supervised principal components analysis. Furthermore, an analysis of mesothelioma cancer data using SS-RPMM, revealed at least two distinct methylation profiles that are informative for survival.

Availability: The analyses implemented in this article were carried out using R (http://www.r.project.org/).

Contact: devin_koestler@brown.edu; e_andres_houseman@brown.edu

Supplementary Information: Supplementary data are available at Bioinformatics online.

Citing Articles

Evaluation of agreement between common clustering strategies for DNA methylation-based subtyping of breast tumours.

Zarean E, Li S, Wong E, Makalic E, Milne R, Giles G Epigenomics. 2024; 17(2):105-114.

PMID: 39711216 PMC: 11792870. DOI: 10.1080/17501911.2024.2441653.


Genome-Scale Methylation Analysis Identifies Immune Profiles and Age Acceleration Associations with Bladder Cancer Outcomes.

Chen J, Salas L, Wiencke J, Koestler D, Molinaro A, Andrew A Cancer Epidemiol Biomarkers Prev. 2023; 32(10):1328-1337.

PMID: 37527159 PMC: 10543967. DOI: 10.1158/1055-9965.EPI-23-0331.


Comparative Transcriptome Profiling Reveals the Genes Involved in Storage Root Expansion in Sweetpotato ( (L.) Lam.).

Song W, Yan H, Ma M, Kou M, Li C, Tang W Genes (Basel). 2022; 13(7).

PMID: 35885939 PMC: 9321896. DOI: 10.3390/genes13071156.


Subject level clustering using a negative binomial model for small transcriptomic studies.

Li Q, Noel-MacDonnell J, Koestler D, Goode E, Fridley B BMC Bioinformatics. 2018; 19(1):474.

PMID: 30541426 PMC: 6292049. DOI: 10.1186/s12859-018-2556-9.


Identification of relevant subtypes via preweighted sparse clustering.

Gaynor S, Bair E Comput Stat Data Anal. 2018; 116:139-154.

PMID: 29785064 PMC: 5959300. DOI: 10.1016/j.csda.2017.06.003.


References
1.
Alizadeh A, Eisen M, Davis R, Ma C, Lossos I, Rosenwald A . Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling. Nature. 2000; 403(6769):503-11. DOI: 10.1038/35000501. View

2.
Marsit C, Christensen B, Houseman E, Karagas M, Wrensch M, Yeh R . Epigenetic profiling reveals etiologically distinct patterns of DNA methylation in head and neck squamous cell carcinoma. Carcinogenesis. 2009; 30(3):416-22. PMC: 2650795. DOI: 10.1093/carcin/bgp006. View

3.
Houseman E, Coull B, Betensky R . Feature-specific penalized latent class analysis for genomic data. Biometrics. 2006; 62(4):1062-70. DOI: 10.1111/j.1541-0420.2006.00566.x. View

4.
Deneberg S, Grovdal M, Karimi M, Jansson M, Nahi H, Corbacioglu A . Gene-specific and global methylation patterns predict outcome in patients with acute myeloid leukemia. Leukemia. 2010; 24(5):932-41. DOI: 10.1038/leu.2010.41. View

5.
Hou J, Aerts J, den Hamer B, van IJcken W, den Bakker M, Riegman P . Gene expression-based classification of non-small cell lung carcinomas and survival prediction. PLoS One. 2010; 5(4):e10312. PMC: 2858668. DOI: 10.1371/journal.pone.0010312. View