» Articles » PMID: 27741311

Impact of the Choice of Normalization Method on Molecular Cancer Class Discovery Using Nonnegative Matrix Factorization

Overview
Journal PLoS One
Date 2016 Oct 15
PMID 27741311
Citations 2
Authors
Affiliations
Soon will be listed here.
Abstract

Nonnegative Matrix Factorization (NMF) has proved to be an effective method for unsupervised clustering analysis of gene expression data. By the nonnegativity constraint, NMF provides a decomposition of the data matrix into two matrices that have been used for clustering analysis. However, the decomposition is not unique. This allows different clustering results to be obtained, resulting in different interpretations of the decomposition. To alleviate this problem, some existing methods directly enforce uniqueness to some extent by adding regularization terms in the NMF objective function. Alternatively, various normalization methods have been applied to the factor matrices; however, the effects of the choice of normalization have not been carefully investigated. Here we investigate the performance of NMF for the task of cancer class discovery, under a wide range of normalization choices. After extensive evaluations, we observe that the maximum norm showed the best performance, although the maximum norm has not previously been used for NMF. Matlab codes are freely available from: http://maths.nuigalway.ie/~haixuanyang/pNMF/pNMF.htm.

Citing Articles

eDRAM: Effective early disease risk assessment with matrix factorization on a large-scale medical database: A case study on rheumatoid arthritis.

Chin C, Hsieh S, Tseng V PLoS One. 2018; 13(11):e0207579.

PMID: 30475847 PMC: 6261027. DOI: 10.1371/journal.pone.0207579.


Elastic K-means using posterior probability.

Zheng A, Jiang B, Li Y, Zhang X, Ding C PLoS One. 2017; 12(12):e0188252.

PMID: 29240756 PMC: 5730165. DOI: 10.1371/journal.pone.0188252.

References
1.
Devarajan K . Nonnegative matrix factorization: an analytical and interpretive tool in computational biology. PLoS Comput Biol. 2008; 4(7):e1000029. PMC: 2447881. DOI: 10.1371/journal.pcbi.1000029. View

2.
Lee D, Seung H . Learning the parts of objects by non-negative matrix factorization. Nature. 1999; 401(6755):788-91. DOI: 10.1038/44565. View

3.
Parker J, Mullins M, Cheang M, Leung S, Voduc D, Vickery T . Supervised risk predictor of breast cancer based on intrinsic subtypes. J Clin Oncol. 2009; 27(8):1160-7. PMC: 2667820. DOI: 10.1200/JCO.2008.18.1370. View

4.
Brunet J, Tamayo P, Golub T, Mesirov J . Metagenes and molecular pattern discovery using matrix factorization. Proc Natl Acad Sci U S A. 2004; 101(12):4164-9. PMC: 384712. DOI: 10.1073/pnas.0308531101. View

5.
Prieto C, Risueno A, Fontanillo C, De Las Rivas J . Human gene coexpression landscape: confident network derived from tissue transcriptomic profiles. PLoS One. 2008; 3(12):e3911. PMC: 2597745. DOI: 10.1371/journal.pone.0003911. View