» Articles » PMID: 27922124

Convex Analysis of Mixtures for Separating Non-negative Well-grounded Sources

Overview
Journal Sci Rep
Specialty Science
Date 2016 Dec 7
PMID 27922124
Citations 4
Authors
Affiliations
Soon will be listed here.
Abstract

Blind Source Separation (BSS) is a powerful tool for analyzing composite data patterns in many areas, such as computational biology. We introduce a novel BSS method, Convex Analysis of Mixtures (CAM), for separating non-negative well-grounded sources, which learns the mixing matrix by identifying the lateral edges of the convex data scatter plot. We propose and prove a sufficient and necessary condition for identifying the mixing matrix through edge detection in the noise-free case, which enables CAM to identify the mixing matrix not only in the exact-determined and over-determined scenarios, but also in the under-determined scenario. We show the optimality of the edge detection strategy, even for cases where source well-groundedness is not strictly satisfied. The CAM algorithm integrates plug-in noise filtering using sector-based clustering, an efficient geometric convex analysis scheme, and stability-based model order selection. The superior performance of CAM against a panel of benchmark BSS techniques is demonstrated on numerically mixed gene expression data of ovarian cancer subtypes. We apply CAM to dissect dynamic contrast-enhanced magnetic resonance imaging data taken from breast tumors and time-course microarray gene expression data derived from in-vivo muscle regeneration in mice, both producing biologically plausible decomposition results.

Citing Articles

Comprehensive evaluation of deconvolution methods for human brain gene expression.

J Sutton G, Poppe D, Simmons R, Walsh K, Nawaz U, Lister R Nat Commun. 2022; 13(1):1358.

PMID: 35292647 PMC: 8924248. DOI: 10.1038/s41467-022-28655-4.


Molecular characterization of projection neuron subtypes in the mouse olfactory bulb.

Zeppilli S, Ackels T, Attey R, Klimpert N, Ritola K, Boeing S Elife. 2021; 10.

PMID: 34292150 PMC: 8352594. DOI: 10.7554/eLife.65445.


Identification of Putative Early Atherosclerosis Biomarkers by Unsupervised Deconvolution of Heterogeneous Vascular Proteomes.

Parker S, Chen L, Spivia W, Saylor G, Mao C, Venkatraman V J Proteome Res. 2020; 19(7):2794-2806.

PMID: 32202800 PMC: 7720636. DOI: 10.1021/acs.jproteome.0c00118.


Mathematical modelling of transcriptional heterogeneity identifies novel markers and subpopulations in complex tissues.

Wang N, Hoffman E, Chen L, Chen L, Zhang Z, Liu C Sci Rep. 2016; 6:18909.

PMID: 26739359 PMC: 4703969. DOI: 10.1038/srep18909.

References
1.
Wang F, Chi C, Chan T, Wang Y . Nonnegative least-correlated component analysis for separation of dependent sources by volume maximization. IEEE Trans Pattern Anal Mach Intell. 2010; 32(5):875-88. DOI: 10.1109/TPAMI.2009.72. View

2.
Ding C, Li T, Jordan M . Convex and semi-nonnegative matrix factorizations. IEEE Trans Pattern Anal Mach Intell. 2009; 32(1):45-55. DOI: 10.1109/TPAMI.2008.277. View

3.
Astakhov S, Stogbauer H, Kraskov A, Grassberger P . Monte Carlo algorithm for least dependent non-negative mixture decomposition. Anal Chem. 2006; 78(5):1620-7. DOI: 10.1021/ac051707c. View

4.
Lee D, Seung H . Learning the parts of objects by non-negative matrix factorization. Nature. 1999; 401(6755):788-91. DOI: 10.1038/44565. View

5.
Schwartz D, Kardia S, Shedden K, Kuick R, Michailidis G, Taylor J . Gene expression in ovarian cancer reflects both morphology and biological behavior, distinguishing clear cell from other poor-prognosis ovarian carcinomas. Cancer Res. 2002; 62(16):4722-9. View