» Articles » PMID: 22583800

SPICE: Discovery of Phenotype-determining Component Interplays

Overview
Journal BMC Syst Biol
Publisher Biomed Central
Specialty Biology
Date 2012 May 16
PMID 22583800
Citations 1
Authors
Affiliations
Soon will be listed here.
Abstract

Background: A latent behavior of a biological cell is complex. Deriving the underlying simplicity, or the fundamental rules governing this behavior has been the Holy Grail of systems biology. Data-driven prediction of the system components and their component interplays that are responsible for the target system's phenotype is a key and challenging step in this endeavor.

Results: The proposed approach, which we call System Phenotype-related Interplaying Components Enumerator (SPICE), iteratively enumerates statistically significant system components that are hypothesized (1) to play an important role in defining the specificity of the target system's phenotype(s); (2) to exhibit a functionally coherent behavior, namely, act in a coordinated manner to perform the phenotype-specific function; and (3) to improve the predictive skill of the system's phenotype(s) when used collectively in the ensemble of predictive models. SPICE can be applied to both instance-based data and network-based data. When validated, SPICE effectively identified system components related to three target phenotypes: biohydrogen production, motility, and cancer. Manual results curation agreed with the known phenotype-related system components reported in literature. Additionally, using the identified system components as discriminatory features improved the prediction accuracy by 10% on the phenotype-classification task when compared to a number of state-of-the-art methods applied to eight benchmark microarray data sets.

Conclusion: We formulate a problem--enumeration of phenotype-determining system component interplays--and propose an effective methodology (SPICE) to address this problem. SPICE improved identification of cancer-related groups of genes from various microarray data sets and detected groups of genes associated with microbial biohydrogen production and motility, many of which were reported in literature. SPICE also improved the predictive skill of the system's phenotype determination compared to individual classifiers and/or other ensemble methods, such as bagging, boosting, random forest, nearest shrunken centroid, and random forest variable selection method.

Citing Articles

Complex biomarker discovery in neuroimaging data: Finding a needle in a haystack.

Atluri G, Padmanabhan K, Fang G, Steinbach M, Petrella J, Lim K Neuroimage Clin. 2013; 3:123-31.

PMID: 24179856 PMC: 3791294. DOI: 10.1016/j.nicl.2013.07.004.

References
1.
Rey F, Oda Y, Harwood C . Regulation of uptake hydrogenase and effects of hydrogen utilization on gene expression in Rhodopseudomonas palustris. J Bacteriol. 2006; 188(17):6143-52. PMC: 1595397. DOI: 10.1128/JB.00381-06. View

2.
Johannes M, Brase J, Frohlich H, Gade S, Gehrmann M, Falth M . Integration of pathway knowledge into a reweighted recursive feature elimination approach for risk stratification of cancer patients. Bioinformatics. 2010; 26(17):2136-44. DOI: 10.1093/bioinformatics/btq345. View

3.
He X, Yan S, Hu Y, Niyogi P, Zhang H . Face recognition using laplacianfaces. IEEE Trans Pattern Anal Mach Intell. 2005; 27(3):328-340. DOI: 10.1109/TPAMI.2005.55. View

4.
Vignais P, Colbeau A . Molecular biology of microbial hydrogenases. Curr Issues Mol Biol. 2004; 6(2):159-88. View

5.
Shomura Y, Komori H, Miyabe N, Tomiyama M, Shibata N, Higuchi Y . Crystal structures of hydrogenase maturation protein HypE in the Apo and ATP-bound forms. J Mol Biol. 2007; 372(4):1045-1054. DOI: 10.1016/j.jmb.2007.07.023. View