» Articles » PMID: 25583121

CellCODE: a Robust Latent Variable Approach to Differential Expression Analysis for Heterogeneous Cell Populations

Overview
Journal Bioinformatics
Specialty Biology
Date 2015 Jan 14
PMID 25583121
Citations 59
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: Identifying alterations in gene expression associated with different clinical states is important for the study of human biology. However, clinical samples used in gene expression studies are often derived from heterogeneous mixtures with variable cell-type composition, complicating statistical analysis. Considerable effort has been devoted to modeling sample heterogeneity, and presently, there are many methods that can estimate cell proportions or pure cell-type expression from mixture data. However, there is no method that comprehensively addresses mixture analysis in the context of differential expression without relying on additional proportion information, which can be inaccurate and is frequently unavailable.

Results: In this study, we consider a clinically relevant situation where neither accurate proportion estimates nor pure cell expression is of direct interest, but where we are rather interested in detecting and interpreting relevant differential expression in mixture samples. We develop a method, Cell-type COmputational Differential Estimation (CellCODE), that addresses the specific statistical question directly, without requiring a physical model for mixture components. Our approach is based on latent variable analysis and is computationally transparent; it requires no additional experimental data, yet outperforms existing methods that use independent proportion measurements. CellCODE has few parameters that are robust and easy to interpret. The method can be used to track changes in proportion, improve power to detect differential expression and assign the differentially expressed genes to the correct cell type.

Citing Articles

Multi Layered Omics Approaches Reveal Glia Specific Alterations in Alzheimer's Disease: A Systematic Review and Future Prospects.

Is O, Min Y, Wang X, Oatman S, Abraham Daniel A, Ertekin-Taner N Glia. 2024; 73(3):539-573.

PMID: 39652363 PMC: 11784841. DOI: 10.1002/glia.24652.


Embracing the informative missingness and silent gene in analyzing biologically diverse samples.

Du D, Bhardwaj S, Lu Y, Wang Y, Parker S, Zhang Z Sci Rep. 2024; 14(1):28265.

PMID: 39550430 PMC: 11569126. DOI: 10.1038/s41598-024-78076-0.


Rapid iPSC inclusionopathy models shed light on formation, consequence, and molecular subtype of α-synuclein inclusions.

Lam I, Ndayisaba A, Lewis A, Fu Y, Sagredo G, Kuzkina A Neuron. 2024; 112(17):2886-2909.e16.

PMID: 39079530 PMC: 11377155. DOI: 10.1016/j.neuron.2024.06.002.


ABDS: a bioinformatics tool suite for analyzing biologically diverse samples.

Du D, Bhardwaj S, Lu Y, Wang Y, Parker S, Zhang Z Res Sq. 2024; .

PMID: 38853832 PMC: 11160903. DOI: 10.21203/rs.3.rs-4419408/v1.


Alzheimer's disease rewires gene coexpression networks coupling different brain regions.

Mitra S, Bp K, C R S, Saikumar N, Philip P, Narayanan M NPJ Syst Biol Appl. 2024; 10(1):50.

PMID: 38724582 PMC: 11082197. DOI: 10.1038/s41540-024-00376-y.


References
1.
Novershtern N, Subramanian A, Lawton L, Mak R, Nicholas Haining W, McConkey M . Densely interconnected transcriptional circuits control cell states in human hematopoiesis. Cell. 2011; 144(2):296-309. PMC: 3049864. DOI: 10.1016/j.cell.2011.01.004. View

2.
Nakaya H, Wrammert J, Lee E, Racioppi L, Marie-Kunze S, Nicholas Haining W . Systems biology of vaccination for seasonal influenza in humans. Nat Immunol. 2011; 12(8):786-95. PMC: 3140559. DOI: 10.1038/ni.2067. View

3.
Shen-Orr S, Tibshirani R, Khatri P, Bodian D, Staedtler F, Perry N . Cell type-specific gene expression differences in complex tissues. Nat Methods. 2010; 7(4):287-9. PMC: 3699332. DOI: 10.1038/nmeth.1439. View

4.
Repsilber D, Kern S, Telaar A, Walzl G, Black G, Selbig J . Biomarker discovery in heterogeneous tissue samples -taking the in-silico deconfounding approach. BMC Bioinformatics. 2010; 11:27. PMC: 3098067. DOI: 10.1186/1471-2105-11-27. View

5.
Langfelder P, Horvath S . WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics. 2008; 9:559. PMC: 2631488. DOI: 10.1186/1471-2105-9-559. View