A Computational Method for Direct Imputation of Cell Type-specific Expression Profiles and Cellular Compositions from Bulk-tissue RNA-Seq in Brain Disorders
Overview
Affiliations
The importance of cell type-specific gene expression in disease-relevant tissues is increasingly recognized in genetic studies of complex diseases. However, most gene expression studies are conducted on bulk tissues, without examining cell type-specific expression profiles. Several computational methods are available for cell type deconvolution (i.e. inference of cellular composition) from bulk RNA-Seq data, but few of them impute cell type-specific expression profiles. We hypothesize that with external prior information such as single cell RNA-seq and population-wide expression profiles, it can be computationally tractable to estimate both cellular composition and cell type-specific expression from bulk RNA-Seq data. Here we introduce CellR, which addresses cross-individual gene expression variations to adjust the weights of cell-specific gene markers. It then transforms the deconvolution problem into a linear programming model while taking into account inter/intra cellular correlations and uses a multi-variate stochastic search algorithm to estimate the cell type-specific expression profiles. Analyses on several complex diseases such as schizophrenia, Alzheimer's disease, Huntington's disease and type 2 diabetes validated the efficiency of CellR, while revealing how specific cell types contribute to different diseases. In summary, CellR compares favorably against competing approaches, enabling cell type-specific re-analysis of gene expression data on bulk tissues in complex diseases.
Maden S, Kwon S, Huuki-Myers L, Collado-Torres L, Hicks S, Maynard K Genome Biol. 2023; 24(1):288.
PMID: 38098055 PMC: 10722720. DOI: 10.1186/s13059-023-03123-4.
WormTensor: a clustering method for time-series whole-brain activity data from C. elegans.
Tsuyuzaki K, Yamamoto K, Toyoshima Y, Sato H, Kanamori M, Teramoto T BMC Bioinformatics. 2023; 24(1):254.
PMID: 37328814 PMC: 10273573. DOI: 10.1186/s12859-023-05230-2.
Gurdon B, Yates S, Csucs G, Groeneboom N, Hadad N, Telpoukhovskaia M bioRxiv. 2023; .
PMID: 36909528 PMC: 10002670. DOI: 10.1101/2023.02.27.530226.
Tang D, Park S, Zhao H Genome Biol. 2022; 23(1):129.
PMID: 35706040 PMC: 9199219. DOI: 10.1186/s13059-022-02688-w.