» Articles » PMID: 34169279

A Computational Method for Direct Imputation of Cell Type-specific Expression Profiles and Cellular Compositions from Bulk-tissue RNA-Seq in Brain Disorders

Overview
Specialty Biology
Date 2021 Jun 25
PMID 34169279
Citations 4
Authors
Affiliations
Soon will be listed here.
Abstract

The importance of cell type-specific gene expression in disease-relevant tissues is increasingly recognized in genetic studies of complex diseases. However, most gene expression studies are conducted on bulk tissues, without examining cell type-specific expression profiles. Several computational methods are available for cell type deconvolution (i.e. inference of cellular composition) from bulk RNA-Seq data, but few of them impute cell type-specific expression profiles. We hypothesize that with external prior information such as single cell RNA-seq and population-wide expression profiles, it can be computationally tractable to estimate both cellular composition and cell type-specific expression from bulk RNA-Seq data. Here we introduce CellR, which addresses cross-individual gene expression variations to adjust the weights of cell-specific gene markers. It then transforms the deconvolution problem into a linear programming model while taking into account inter/intra cellular correlations and uses a multi-variate stochastic search algorithm to estimate the cell type-specific expression profiles. Analyses on several complex diseases such as schizophrenia, Alzheimer's disease, Huntington's disease and type 2 diabetes validated the efficiency of CellR, while revealing how specific cell types contribute to different diseases. In summary, CellR compares favorably against competing approaches, enabling cell type-specific re-analysis of gene expression data on bulk tissues in complex diseases.

Citing Articles

Challenges and opportunities to computationally deconvolve heterogeneous tissue with varying cell sizes using single-cell RNA-sequencing datasets.

Maden S, Kwon S, Huuki-Myers L, Collado-Torres L, Hicks S, Maynard K Genome Biol. 2023; 24(1):288.

PMID: 38098055 PMC: 10722720. DOI: 10.1186/s13059-023-03123-4.


WormTensor: a clustering method for time-series whole-brain activity data from C. elegans.

Tsuyuzaki K, Yamamoto K, Toyoshima Y, Sato H, Kanamori M, Teramoto T BMC Bioinformatics. 2023; 24(1):254.

PMID: 37328814 PMC: 10273573. DOI: 10.1186/s12859-023-05230-2.


Detecting the effect of genetic diversity on brain composition in an Alzheimer's disease mouse model.

Gurdon B, Yates S, Csucs G, Groeneboom N, Hadad N, Telpoukhovskaia M bioRxiv. 2023; .

PMID: 36909528 PMC: 10002670. DOI: 10.1101/2023.02.27.530226.


SCADIE: simultaneous estimation of cell type proportions and cell type-specific gene expressions using SCAD-based iterative estimating procedure.

Tang D, Park S, Zhao H Genome Biol. 2022; 23(1):129.

PMID: 35706040 PMC: 9199219. DOI: 10.1186/s13059-022-02688-w.

References
1.
Yadav V, De S . An assessment of computational methods for estimating purity and clonality using genomic data derived from heterogeneous tumor tissue samples. Brief Bioinform. 2014; 16(2):232-41. PMC: 4794615. DOI: 10.1093/bib/bbu002. View

2.
Labadorf A, Hoss A, Lagomarsino V, Latourelle J, Hadzi T, Bregu J . RNA Sequence Analysis of Human Huntington Disease Brain Reveals an Extensive Increase in Inflammatory and Developmental Gene Expression. PLoS One. 2015; 10(12):e0143563. PMC: 4670106. DOI: 10.1371/journal.pone.0143563. View

3.
Abbas A, Wolslegel K, Seshasayee D, Modrusan Z, Clark H . Deconvolution of blood microarray data identifies cellular activation patterns in systemic lupus erythematosus. PLoS One. 2009; 4(7):e6098. PMC: 2699551. DOI: 10.1371/journal.pone.0006098. View

4.
Pardinas A, Holmans P, Pocklington A, Escott-Price V, Ripke S, Carrera N . Common schizophrenia alleles are enriched in mutation-intolerant genes and in regions under strong background selection. Nat Genet. 2018; 50(3):381-389. PMC: 5918692. DOI: 10.1038/s41588-018-0059-2. View

5.
Zeng W, Chen X, Duren Z, Wang Y, Jiang R, Wong W . DC3 is a method for deconvolution and coupled clustering from bulk and single-cell genomics data. Nat Commun. 2019; 10(1):4613. PMC: 6787340. DOI: 10.1038/s41467-019-12547-1. View