» Articles » PMID: 22343431

Using Probabilistic Estimation of Expression Residuals (PEER) to Obtain Increased Power and Interpretability of Gene Expression Analyses

Overview
Journal Nat Protoc
Specialties Biology
Pathology
Science
Date 2012 Feb 21
PMID 22343431
Citations 518
Authors
Affiliations
Soon will be listed here.
Abstract

We present PEER (probabilistic estimation of expression residuals), a software package implementing statistical models that improve the sensitivity and interpretability of genetic associations in population-scale expression data. This approach builds on factor analysis methods that infer broad variance components in the measurements. PEER takes as input transcript profiles and covariates from a set of individuals, and then outputs hidden factors that explain much of the expression variability. Optionally, these factors can be interpreted as pathway or transcription factor activations by providing prior information about which genes are involved in the pathway or targeted by the factor. The inferred factors are used in genetic association analyses. First, they are treated as additional covariates, and are included in the model to increase detection power for mapping expression traits. Second, they are analyzed as phenotypes themselves to understand the causes of global expression variability. PEER extends previous related surrogate variable models and can be implemented within hours on a desktop computer.

Citing Articles

Epigenome-wide association study for dilated cardiomyopathy in left ventricular heart tissue identifies putative gene sets associated with cardiac pathology and early indicators of cardiac risk.

Tan K, Tay D, Tan W, Ng H, Wong E, Morley M Clin Epigenetics. 2025; 17(1):45.

PMID: 40057770 PMC: 11890527. DOI: 10.1186/s13148-025-01854-8.


The contribution of genetic determinants of blood gene expression and splicing to molecular phenotypes and health outcomes.

Tokolyi A, Persyn E, Nath A, Burnham K, Marten J, Vanderstichele T Nat Genet. 2025; 57(3):616-625.

PMID: 40038547 PMC: 11906350. DOI: 10.1038/s41588-025-02096-3.


Long-read RNA sequencing atlas of human microglia isoforms elucidates disease-associated genetic regulation of splicing.

Humphrey J, Brophy E, Kosoy R, Zeng B, Coccia E, Mattei D Nat Genet. 2025; 57(3):604-615.

PMID: 40033057 DOI: 10.1038/s41588-025-02099-0.


Astrocytic-supplied cholesterol drives synaptic gene expression programs in developing neurons and downstream astrocytic transcriptional programs.

Vartiainen E, Liyanage D, Mazureac I, Battaglia R, Tegtmeyer M, Xu H bioRxiv. 2025; .

PMID: 39975161 PMC: 11838310. DOI: 10.1101/2025.01.28.635252.


Genetic coupling of enhancer activity and connectivity in gene expression control.

Ray-Jones H, Sung C, Chan L, Haglund A, Artemov P, Della Rosa M Nat Commun. 2025; 16(1):970.

PMID: 39870618 PMC: 11772589. DOI: 10.1038/s41467-025-55900-3.


References
1.
Pickrell J, Marioni J, Pai A, Degner J, Engelhardt B, Nkadori E . Understanding mechanisms underlying human gene expression variation with RNA sequencing. Nature. 2010; 464(7289):768-72. PMC: 3089435. DOI: 10.1038/nature08872. View

2.
Brem R, Storey J, Whittle J, Kruglyak L . Genetic interactions between polymorphisms that affect gene expression in yeast. Nature. 2005; 436(7051):701-3. PMC: 1409747. DOI: 10.1038/nature03865. View

3.
Broman K, Wu H, Sen S, Churchill G . R/qtl: QTL mapping in experimental crosses. Bioinformatics. 2003; 19(7):889-90. DOI: 10.1093/bioinformatics/btg112. View

4.
Zhu J, Zhang B, Smith E, Drees B, Brem R, Kruglyak L . Integrating large-scale functional genomic data to dissect the complexity of yeast regulatory networks. Nat Genet. 2008; 40(7):854-61. PMC: 2573859. DOI: 10.1038/ng.167. View

5.
Breitling R, Li Y, Tesson B, Fu J, Wu C, Wiltshire T . Genetical genomics: spotlight on QTL hotspots. PLoS Genet. 2008; 4(10):e1000232. PMC: 2563687. DOI: 10.1371/journal.pgen.1000232. View