WGCNA: an R Package for Weighted Correlation Network Analysis

Overview
Publisher Biomed Central
Specialty Biology
Date 2008 Dec 31
PMID 19114008
Citations 11429
Abstract

Background: Correlation networks are increasingly being used in bioinformatics applications. For example, weighted gene co-expression network analysis is a systems biology method for describing the correlation patterns among genes across microarray samples. Weighted correlation network analysis (WGCNA) can be used for finding clusters (modules) of highly correlated genes, for summarizing such clusters using the module eigengene or an intramodular hub gene, for relating modules to one another and to external sample traits (using eigengene network methodology), and for calculating module membership measures. Correlation networks facilitate network-based gene screening methods that can be used to identify candidate biomarkers or therapeutic targets. These methods have been successfully applied in various biological contexts, e.g. cancer, mouse genetics, yeast genetics, and analysis of brain imaging data. While parts of the correlation network methodology have been described in separate publications, there is a need to provide a user-friendly, comprehensive, and consistent software implementation and an accompanying tutorial.

Results: The WGCNA R software package is a comprehensive collection of R functions for performing various aspects of weighted correlation network analysis. The package includes functions for network construction, module detection, gene selection, calculations of topological properties, data simulation, visualization, and interfacing with external software. Along with the R package we also present R software tutorials. While the methods development was motivated by gene expression data, the underlying data mining approach can be applied to a variety of different settings.

Conclusion: The WGCNA package provides R functions for weighted correlation network analysis, e.g. co-expression network analysis of gene expression data. The R package, along with its source code and additional material, is freely available at http://www.genetics.ucla.edu/labs/horvath/CoexpressionNetwork/Rpackages/WGCNA.
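
Illustration: the abstract names three quantities, a weighted correlation network, the module eigengene, and module membership. The sketch below shows one common way to compute them on toy data. It is written in plain Python and does not call the WGCNA R package. The unsigned adjacency |cor|^beta, the power beta = 6, the gene names, the expression values, and all function names are illustrative assumptions, not details taken from the abstract.

```python
import math

def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def adjacency(expr, beta=6):
    """Unsigned weighted adjacency a_ij = |cor(x_i, x_j)| ** beta (assumed form)."""
    genes = list(expr)
    return {(g, h): abs(pearson(expr[g], expr[h])) ** beta
            for g in genes for h in genes if g != h}

def standardize(x):
    """Center to mean 0 and scale to unit sample standard deviation."""
    n = len(x)
    m = sum(x) / n
    sd = math.sqrt(sum((a - m) ** 2 for a in x) / (n - 1))
    return [(a - m) / sd for a in x]

def module_eigengene(expr, module, iters=100):
    """First principal component (across samples) of the standardized
    module genes, found by power iteration on Z^T Z."""
    z = [standardize(expr[g]) for g in module]   # genes x samples
    n_samples = len(z[0])
    # Start from the mean profile so the sign follows average expression.
    v = [sum(row[s] for row in z) / len(z) for s in range(n_samples)]
    for _ in range(iters):
        scores = [sum(row[s] * v[s] for s in range(n_samples)) for row in z]
        w = [sum(scores[i] * z[i][s] for i in range(len(z)))
             for s in range(n_samples)]
        norm = math.sqrt(sum(a * a for a in w))
        v = [a / norm for a in w]
    return v

# Toy expression data (hypothetical): 4 genes measured in 6 samples.
expr = {
    "g1": [1.0, 2.0, 3.0, 4.0, 5.0, 6.0],
    "g2": [1.2, 1.9, 3.2, 3.8, 5.1, 6.3],
    "g3": [2.0, 4.1, 5.9, 8.2, 9.8, 12.1],
    "g4": [3.0, 1.0, 4.0, 1.0, 5.0, 2.0],
}

adj = adjacency(expr, beta=6)
module = ["g1", "g2", "g3"]                      # assumed module assignment
eigengene = module_eigengene(expr, module)
# Module membership: correlation of each gene with the module eigengene.
membership = {g: pearson(expr[g], eigengene) for g in expr}

for g, k in membership.items():
    print(g, round(k, 3))
```

Raising the absolute correlation to a power keeps strong correlations close to 1 while pushing weak ones toward 0, so the network is weighted rather than cut off at a hard threshold. In this toy data the three co-varying genes receive membership values near 1, while the unrelated gene g4 does not.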

Citing Articles

LINC02363: a potential biomarker for early diagnosis and treatment of sepsis.

Leng L, Wang H, Hu Y, Hu L. BMC Immunol. 2025; 26(1):23.

PMID: 40089725 DOI: 10.1186/s12865-025-00702-x.


Hordeum I genome unlocks adaptive evolution and genetic potential for crop improvement.

Feng H, Du Q, Jiang Y, Jia Y, He T, Wang Y. Nat Plants. 2025.

PMID: 40087544 DOI: 10.1038/s41477-025-01942-w.


Systematic Identification of Mitochondrial Signatures in Alzheimer's Disease and Inflammatory Bowel Disease.

Wang F, Wang J, Chen T, Wang S, Meng X, Shen Y. Mol Neurobiol. 2025.

PMID: 40085351 DOI: 10.1007/s12035-025-04826-4.


Exploring the potential mechanisms of sorafenib resistance in hepatocellular carcinoma cell lines based on RNA sequencing.

Sun M, Zhang Z, Chen C, Zhong J, Long Z, Shen L. Cancer Cell Int. 2025; 25(1):91.

PMID: 40082884 PMC: 11907981. DOI: 10.1186/s12935-025-03728-8.


GWAS and transcriptome analyses unravel ZmGRAS15 regulates drought tolerance and root elongation in maize.

Wang D, Liu X, He G, Wang K, Li Y, Guan H. BMC Genomics. 2025; 26(1):246.

PMID: 40082805 PMC: 11907892. DOI: 10.1186/s12864-025-11435-x.

