
Robust Detection of Hierarchical Communities from Escherichia coli Gene Expression Data

Overview
Specialty Biology
Date 2012 Mar 3
PMID 22383870
Citations 18
Abstract

Determining the functional structure of biological networks is a central goal of systems biology. One approach is to analyze gene expression data to infer a network of gene interactions on the basis of their correlated responses to environmental and genetic perturbations. The inferred network can then be analyzed to identify functional communities. However, commonly used algorithms can yield unreliable results due to experimental noise, algorithmic stochasticity, and the influence of arbitrarily chosen parameter values. Furthermore, the results typically provide only a simplistic view of the network, partitioned into disjoint communities, and give no information about the relationships between communities. Here, we present methods to robustly detect co-regulated and functionally enriched gene communities and demonstrate their application and validity for Escherichia coli gene expression data. Applying a recently developed community detection algorithm to the network of interactions identified with the context likelihood of relatedness (CLR) method, we show that a hierarchy of network communities can be identified. These communities are significantly enriched for gene ontology (GO) terms, consistent with their representing biologically meaningful groups. Further, analysis of the most significantly enriched communities identified several new candidate regulatory interactions. The robustness of our methods is demonstrated by showing that a core set of functional communities is reliably found when artificial noise, modeling experimental noise, is added to the data. We find that noise mainly acts conservatively, increasing the relatedness required for a network link to be reliably assigned and decreasing the size of the core communities, rather than causing association of genes into new communities.
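The two steps the abstract describes most concretely, CLR scoring and the added-noise robustness check, can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes absolute Pearson correlation as a stand-in for the mutual information normally used by CLR, Gaussian noise as the model of experimental noise, and hypothetical names and parameters (`clr_scores`, `link_frequency`, `threshold`, `noise_sd`, `n_trials`). The community detection step is omitted because the abstract does not name the algorithm.

```python
import numpy as np

def clr_scores(relatedness):
    """CLR-style background correction of a symmetric gene-by-gene relatedness matrix."""
    r = np.asarray(relatedness, dtype=float)
    n = r.shape[0]
    # Each gene's background distribution: its scores against all other genes.
    masked = np.where(~np.eye(n, dtype=bool), r, np.nan)
    mean = np.nanmean(masked, axis=1)
    std = np.nanstd(masked, axis=1)
    std[std == 0] = 1.0  # avoid division by zero for flat rows
    # Z-score each pair against the row gene's background; negative values are clipped to 0.
    z = np.clip((r - mean[:, None]) / std[:, None], 0.0, None)
    # Combine the two genes' z-scores into one symmetric score per pair.
    scores = np.sqrt(z**2 + z.T**2)
    np.fill_diagonal(scores, 0.0)
    return scores

def link_frequency(expr, threshold, noise_sd, n_trials, rng):
    """Fraction of noisy replicates in which each link passes the score threshold."""
    n_genes = expr.shape[0]
    counts = np.zeros((n_genes, n_genes))
    for _ in range(n_trials):
        # Assumption: experimental noise is modeled as additive Gaussian noise.
        noisy = expr + rng.normal(0.0, noise_sd, size=expr.shape)
        # Assumption: |Pearson correlation| replaces mutual information as relatedness.
        rel = np.abs(np.corrcoef(noisy))
        counts += clr_scores(rel) >= threshold
    return counts / n_trials

# Synthetic data (rows are genes, columns are conditions): two groups of three co-expressed genes.
rng = np.random.default_rng(0)
signal_a = rng.normal(size=30)
signal_b = rng.normal(size=30)
expr = np.vstack(
    [signal_a + 0.1 * rng.normal(size=30) for _ in range(3)]
    + [signal_b + 0.1 * rng.normal(size=30) for _ in range(3)]
)
freq = link_frequency(expr, threshold=1.0, noise_sd=0.2, n_trials=20, rng=rng)
```

Links whose frequency stays close to 1 as `noise_sd` grows would correspond to the reliably assigned links the abstract refers to; the cutoff used to call a link reliable is left to the reader.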

Citing Articles

From components to communities: bringing network science to clustering for molecular epidemiology.

Liu M, Chato C, Poon A. Virus Evol. 2023; 9(1):vead026.

PMID: 37187604 PMC: 10175948. DOI: 10.1093/ve/vead026.


Inferencing Bulk Tumor and Single-Cell Multi-Omics Regulatory Networks for Discovery of Biomarkers and Therapeutic Targets.

Ye Q, Guo N. Cells. 2023; 12(1).

PMID: 36611894 PMC: 9818242. DOI: 10.3390/cells12010101.


Data Mining a Medieval Medical Text Reveals Patterns in Ingredient Choice That Reflect Biological Activity against Infectious Agents.

Connelly E, Del Genio C, Harrison F. mBio. 2020; 11(1).

PMID: 32047130 PMC: 7018648. DOI: 10.1128/mBio.03136-19.


MetaOmGraph: a workbench for interactive exploratory data analysis of large expression datasets.

Singh U, Hur M, Dorman K, Syrkin Wurtele E. Nucleic Acids Res. 2020; 48(4):e23.

PMID: 31956905 PMC: 7039010. DOI: 10.1093/nar/gkz1209.


Diversity in Operon Regulation among Diverse Escherichia coli Isolates Depends on the Broader Genetic Background but Is Not Explained by Genetic Relatedness.

Phillips K, Widmann S, Lai H, Nguyen J, Ray J, Balazsi G. mBio. 2019; 10(6).

PMID: 31719176 PMC: 6851279. DOI: 10.1128/mBio.02232-19.

