» Articles » PMID: 21884587

Quantitative Utilization of Prior Biological Knowledge in the Bayesian Network Modeling of Gene Expression Data

Overview
Publisher Biomed Central
Specialty Biology
Date 2011 Sep 3
PMID 21884587
Citations 11
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Bayesian Network (BN) is a powerful approach to reconstructing genetic regulatory networks from gene expression data. However, expression data by itself suffers from high noise and lack of power. Incorporating prior biological knowledge can improve the performance. As each type of prior knowledge on its own may be incomplete or limited by quality issues, integrating multiple sources of prior knowledge to utilize their consensus is desirable.

Results: We introduce a new method to incorporate the quantitative information from multiple sources of prior knowledge. It first uses the Naïve Bayesian classifier to assess the likelihood of functional linkage between gene pairs based on prior knowledge. In this study we included cocitation in PubMed and schematic similarity in Gene Ontology annotation. A candidate network edge reservoir is then created in which the copy number of each edge is proportional to the estimated likelihood of linkage between the two corresponding genes. In network simulation the Markov Chain Monte Carlo sampling algorithm is adopted, and samples from this reservoir at each iteration to generate new candidate networks. We evaluated the new algorithm using both simulated and real gene expression data including that from a yeast cell cycle and a mouse pancreas development/growth study. Incorporating prior knowledge led to a ~2 fold increase in the number of known transcription regulations recovered, without significant change in false positive rate. In contrast, without the prior knowledge BN modeling is not always better than a random selection, demonstrating the necessity in network modeling to supplement the gene expression data with additional information.

Conclusion: our new development provides a statistical means to utilize the quantitative information in prior biological knowledge in the BN modeling of gene expression data, which significantly improves the performance.

Citing Articles

Correlation Imputation for Single-Cell RNA-seq.

Gan L, Vinci G, Allen G J Comput Biol. 2022; 29(5):465-482.

PMID: 35325552 PMC: 9125575. DOI: 10.1089/cmb.2021.0403.


Correlation Imputation in Single cell RNA-seq using Auxiliary Information and Ensemble Learning.

Gan L, Vinci G, Allen G ACM BCB. 2021; 2020.

PMID: 34278382 PMC: 8281968. DOI: 10.1145/3388440.3412462.


Comprehensive network modeling from single cell RNA sequencing of human and mouse reveals well conserved transcription regulation of hematopoiesis.

Gao S, Wu Z, Feng X, Kajigaya S, Wang X, Young N BMC Genomics. 2020; 21(Suppl 11):849.

PMID: 33372598 PMC: 7771096. DOI: 10.1186/s12864-020-07241-2.


Applications of Bayesian network models in predicting types of hematological malignancies.

Agrahari R, Foroushani A, Docking T, Chang L, Duns G, Hudoba M Sci Rep. 2018; 8(1):6951.

PMID: 29725024 PMC: 5934387. DOI: 10.1038/s41598-018-24758-5.


Differential Regulatory Analysis Based on Coexpression Network in Cancer Research.

Li J, Li Y, Li Y Biomed Res Int. 2016; 2016:4241293.

PMID: 27597964 PMC: 4997028. DOI: 10.1155/2016/4241293.


References
1.
Lechner A, Habener J . Stem/progenitor cells derived from adult tissues: potential for the treatment of diabetes mellitus. Am J Physiol Endocrinol Metab. 2003; 284(2):E259-66. DOI: 10.1152/ajpendo.00393.2002. View

2.
Gao S, Hartman 4th J, Carter J, Hessner M, Wang X . Global analysis of phase locking in gene expression during cell cycle: the potential in network modeling. BMC Syst Biol. 2010; 4:167. PMC: 3017040. DOI: 10.1186/1752-0509-4-167. View

3.
Imoto S, Goto T, Miyano S . Estimation of genetic networks and functional structures between genes by using Bayesian networks and nonparametric regression. Pac Symp Biocomput. 2002; :175-86. View

4.
Imoto S, Higuchi T, Goto T, Tashiro K, Kuhara S, Miyano S . Combining microarrays and biological knowledge for estimating gene networks via bayesian networks. J Bioinform Comput Biol. 2004; 2(1):77-98. DOI: 10.1142/s021972000400048x. View

5.
Friedman N, Linial M, Nachman I, Peer D . Using Bayesian networks to analyze expression data. J Comput Biol. 2000; 7(3-4):601-20. DOI: 10.1089/106652700750050961. View