» Articles » PMID: 19958477

Measuring Similarity Between Gene Expression Profiles: a Bayesian Approach

Overview
Journal BMC Genomics
Publisher Biomed Central
Specialty Genetics
Date 2009 Dec 5
PMID 19958477
Citations 5
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Grouping genes into clusters on the basis of similarity between their expression profiles has been the main approach to predict functional modules, from which important inference or further investigation decision could be made. While the univocal determination of similarity metric is important, current practices are normally involved with Euclidean distance and Pearson correlation, of which assumptions are not likely the case for high-throughput microarray data.

Results: We advocate the use of a novel metric - BayesGen - to measure similarity between gene expression profiles, and demonstrate its performance on two important applications: constructing genome-wide co-expression network, and clustering cancer human tissues into subtypes. BayesGen is formulated as the evidence ratio between two alternative hypotheses about the generating mechanism of a given pair of genes, and incorporates as prior knowledge the global characteristics of the whole dataset. Through the joint modelling of expected intensity levels and noise variances, it addresses the inherent nonlinearity and the association of noise levels across different microarray value ranges. The full Bayesian formulation also facilitates the possibility of meta-analysis.

Conclusion: BayesGen allows more effective extraction of similarity information between genes from microarray expression data, which has significant effect on various inference tasks. It also provides a robust choice for other object-feature data, as illustrated through the results of the test on synthetic data.

Citing Articles

Detection of an anti-angina therapeutic module in the effective population treated by a multi-target drug Danhong injection: a randomized trial.

Li D, Dong W, Liu Y, Wu Y, Tang D, Zhang F Signal Transduct Target Ther. 2021; 6(1):329.

PMID: 34471087 PMC: 8410855. DOI: 10.1038/s41392-021-00741-x.


Single cell analysis of adult mouse skeletal muscle stem cells in homeostatic and regenerative conditions.

DellOrso S, Juan A, Ko K, Naz F, Perovanovic J, Gutierrez-Cruz G Development. 2019; 146(12).

PMID: 30890574 PMC: 6602351. DOI: 10.1242/dev.174177.


Statistical approaches to use a model organism for regulatory sequences annotation of newly sequenced species.

Lio P, Angelini C, De Feis I, Nguyen V PLoS One. 2012; 7(9):e42489.

PMID: 22984403 PMC: 3439465. DOI: 10.1371/journal.pone.0042489.


Comprehensive literature review and statistical considerations for microarray meta-analysis.

Tseng G, Ghosh D, Feingold E Nucleic Acids Res. 2012; 40(9):3785-99.

PMID: 22262733 PMC: 3351145. DOI: 10.1093/nar/gkr1265.


Extending Asia Pacific bioinformatics into new realms in the "-omics" era.

Ranganathan S, Eisenhaber F, Tong J, Tan T BMC Genomics. 2009; 10 Suppl 3:S1.

PMID: 19958472 PMC: 2788361. DOI: 10.1186/1471-2164-10-S3-S1.

References
1.
Zhao J, Foulkes A, George E . Exploratory Bayesian model selection for serial genetics data. Biometrics. 2005; 61(2):591-9. DOI: 10.1111/j.1541-0420.2005.040417.x. View

2.
Myers C, Barrett D, Hibbs M, Huttenhower C, Troyanskaya O . Finding function: evaluation methods for functional genomic data. BMC Genomics. 2006; 7:187. PMC: 1560386. DOI: 10.1186/1471-2164-7-187. View

3.
Ashburner M, Ball C, Blake J, Botstein D, Butler H, Cherry J . Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 2000; 25(1):25-9. PMC: 3037419. DOI: 10.1038/75556. View

4.
Ball C, Dolinski K, Dwight S, Harris M, Kasarskis A, Scafe C . Integrating functional genomic information into the Saccharomyces genome database. Nucleic Acids Res. 1999; 28(1):77-80. PMC: 102447. DOI: 10.1093/nar/28.1.77. View

5.
Wakefield J . Bayes factors for genome-wide association studies: comparison with P-values. Genet Epidemiol. 2008; 33(1):79-86. DOI: 10.1002/gepi.20359. View