» Articles » PMID: 15833142

Filtering High-throughput Protein-protein Interaction Data Using a Combination of Genomic Features

Overview
Publisher Biomed Central
Specialty Biology
Date 2005 Apr 19
PMID 15833142
Citations 65
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Protein-protein interaction data used in the creation or prediction of molecular networks is usually obtained from large scale or high-throughput experiments. This experimental data is liable to contain a large number of spurious interactions. Hence, there is a need to validate the interactions and filter out the incorrect data before using them in prediction studies.

Results: In this study, we use a combination of 3 genomic features -- structurally known interacting Pfam domains, Gene Ontology annotations and sequence homology -- as a means to assign reliability to the protein-protein interactions in Saccharomyces cerevisiae determined by high-throughput experiments. Using Bayesian network approaches, we show that protein-protein interactions from high-throughput data supported by one or more genomic features have a higher likelihood ratio and hence are more likely to be real interactions. Our method has a high sensitivity (90%) and good specificity (63%). We show that 56% of the interactions from high-throughput experiments in Saccharomyces cerevisiae have high reliability. We use the method to estimate the number of true interactions in the high-throughput protein-protein interaction data sets in Caenorhabditis elegans, Drosophila melanogaster and Homo sapiens to be 27%, 18% and 68% respectively. Our results are available for searching and downloading at http://helix.protein.osaka-u.ac.jp/htp/.

Conclusion: A combination of genomic features that include sequence, structure and annotation information is a good predictor of true interactions in large and noisy high-throughput data sets. The method has a very high sensitivity and good specificity and can be used to assign a likelihood ratio, corresponding to the reliability, to each interaction.

Citing Articles

Heterogeneous network approaches to protein pathway prediction.

Nayar G, Altman R Comput Struct Biotechnol J. 2024; 23:2727-2739.

PMID: 39035835 PMC: 11260399. DOI: 10.1016/j.csbj.2024.06.022.


AURKA inhibition induces Ewing's sarcoma apoptosis and ferroptosis through NPM1/YAP1 axis.

Chen H, Hu J, Xiong X, Chen H, Lin B, Chen Y Cell Death Dis. 2024; 15(1):99.

PMID: 38287009 PMC: 10825207. DOI: 10.1038/s41419-024-06485-0.


Circular RNA ZBTB46 depletion alleviates the progression of Atherosclerosis by regulating the ubiquitination and degradation of hnRNPA2B1 via the AKT/mTOR pathway.

Fu Y, Jia Q, Ren M, Bie H, Zhang X, Zhang Q Immun Ageing. 2023; 20(1):66.

PMID: 37990246 PMC: 10662463. DOI: 10.1186/s12979-023-00386-0.


Computational approaches for the design of modulators targeting protein-protein interactions.

Rehman A, Khurshid B, Ali Y, Rasheed S, Wadood A, Ng H Expert Opin Drug Discov. 2023; 18(3):315-333.

PMID: 36715303 PMC: 10149343. DOI: 10.1080/17460441.2023.2171396.


BTC as a Novel Biomarker Contributing to EMT the PI3K-AKT Pathway in OSCC.

Shen T, Yang T, Yao M, Zheng Z, He M, Shao M Front Genet. 2022; 13:875617.

PMID: 35846125 PMC: 9283838. DOI: 10.3389/fgene.2022.875617.


References
1.
Berman H, Westbrook J, Feng Z, Gilliland G, Bhat T, Weissig H . The Protein Data Bank. Nucleic Acids Res. 1999; 28(1):235-42. PMC: 102472. DOI: 10.1093/nar/28.1.235. View

2.
Uetz P, Giot L, Cagney G, Mansfield T, Judson R, Knight J . A comprehensive analysis of protein-protein interactions in Saccharomyces cerevisiae. Nature. 2000; 403(6770):623-7. DOI: 10.1038/35001009. View

3.
Schwikowski B, Uetz P, Fields S . A network of protein-protein interactions in yeast. Nat Biotechnol. 2000; 18(12):1257-61. DOI: 10.1038/82360. View

4.
Rain J, Selig L, de Reuse H, Battaglia V, Reverdy C, Simon S . The protein-protein interaction map of Helicobacter pylori. Nature. 2001; 409(6817):211-5. DOI: 10.1038/35051615. View

5.
Ito T, Chiba T, Ozawa R, Yoshida M, Hattori M, Sakaki Y . A comprehensive two-hybrid analysis to explore the yeast protein interactome. Proc Natl Acad Sci U S A. 2001; 98(8):4569-74. PMC: 31875. DOI: 10.1073/pnas.061034498. View