» Articles » PMID: 23812989

Predicting Protein Interactions Via Parsimonious Network History Inference

Overview
Journal Bioinformatics
Specialty Biology
Date 2013 Jul 2
PMID 23812989
Citations 4
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: Reconstruction of the network-level evolutionary history of protein-protein interactions provides a principled way to relate interactions in several present-day networks. Here, we present a general framework for inferring such histories and demonstrate how it can be used to determine what interactions existed in the ancestral networks, which present-day interactions we might expect to exist based on evolutionary evidence and what information extant networks contain about the order of ancestral protein duplications.

Results: Our framework characterizes the space of likely parsimonious network histories. It results in a structure that can be used to find probabilities for a number of events associated with the histories. The framework is based on a directed hypergraph formulation of dynamic programming that we extend to enumerate many optimal and near-optimal solutions. The algorithm is applied to reconstructing ancestral interactions among bZIP transcription factors, imputing missing present-day interactions among the bZIPs and among proteins from five herpes viruses, and determining relative protein duplication order in the bZIP family. Our approach more accurately reconstructs ancestral interactions than existing approaches. In cross-validation tests, we find that our approach ranks the majority of the left-out present-day interactions among the top 2 and 17% of possible edges for the bZIP and herpes networks, respectively, making it a competitive approach for edge imputation. It also estimates relative bZIP protein duplication orders, using only interaction data and phylogenetic tree topology, which are significantly correlated with sequence-based estimates.

Availability: The algorithm is implemented in C++, is open source and is available at http://www.cs.cmu.edu/ckingsf/software/parana2.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Citing Articles

Proteogenomic insights suggest druggable pathways in endometrial carcinoma.

Dou Y, Katsnelson L, Gritsenko M, Hu Y, Reva B, Hong R Cancer Cell. 2023; 41(9):1586-1605.e15.

PMID: 37567170 PMC: 10631452. DOI: 10.1016/j.ccell.2023.07.007.


A parsimonious approach for recognizing SARS-CoV-2 and host interactions.

Ganguly B J Med Virol. 2021; 93(7):4576-4584.

PMID: 33506962 PMC: 8014500. DOI: 10.1002/jmv.26824.


Maximum likelihood reconstruction of ancestral networks by integer linear programming.

Rajan V, Zhang Z, Kingsford C, Zhang X Bioinformatics. 2020; 37(8):1083-1092.

PMID: 33135733 PMC: 8599758. DOI: 10.1093/bioinformatics/btaa931.


Ancestral state reconstruction of metabolic pathways across pangenome ensembles.

Psomopoulos F, van Helden J, Medigue C, Chasapi A, Ouzounis C Microb Genom. 2020; 6(11).

PMID: 32924924 PMC: 7725326. DOI: 10.1099/mgen.0.000429.

References
1.
Mithani A, Preston G, Hein J . A stochastic model for the evolution of metabolic networks with neighbor dependence. Bioinformatics. 2009; 25(12):1528-35. DOI: 10.1093/bioinformatics/btp262. View

2.
Borenstein E, Feldman M . Topological signatures of species interactions in metabolic networks. J Comput Biol. 2009; 16(2):191-200. PMC: 3035845. DOI: 10.1089/cmb.2008.06TT. View

3.
Dutkowski J, Tiuryn J . Identification of functional modules from conserved ancestral protein-protein interactions. Bioinformatics. 2007; 23(13):i149-58. DOI: 10.1093/bioinformatics/btm194. View

4.
Pereira-Leal J, Levy E, Kamp C, Teichmann S . Evolution of protein complexes by duplication of homomeric interactions. Genome Biol. 2007; 8(4):R51. PMC: 1895999. DOI: 10.1186/gb-2007-8-4-r51. View

5.
Huerta-Cepas J, Dopazo J, Gabaldon T . ETE: a python Environment for Tree Exploration. BMC Bioinformatics. 2010; 11:24. PMC: 2820433. DOI: 10.1186/1471-2105-11-24. View