Protein Interaction Maps for Complete Genomes Based on Gene Fusion Events
Affiliations
A large-scale effort to measure, detect and analyse protein-protein interactions using experimental methods is under way. These include biochemistry such as co-immunoprecipitation or crosslinking, molecular biology such as the two-hybrid system or phage display, and genetics such as unlinked noncomplementing mutant detection. Using the two-hybrid system, an international effort to analyse the complete yeast genome is in progress. Evidently, all these approaches are tedious, labour intensive and inaccurate. From a computational perspective, the question is how can we predict that two proteins interact from structure or sequence alone. Here we present a method that identifies gene-fusion events in complete genomes, solely based on sequence comparison. Because there must be selective pressure for certain genes to be fused over the course of evolution, we are able to predict functional associations of proteins. We show that 215 genes or proteins in the complete genomes of Escherichia coli, Haemophilus influenzae and Methanococcus jannaschii are involved in 64 unique fusion events. The approach is general, and can be applied even to genes of unknown function.
Dead-End protein expression, function, and mutation in cancer: a systematic review.
Faraji H, Banakar F, Sadri A, Ebrahim-Habibi A Mol Biol Rep. 2025; 52(1):291.
PMID: 40053212 DOI: 10.1007/s11033-025-10325-5.
From duplication to fusion: Expanding Dayhoff's model of protein evolution.
Muthahari Y, Magnus L, Laurino P Protein Sci. 2025; 34(3):e70054.
PMID: 39969106 PMC: 11837038. DOI: 10.1002/pro.70054.
Review and Comparative Analysis of Methods and Advancements in Predicting Protein Complex Structure.
Zhao N, Wu T, Wang W, Zhang L, Gong X Interdiscip Sci. 2024; 16(2):261-288.
PMID: 38955920 DOI: 10.1007/s12539-024-00626-x.
Pairing interacting protein sequences using masked language modeling.
Lupo U, Sgarbossa D, Bitbol A Proc Natl Acad Sci U S A. 2024; 121(27):e2311887121.
PMID: 38913900 PMC: 11228504. DOI: 10.1073/pnas.2311887121.
Fusion/fission protein family identification in Archaea.
Padalko A, Nair G, Sousa F mSystems. 2024; 9(6):e0094823.
PMID: 38700364 PMC: 11237513. DOI: 10.1128/msystems.00948-23.