GEMME: A Simple and Fast Global Epistatic Model Predicting Mutational Effects
Overview
Affiliations
The systematic and accurate description of protein mutational landscapes is a question of utmost importance in biology, bioengineering, and medicine. Recent progress has been achieved by leveraging on the increasing wealth of genomic data and by modeling intersite dependencies within biological sequences. However, state-of-the-art methods remain time consuming. Here, we present Global Epistatic Model for predicting Mutational Effects (GEMME) (www.lcqb.upmc.fr/GEMME), an original and fast method that predicts mutational outcomes by explicitly modeling the evolutionary history of natural sequences. This allows accounting for all positions in a sequence when estimating the effect of a given mutation. GEMME uses only a few biologically meaningful and interpretable parameters. Assessed against 50 high- and low-throughput mutational experiments, it overall performs similarly or better than existing methods. It accurately predicts the mutational landscapes of a wide range of protein families, including viral ones and, more generally, of much conserved families. Given an input alignment, it generates the full mutational landscape of a protein in a matter of minutes. It is freely available as a package and a webserver at www.lcqb.upmc.fr/GEMME/.
Cryptic genetic variation shapes the fate of gene duplicates in a protein interaction network.
Dibyachintan S, Dube A, Bradley D, Lemieux P, Dionne U, Landry C Nat Commun. 2025; 16(1):1530.
PMID: 39934115 PMC: 11814230. DOI: 10.1038/s41467-025-56597-0.
Exploring Evolution to Uncover Insights Into Protein Mutational Stability.
Hermans P, Tsishyn M, Schwersensky M, Rooman M, Pucci F Mol Biol Evol. 2025; 42(1).
PMID: 39786559 PMC: 11721782. DOI: 10.1093/molbev/msae267.
Protein stability models fail to capture epistatic interactions of double point mutations.
Dieckhaus H, Kuhlman B Protein Sci. 2024; 34(1):e70003.
PMID: 39704075 PMC: 11659742. DOI: 10.1002/pro.70003.
A general temperature-guided language model to design proteins of enhanced stability and activity.
Jiang F, Li M, Dong J, Yu Y, Sun X, Wu B Sci Adv. 2024; 10(48):eadr2641.
PMID: 39602544 PMC: 11601203. DOI: 10.1126/sciadv.adr2641.
Expert-guided protein language models enable accurate and blazingly fast fitness prediction.
Marquet C, Schlensok J, Abakarova M, Rost B, Laine E Bioinformatics. 2024; 40(11).
PMID: 39576695 PMC: 11588025. DOI: 10.1093/bioinformatics/btae621.