» Articles » PMID: 27193686

Local Fitness Landscape of the Green Fluorescent Protein

Abstract

Fitness landscapes depict how genotypes manifest at the phenotypic level and form the basis of our understanding of many areas of biology, yet their properties remain elusive. Previous studies have analysed specific genes, often using their function as a proxy for fitness, experimentally assessing the effect on function of single mutations and their combinations in a specific sequence or in different sequences. However, systematic high-throughput studies of the local fitness landscape of an entire protein have not yet been reported. Here we visualize an extensive region of the local fitness landscape of the green fluorescent protein from Aequorea victoria (avGFP) by measuring the native function (fluorescence) of tens of thousands of derivative genotypes of avGFP. We show that the fitness landscape of avGFP is narrow, with 3/4 of the derivatives with a single mutation showing reduced fluorescence and half of the derivatives with four mutations being completely non-fluorescent. The narrowness is enhanced by epistasis, which was detected in up to 30% of genotypes with multiple mutations and mostly occurred through the cumulative effect of slightly deleterious mutations causing a threshold-like decrease in protein stability and a concomitant loss of fluorescence. A model of orthologous sequence divergence spanning hundreds of millions of years predicted the extent of epistasis in our data, indicating congruence between the fitness landscape properties at the local and global scales. The characterization of the local fitness landscape of avGFP has important implications for several fields including molecular evolution, population genetics and protein design.

Citing Articles

MMRT: MultiMut Recursive Tree for predicting functional effects of high-order protein variants from low-order variants.

Forrest B, Derbel H, Zhao Z, Liu Q Comput Struct Biotechnol J. 2025; 27:672-681.

PMID: 40070521 PMC: 11894328. DOI: 10.1016/j.csbj.2025.02.012.


Resolving discrepancies between chimeric and multiplicative measures of higher-order epistasis.

Chitra U, Arnold B, Raphael B Nat Commun. 2025; 16(1):1711.

PMID: 39962081 PMC: 11833126. DOI: 10.1038/s41467-025-56986-5.


Self-supervised machine learning methods for protein design improve sampling but not the identification of high-fitness variants.

Ertelt M, Moretti R, Meiler J, Schoeder C Sci Adv. 2025; 11(7):eadr7338.

PMID: 39937901 PMC: 11817935. DOI: 10.1126/sciadv.adr7338.


DeepPFP: a multi-task-aware architecture for protein function prediction.

Wang H, Ren Z, Sun J, Chen Y, Bo X, Xue J Brief Bioinform. 2025; 26(1).

PMID: 39905954 PMC: 11794456. DOI: 10.1093/bib/bbae579.


CPI-Pred: A deep learning framework for predicting functional parameters of compound-protein interactions.

Xu Z, Barghout R, Wu J, Garg D, Song Y, Mahadevan R bioRxiv. 2025; .

PMID: 39896624 PMC: 11785036. DOI: 10.1101/2025.01.16.633372.


References
1.
Coates M, Garm A, Theobald J, Thompson S, Nilsson D . The spectral sensitivity of the lens eyes of a box jellyfish, Tripedalia cystophora (Conant). J Exp Biol. 2006; 209(Pt 19):3758-65. DOI: 10.1242/jeb.02431. View

2.
Jacquier H, Birgy A, Le Nagard H, Mechulam Y, Schmitt E, Glodt J . Capturing the mutational landscape of the beta-lactamase TEM-1. Proc Natl Acad Sci U S A. 2013; 110(32):13067-72. PMC: 3740883. DOI: 10.1073/pnas.1215206110. View

3.
Melamed D, Young D, Gamble C, Miller C, Fields S . Deep mutational scanning of an RRM domain of the Saccharomyces cerevisiae poly(A)-binding protein. RNA. 2013; 19(11):1537-51. PMC: 3851721. DOI: 10.1261/rna.040709.113. View

4.
Bank C, Hietpas R, Jensen J, Bolon D . A systematic survey of an intragenic epistatic landscape. Mol Biol Evol. 2014; 32(1):229-38. PMC: 4271534. DOI: 10.1093/molbev/msu301. View

5.
Kimura M, Crow J . Effect of overall phenotypic selection on genetic change at individual loci. Proc Natl Acad Sci U S A. 1978; 75(12):6168-71. PMC: 393140. DOI: 10.1073/pnas.75.12.6168. View