» Articles » PMID: 35900023

Benchmarking AlphaFold for Protein Complex Modeling Reveals Accuracy Determinants

Overview
Journal Protein Sci
Specialty Biochemistry
Date 2022 Jul 28
PMID 35900023
Authors
Affiliations
Soon will be listed here.
Abstract

High-resolution experimental structural determination of protein-protein interactions has led to valuable mechanistic insights, yet due to the massive number of interactions and experimental limitations there is a need for computational methods that can accurately model their structures. Here we explore the use of the recently developed deep learning method, AlphaFold, to predict structures of protein complexes from sequence. With a benchmark of 152 diverse heterodimeric protein complexes, multiple implementations and parameters of AlphaFold were tested for accuracy. Remarkably, many cases (43%) had near-native models (medium or high critical assessment of predicted interactions accuracy) generated as top-ranked predictions by AlphaFold, greatly surpassing the performance of unbound protein-protein docking (9% success rate for near-native top-ranked models), however AlphaFold modeling of antibody-antigen complexes within our set was unsuccessful. We identified sequence and structural features associated with lack of AlphaFold success, and we also investigated the impact of multiple sequence alignment input. Benchmarking of a multimer-optimized version of AlphaFold (AlphaFold-Multimer) with a set of recently released antibody-antigen structures confirmed a low rate of success for antibody-antigen complexes (11% success), and we found that T cell receptor-antigen complexes are likewise not accurately modeled by that algorithm, showing that adaptive immune recognition poses a challenge for the current AlphaFold algorithm and model. Overall, our study demonstrates that end-to-end deep learning can accurately model many transient protein complexes, and highlights areas of improvement for future developments to reliably model any protein-protein interaction of interest.

Citing Articles

Unlocking the potential of approach in designing antibodies against SARS-CoV-2.

Subramaniam T, Mualif S, Chan W, Abd Halim K Front Bioinform. 2025; 5:1533983.

PMID: 40017562 PMC: 11865036. DOI: 10.3389/fbinf.2025.1533983.


: What's wrong with AlphaFold's score and how to fix it.

Dunbrack R, Dunbrack Jr R bioRxiv. 2025; .

PMID: 39990437 PMC: 11844409. DOI: 10.1101/2025.02.10.637595.


Epitope and Paratope Mapping of a SUMO-Remnant Antibody Using Cross-Linking Mass Spectrometry and Molecular Docking.

Comtois-Marotte S, Bonneil E, Li C, Smith M, Thibault P J Proteome Res. 2025; 24(3):1092-1101.

PMID: 39965925 PMC: 11895775. DOI: 10.1021/acs.jproteome.4c00717.


Sam-Sam Association Between EphA2 and SASH1: In Silico Studies of Cancer-Linked Mutations.

Vincenzi M, Mercurio F, Autiero I, Leone M Molecules. 2025; 30(3).

PMID: 39942820 PMC: 11820823. DOI: 10.3390/molecules30030718.


Predicting Antibody Affinity Changes upon Mutation Based on Unbound Protein Structures.

Chen Z, He S, Chi X, Bo X Int J Mol Sci. 2025; 26(3).

PMID: 39941111 PMC: 11818220. DOI: 10.3390/ijms26031343.


References
1.
Fu L, Niu B, Zhu Z, Wu S, Li W . CD-HIT: accelerated for clustering the next-generation sequencing data. Bioinformatics. 2012; 28(23):3150-2. PMC: 3516142. DOI: 10.1093/bioinformatics/bts565. View

2.
Rossjohn J, Gras S, Miles J, Turner S, Godfrey D, McCluskey J . T cell antigen receptor recognition of antigen-presenting molecules. Annu Rev Immunol. 2014; 33:169-200. DOI: 10.1146/annurev-immunol-032414-112334. View

3.
Dey S, Pal A, Chakrabarti P, Janin J . The subunit interfaces of weakly associated homodimeric proteins. J Mol Biol. 2010; 398(1):146-60. DOI: 10.1016/j.jmb.2010.02.020. View

4.
Mirdita M, Schutze K, Moriwaki Y, Heo L, Ovchinnikov S, Steinegger M . ColabFold: making protein folding accessible to all. Nat Methods. 2022; 19(6):679-682. PMC: 9184281. DOI: 10.1038/s41592-022-01488-1. View

5.
Rose P, Beran B, Bi C, Bluhm W, Dimitropoulos D, Goodsell D . The RCSB Protein Data Bank: redesigned web site and web services. Nucleic Acids Res. 2010; 39(Database issue):D392-401. PMC: 3013649. DOI: 10.1093/nar/gkq1021. View