CombFold: Predicting Structures of Large Protein Assemblies Using a Combinatorial Assembly Algorithm and AlphaFold2
Overview
Authors
Affiliations
Deep learning models, such as AlphaFold2 and RosettaFold, enable high-accuracy protein structure prediction. However, large protein complexes are still challenging to predict due to their size and the complexity of interactions between multiple subunits. Here we present CombFold, a combinatorial and hierarchical assembly algorithm for predicting structures of large protein complexes utilizing pairwise interactions between subunits predicted by AlphaFold2. CombFold accurately predicted (TM-score >0.7) 72% of the complexes among the top-10 predictions in two datasets of 60 large, asymmetric assemblies. Moreover, the structural coverage of predicted complexes was 20% higher compared to corresponding Protein Data Bank entries. We applied the method on complexes from Complex Portal with known stoichiometry but without known structure and obtained high-confidence predictions. CombFold supports the integration of distance restraints based on crosslinking mass spectrometry and fast enumeration of possible complex stoichiometries. CombFold's high accuracy makes it a promising tool for expanding structural coverage beyond monomeric proteins.
Cheng J, Liu J, Neupane P Res Sq. 2025; .
PMID: 39975926 PMC: 11838762. DOI: 10.21203/rs.3.rs-5855710/v1.
Frontiers in integrative structural modeling of macromolecular assemblies.
Majila K, Arvindekar S, Jindal M, Viswanath S QRB Discov. 2025; 6:e3.
PMID: 39944881 PMC: 11811862. DOI: 10.1017/qrd.2024.15.
Liu J, Neupane P, Cheng J bioRxiv. 2025; .
PMID: 39868088 PMC: 11761747. DOI: 10.1101/2025.01.12.632663.
Lalit F, Jose A Nucleic Acids Res. 2025; 53(1.
PMID: 39788543 PMC: 11717427. DOI: 10.1093/nar/gkae1246.
Madaj R, Martinez-Goikoetxea M, Kaminski K, Ludwiczak J, Dunin-Horkawicz S Protein Sci. 2024; 34(1):e5244.
PMID: 39688306 PMC: 11651203. DOI: 10.1002/pro.5244.