FilterDCA: Interpretable Supervised Contact Prediction Using Inter-domain Coevolution
Overview
Affiliations
Predicting three-dimensional protein structure and assembling protein complexes using sequence information belongs to the most prominent tasks in computational biology. Recently substantial progress has been obtained in the case of single proteins using a combination of unsupervised coevolutionary sequence analysis with structurally supervised deep learning. While reaching impressive accuracies in predicting residue-residue contacts, deep learning has a number of disadvantages. The need for large structural training sets limits the applicability to multi-protein complexes; and their deep architecture makes the interpretability of the convolutional neural networks intrinsically hard. Here we introduce FilterDCA, a simpler supervised predictor for inter-domain and inter-protein contacts. It is based on the fact that contact maps of proteins show typical contact patterns, which results from secondary structure and are reflected by patterns in coevolutionary analysis. We explicitly integrate averaged contacts patterns with coevolutionary scores derived by Direct Coupling Analysis, improving performance over standard coevolutionary analysis, while remaining fully transparent and interpretable. The FilterDCA code is available at http://gitlab.lcqb.upmc.fr/muscat/FilterDCA.
Generating interacting protein sequences using domain-to-domain translation.
Meynard-Piganeau B, Fabbri C, Weigt M, Pagnani A, Feinauer C Bioinformatics. 2023; 39(7).
PMID: 37399105 PMC: 10329493. DOI: 10.1093/bioinformatics/btad401.
Characterizing interactions in E-cadherin assemblages.
Shome S, Jia K, Sivasankar S, Jernigan R Biophys J. 2023; 122(15):3069-3077.
PMID: 37345249 PMC: 10432173. DOI: 10.1016/j.bpj.2023.06.009.
Inverse Potts model improves accuracy of phylogenetic profiling.
Fukunaga T, Iwasaki W Bioinformatics. 2022; 38(7):1794-1800.
PMID: 35060594 PMC: 8963296. DOI: 10.1093/bioinformatics/btac034.
CoCoNet-boosting RNA contact prediction by convolutional neural networks.
Zerihun M, Pucci F, Schug A Nucleic Acids Res. 2021; 49(22):12661-12672.
PMID: 34871451 PMC: 8682773. DOI: 10.1093/nar/gkab1144.
Evaluation of residue-residue contact prediction methods: From retrospective to prospective.
Zhang H, Bei Z, Xi W, Hao M, Ju Z, Saravanan K PLoS Comput Biol. 2021; 17(5):e1009027.
PMID: 34029314 PMC: 8177648. DOI: 10.1371/journal.pcbi.1009027.