» Articles » PMID: 34805992

MmCSM-NA: Accurately Predicting Effects of Single and Multiple Mutations on Protein-nucleic Acid Binding Affinity

Overview
Specialty Biology
Date 2021 Nov 22
PMID 34805992
Citations 7
Authors
Affiliations
Soon will be listed here.
Abstract

While protein-nucleic acid interactions are pivotal for many crucial biological processes, limited experimental data has made the development of computational approaches to characterise these interactions a challenge. Consequently, most approaches to understand the effects of missense mutations on protein-nucleic acid affinity have focused on single-point mutations and have presented a limited performance on independent data sets. To overcome this, we have curated the largest dataset of experimentally measured effects of mutations on nucleic acid binding affinity to date, encompassing 856 single-point mutations and 141 multiple-point mutations across 155 experimentally solved complexes. This was used in combination with an optimized version of our graph-based signatures to develop mmCSM-NA (http://biosig.unimelb.edu.au/mmcsm_na), the first scalable method capable of quantitatively and accurately predicting the effects of multiple-point mutations on nucleic acid binding affinities. mmCSM-NA obtained a Pearson's correlation of up to 0.67 (RMSE of 1.06 Kcal/mol) on single-point mutations under cross-validation, and up to 0.65 on independent non-redundant datasets of multiple-point mutations (RMSE of 1.12 kcal/mol), outperforming similar tools. mmCSM-NA is freely available as an easy-to-use web-server and API. We believe it will be an invaluable tool to shed light on the role of mutations affecting protein-nucleic acid interactions in diseases.

Citing Articles

Decoding the effects of mutation on protein interactions using machine learning.

Xu W, Li A, Zhao Y, Peng Y Biophys Rev (Melville). 2025; 6(1):011307.

PMID: 40013003 PMC: 11857871. DOI: 10.1063/5.0249920.


Further Development of SAMPDI-3D: A Machine Learning Method for Predicting Binding Free Energy Changes Caused by Mutations in Either Protein or DNA.

Rimal P, Paul S, Panday S, Alexov E Genes (Basel). 2025; 16(1).

PMID: 39858648 PMC: 11764785. DOI: 10.3390/genes16010101.


Rifaximin prophylaxis causes resistance to the last-resort antibiotic daptomycin.

Turner A, Li L, Monk I, Lee J, Ingle D, Portelli S Nature. 2024; 635(8040):969-977.

PMID: 39443798 PMC: 11602712. DOI: 10.1038/s41586-024-08095-4.


PNBACE: an ensemble algorithm to predict the effects of mutations on protein-nucleic acid binding affinity.

Xiao S, Zhang Y, Liu K, Huang Y, Liu R BMC Biol. 2024; 22(1):203.

PMID: 39256728 PMC: 11389284. DOI: 10.1186/s12915-024-02006-9.


PRIMITI: A computational approach for accurate prediction of miRNA-target mRNA interaction.

Uthayopas K, de Sa A, Alavi A, Pires D, Ascher D Comput Struct Biotechnol J. 2024; 23:3030-3039.

PMID: 39175797 PMC: 11340604. DOI: 10.1016/j.csbj.2024.06.030.


References
1.
Myung Y, Pires D, Ascher D . mmCSM-AB: guiding rational antibody engineering through multiple point mutations. Nucleic Acids Res. 2020; 48(W1):W125-W131. PMC: 7319589. DOI: 10.1093/nar/gkaa389. View

2.
von Hippel P, Bear D, Morgan W, McSwiggen J . Protein-nucleic acid interactions in transcription: a molecular analysis. Annu Rev Biochem. 1984; 53:389-446. DOI: 10.1146/annurev.bi.53.070184.002133. View

3.
Portelli S, Phelan J, Ascher D, Clark T, Furnham N . Understanding molecular consequences of putative drug resistant mutations in Mycobacterium tuberculosis. Sci Rep. 2018; 8(1):15356. PMC: 6193939. DOI: 10.1038/s41598-018-33370-6. View

4.
Zhang S, Zhao L, Zheng C, Xia J . A feature-based approach to predict hot spots in protein-DNA binding interfaces. Brief Bioinform. 2019; 21(3):1038-1046. DOI: 10.1093/bib/bbz037. View

5.
Pan Y, Wang Z, Zhan W, Deng L . Computational identification of binding energy hot spots in protein-RNA complexes using an ensemble approach. Bioinformatics. 2017; 34(9):1473-1480. DOI: 10.1093/bioinformatics/btx822. View