» Articles » PMID: 38449296

Graph-theoretical Prediction of Biological Modules in Quaternary Structures of Large Protein Complexes

Overview
Journal Bioinformatics
Specialty Biology
Date 2024 Mar 7
PMID 38449296
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: The functional complexity of biochemical processes is strongly related to the interplay of proteins and their assembly into protein complexes. In recent years, the discovery and characterization of protein complexes have substantially progressed through advances in cryo-electron microscopy, proteomics, and computational structure prediction. This development results in a strong need for computational approaches to analyse the data of large protein complexes for structural and functional characterization. Here, we aim to provide a suitable approach, which processes the growing number of large protein complexes, to obtain biologically meaningful information on the hierarchical organization of the structures of protein complexes.

Results: We modelled the quaternary structure of protein complexes as undirected, labelled graphs called complex graphs. In complex graphs, the vertices represent protein chains and the edges spatial chain-chain contacts. We hypothesized that clusters based on the complex graph correspond to functional biological modules. To compute the clusters, we applied the Leiden clustering algorithm. To evaluate our approach, we chose the human respiratory complex I, which has been extensively investigated and exhibits a known biological module structure experimentally validated. Additionally, we characterized a eukaryotic group II chaperonin TRiC/CCT and the head of the bacteriophage Φ29. The analysis of the protein complexes correlated with experimental findings and indicated known functional, biological modules. Using our approach enables not only to predict functional biological modules in large protein complexes with characteristic features but also to investigate the flexibility of specific regions and coformational changes. The predicted modules can aid in the planning and analysis of experiments.

Availability And Implementation: Jupyter notebooks to reproduce the examples are available on our public GitHub repository: https://github.com/MolBIFFM/PTGLtools/tree/main/PTGLmodulePrediction.

References
1.
Lin Z, Akin H, Rao R, Hie B, Zhu Z, Lu W . Evolutionary-scale prediction of atomic-level protein structure with a language model. Science. 2023; 379(6637):1123-1130. DOI: 10.1126/science.ade2574. View

2.
Kampjut D, Sazanov L . The coupling mechanism of mammalian respiratory complex I. Science. 2020; 370(6516). DOI: 10.1126/science.abc4209. View

3.
Brandt U . Energy converting NADH:quinone oxidoreductase (complex I). Annu Rev Biochem. 2006; 75:69-92. DOI: 10.1146/annurev.biochem.75.103004.142539. View

4.
OReilly F, Graziadei A, Forbrig C, Bremenkamp R, Charles K, Lenz S . Protein complexes in cells by AI-assisted structural proteomics. Mol Syst Biol. 2023; 19(4):e11544. PMC: 10090944. DOI: 10.15252/msb.202311544. View

5.
Marsh J, Teichmann S, Forman-Kay J . Probing the diverse landscape of protein flexibility and binding. Curr Opin Struct Biol. 2012; 22(5):643-50. DOI: 10.1016/j.sbi.2012.08.008. View