» Articles » PMID: 37371503

Mathematical and Machine Learning Approaches for Classification of Protein Secondary Structure Elements from Coordinates

Overview
Journal Biomolecules
Publisher MDPI
Date 2023 Jun 28
PMID 37371503
Authors
Affiliations
Soon will be listed here.
Abstract

Determining Secondary Structure Elements (SSEs) for any protein is crucial as an intermediate step for experimental tertiary structure determination. SSEs are identified using popular tools such as DSSP and STRIDE. These tools use atomic information to locate hydrogen bonds to identify SSEs. When some spatial atomic details are missing, locating SSEs becomes a hinder. To address the problem, when some atomic information is missing, three approaches for classifying SSE types using Cα atoms in protein chains were developed: (1) a mathematical approach, (2) a deep learning approach, and (3) an ensemble of five machine learning models. The proposed methods were compared against each other and with a state-of-the-art approach, PCASSO.

References
1.
Burley S, Berman H, Bhikadiya C, Bi C, Chen L, Di Costanzo L . RCSB Protein Data Bank: biological macromolecular structures enabling research and education in fundamental biology, biomedicine, biotechnology and energy. Nucleic Acids Res. 2018; 47(D1):D464-D474. PMC: 6324064. DOI: 10.1093/nar/gky1004. View

2.
Richards F, Kundrot C . Identification of structural motifs from protein coordinate data: secondary structure and first-level supersecondary structure. Proteins. 1988; 3(2):71-84. DOI: 10.1002/prot.340030202. View

3.
Provencher S, Glockner J . Estimation of globular protein secondary structure from circular dichroism. Biochemistry. 1981; 20(1):33-7. DOI: 10.1021/bi00504a006. View

4.
Labesse G, Colloch N, Pothier J, Mornon J . P-SEA: a new efficient assignment of secondary structure from C alpha trace of proteins. Comput Appl Biosci. 1997; 13(3):291-5. DOI: 10.1093/bioinformatics/13.3.291. View

5.
Sussman J, Lin D, Jiang J, Manning N, Prilusky J, Ritter O . Protein Data Bank (PDB): database of three-dimensional structural information of biological macromolecules. Acta Crystallogr D Biol Crystallogr. 1999; 54(Pt 6 Pt 1):1078-84. DOI: 10.1107/s0907444998009378. View