» Articles » PMID: 25843214

Novel 3D Bio-macromolecular Bilinear Descriptors for Protein Science: Predicting Protein Structural Classes

Overview
Journal J Theor Biol
Publisher Elsevier
Specialty Biology
Date 2015 Apr 7
PMID 25843214
Citations 5
Authors
Affiliations
Soon will be listed here.
Abstract

In the present study, we introduce novel 3D protein descriptors based on the bilinear algebraic form in the ℝ(n) space on the coulombic matrix. For the calculation of these descriptors, macromolecular vectors belonging to ℝ(n) space, whose components represent certain amino acid side-chain properties, were used as weighting schemes. Generalization approaches for the calculation of inter-amino acidic residue spatial distances based on Minkowski metrics are proposed. The simple- and double-stochastic schemes were defined as approaches to normalize the coulombic matrix. The local-fragment indices for both amino acid-types and amino acid-groups are presented in order to permit characterizing fragments of interest in proteins. On the other hand, with the objective of taking into account specific interactions among amino acids in global or local indices, geometric and topological cut-offs are defined. To assess the utility of global and local indices a classification model for the prediction of the major four protein structural classes, was built with the Linear Discriminant Analysis (LDA) technique. The developed LDA-model correctly classifies the 92.6% and 92.7% of the proteins on the training and test sets, respectively. The obtained model showed high values of the generalized square correlation coefficient (GC(2)) on both the training and test series. The statistical parameters derived from the internal and external validation procedures demonstrate the robustness, stability and the high predictive power of the proposed model. The performance of the LDA-model demonstrates the capability of the proposed indices not only to codify relevant biochemical information related to the structural classes of proteins, but also to yield suitable interpretability. It is anticipated that the current method will benefit the prediction of other protein attributes or functions.

Citing Articles

An overview of descriptors to capture protein properties - Tools and perspectives in the context of QSAR modeling.

Emonts J, Buyel J Comput Struct Biotechnol J. 2024; 21:3234-3247.

PMID: 38213891 PMC: 10781719. DOI: 10.1016/j.csbj.2023.05.022.


Fuzzy spherical truncation-based multi-linear protein descriptors: From their definition to application in structural-related predictions.

Contreras-Torres E, Marrero-Ponce Y, Teran J, Aguero-Chapin G, Antunes A, Garcia-Jacas C Front Chem. 2022; 10:959143.

PMID: 36277354 PMC: 9585278. DOI: 10.3389/fchem.2022.959143.


Graph Theory-Based Sequence Descriptors as Remote Homology Predictors.

Aguero-Chapin G, Galpert D, Molina-Ruiz R, Ancede-Gallardo E, Perez-Machado G, de la Riva G Biomolecules. 2019; 10(1).

PMID: 31878100 PMC: 7022958. DOI: 10.3390/biom10010026.


Tensor Algebra-based Geometrical (3D) Biomacro-Molecular Descriptors for Protein Research: Theory, Applications and Comparison with other Methods.

Teran J, Marrero-Ponce Y, Contreras-Torres E, Garcia-Jacas C, Vivas-Reyes R, Teran E Sci Rep. 2019; 9(1):11391.

PMID: 31388082 PMC: 6684663. DOI: 10.1038/s41598-019-47858-2.


Scaffold-Hopping from Synthetic Drugs by Holistic Molecular Representation.

Grisoni F, Merk D, Byrne R, Schneider G Sci Rep. 2018; 8(1):16469.

PMID: 30405170 PMC: 6220272. DOI: 10.1038/s41598-018-34677-0.