» Articles » PMID: 31388082

Tensor Algebra-based Geometrical (3D) Biomacro-Molecular Descriptors for Protein Research: Theory, Applications and Comparison with Other Methods

Overview
Journal Sci Rep
Specialty Science
Date 2019 Aug 8
PMID 31388082
Citations 3
Authors
Affiliations
Soon will be listed here.
Abstract

In this report, a new type of tridimensional (3D) biomacro-molecular descriptors for proteins are proposed. These descriptors make use of multi-linear algebra concepts based on the application of 3-linear forms (i.e., Canonical Trilinear (Tr), Trilinear Cubic (TrC), Trilinear-Quadratic-Bilinear (TrQB) and so on) as a specific case of the N-linear algebraic forms. The definition of the k 3-tuple similarity-dissimilarity spatial matrices (Tensor's Form) are used for the transformation and for the representation of the existing chemical information available in the relationships between three amino acids of a protein. Several metrics (Minkowski-type, wave-edge, etc) and multi-metrics (Triangle area, Bond-angle, etc) are proposed for the interaction information extraction, as well as probabilistic transformations (e.g., simple stochastic and mutual probability) to achieve matrix normalization. A generalized procedure considering amino acid level-based indices that can be fused together by using aggregator operators for descriptors calculations is proposed. The obtained results demonstrated that the new proposed 3D biomacro-molecular indices perform better than other approaches in the SCOP-based discrimination and the prediction of folding rate of proteins by using simple linear parametrical models. It can be concluded that the proposed method allows the definition of 3D biomacro-molecular descriptors that contain orthogonal information capable of providing better models for applications in protein science.

Citing Articles

An overview of descriptors to capture protein properties - Tools and perspectives in the context of QSAR modeling.

Emonts J, Buyel J Comput Struct Biotechnol J. 2024; 21:3234-3247.

PMID: 38213891 PMC: 10781719. DOI: 10.1016/j.csbj.2023.05.022.


Fuzzy spherical truncation-based multi-linear protein descriptors: From their definition to application in structural-related predictions.

Contreras-Torres E, Marrero-Ponce Y, Teran J, Aguero-Chapin G, Antunes A, Garcia-Jacas C Front Chem. 2022; 10:959143.

PMID: 36277354 PMC: 9585278. DOI: 10.3389/fchem.2022.959143.


Graph Theory-Based Sequence Descriptors as Remote Homology Predictors.

Aguero-Chapin G, Galpert D, Molina-Ruiz R, Ancede-Gallardo E, Perez-Machado G, de la Riva G Biomolecules. 2019; 10(1).

PMID: 31878100 PMC: 7022958. DOI: 10.3390/biom10010026.

References
1.
Mishra A, Rana P, Mittal A, Jayaram B . D2N: Distance to the native. Biochim Biophys Acta. 2014; 1844(10):1798-807. DOI: 10.1016/j.bbapap.2014.07.010. View

2.
Chen K, Kurgan L, Ruan J . Prediction of protein structural class using novel evolutionary collocation-based sequence representation. J Comput Chem. 2008; 29(10):1596-604. DOI: 10.1002/jcc.20918. View

3.
Chou K . Using amphiphilic pseudo amino acid composition to predict enzyme subfamily classes. Bioinformatics. 2004; 21(1):10-9. DOI: 10.1093/bioinformatics/bth466. View

4.
Klein D, Palacios J, Randic M, Trinajstic N . Random walks and chemical graph theory. J Chem Inf Comput Sci. 2004; 44(5):1521-5. DOI: 10.1021/ci040100e. View

5.
Cai Y, Liu X, Xu X, Chou K . Prediction of protein structural classes by support vector machines. Comput Chem. 2002; 26(3):293-6. DOI: 10.1016/s0097-8485(01)00113-9. View