Tensor Algebra-based Geometrical (3D) Biomacro-Molecular Descriptors for Protein Research: Theory, Applications and Comparison with Other Methods
Authors
Affiliations
In this report, a new type of tridimensional (3D) biomacro-molecular descriptors for proteins are proposed. These descriptors make use of multi-linear algebra concepts based on the application of 3-linear forms (i.e., Canonical Trilinear (Tr), Trilinear Cubic (TrC), Trilinear-Quadratic-Bilinear (TrQB) and so on) as a specific case of the N-linear algebraic forms. The definition of the k 3-tuple similarity-dissimilarity spatial matrices (Tensor's Form) are used for the transformation and for the representation of the existing chemical information available in the relationships between three amino acids of a protein. Several metrics (Minkowski-type, wave-edge, etc) and multi-metrics (Triangle area, Bond-angle, etc) are proposed for the interaction information extraction, as well as probabilistic transformations (e.g., simple stochastic and mutual probability) to achieve matrix normalization. A generalized procedure considering amino acid level-based indices that can be fused together by using aggregator operators for descriptors calculations is proposed. The obtained results demonstrated that the new proposed 3D biomacro-molecular indices perform better than other approaches in the SCOP-based discrimination and the prediction of folding rate of proteins by using simple linear parametrical models. It can be concluded that the proposed method allows the definition of 3D biomacro-molecular descriptors that contain orthogonal information capable of providing better models for applications in protein science.
Emonts J, Buyel J Comput Struct Biotechnol J. 2024; 21:3234-3247.
PMID: 38213891 PMC: 10781719. DOI: 10.1016/j.csbj.2023.05.022.
Contreras-Torres E, Marrero-Ponce Y, Teran J, Aguero-Chapin G, Antunes A, Garcia-Jacas C Front Chem. 2022; 10:959143.
PMID: 36277354 PMC: 9585278. DOI: 10.3389/fchem.2022.959143.
Graph Theory-Based Sequence Descriptors as Remote Homology Predictors.
Aguero-Chapin G, Galpert D, Molina-Ruiz R, Ancede-Gallardo E, Perez-Machado G, de la Riva G Biomolecules. 2019; 10(1).
PMID: 31878100 PMC: 7022958. DOI: 10.3390/biom10010026.