TY - JOUR
T1 - Tensor Algebra-based Geometrical (3D) Biomacro-Molecular Descriptors for Protein Research
T2 - Theory, Applications and Comparison with other Methods
AU - Terán, Julio E.
AU - Marrero-Ponce, Yovani
AU - Contreras-Torres, Ernesto
AU - García-Jacas, César R.
AU - Vivas-Reyes, Ricardo
AU - Terán, Enrique
AU - Torres, F. Javier
N1 - Funding Information:
Yovani Marrero-Ponce (M.-P., Y) thanks to the program Profesor convitado for a post-doctoral fellowship to work at Valencia University in 2018–2019. M-P, Y acknowledges the support from USFQ “Chancellor Grant 2017– 2018 (Project ID11192)”. C.R.G.J. acknowledges the support from “Consejo Nacional de Ciencia y Tecnología (CONACYT)” for the endowed chair 501/2018 at “Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE)”. F. Javier Torres thank USFQ POLY-GRANTS program for financial support. The present study has been performed by employing the resources of the USFQ’s High Performance Computing System (HPC-USFQ).
Publisher Copyright:
© 2019, The Author(s).
PY - 2019/12/1
Y1 - 2019/12/1
N2 - In this report, a new type of tridimensional (3D) biomacro-molecular descriptors for proteins are proposed. These descriptors make use of multi-linear algebra concepts based on the application of 3-linear forms (i.e., Canonical Trilinear (Tr), Trilinear Cubic (TrC), Trilinear-Quadratic-Bilinear (TrQB) and so on) as a specific case of the N-linear algebraic forms. The definition of the kth 3-tuple similarity-dissimilarity spatial matrices (Tensor’s Form) are used for the transformation and for the representation of the existing chemical information available in the relationships between three amino acids of a protein. Several metrics (Minkowski-type, wave-edge, etc) and multi-metrics (Triangle area, Bond-angle, etc) are proposed for the interaction information extraction, as well as probabilistic transformations (e.g., simple stochastic and mutual probability) to achieve matrix normalization. A generalized procedure considering amino acid level-based indices that can be fused together by using aggregator operators for descriptors calculations is proposed. The obtained results demonstrated that the new proposed 3D biomacro-molecular indices perform better than other approaches in the SCOP-based discrimination and the prediction of folding rate of proteins by using simple linear parametrical models. It can be concluded that the proposed method allows the definition of 3D biomacro-molecular descriptors that contain orthogonal information capable of providing better models for applications in protein science.
AB - In this report, a new type of tridimensional (3D) biomacro-molecular descriptors for proteins are proposed. These descriptors make use of multi-linear algebra concepts based on the application of 3-linear forms (i.e., Canonical Trilinear (Tr), Trilinear Cubic (TrC), Trilinear-Quadratic-Bilinear (TrQB) and so on) as a specific case of the N-linear algebraic forms. The definition of the kth 3-tuple similarity-dissimilarity spatial matrices (Tensor’s Form) are used for the transformation and for the representation of the existing chemical information available in the relationships between three amino acids of a protein. Several metrics (Minkowski-type, wave-edge, etc) and multi-metrics (Triangle area, Bond-angle, etc) are proposed for the interaction information extraction, as well as probabilistic transformations (e.g., simple stochastic and mutual probability) to achieve matrix normalization. A generalized procedure considering amino acid level-based indices that can be fused together by using aggregator operators for descriptors calculations is proposed. The obtained results demonstrated that the new proposed 3D biomacro-molecular indices perform better than other approaches in the SCOP-based discrimination and the prediction of folding rate of proteins by using simple linear parametrical models. It can be concluded that the proposed method allows the definition of 3D biomacro-molecular descriptors that contain orthogonal information capable of providing better models for applications in protein science.
UR - http://www.scopus.com/inward/record.url?scp=85070211701&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85070211701&partnerID=8YFLogxK
U2 - 10.1038/s41598-019-47858-2
DO - 10.1038/s41598-019-47858-2
M3 - Research Article
C2 - 31388082
AN - SCOPUS:85070211701
SN - 2045-2322
VL - 9
JO - Scientific Reports
JF - Scientific Reports
IS - 1
M1 - 11391
ER -