Nonlinear Methods in the Analysis of Protein Sequences: a Case Study in Rubredoxins
Overview
Affiliations
Two computational methods widely used in time series analysis were applied to protein sequences, and their ability to derive structural information not directly accessible through classical sequence comparisons methods was assessed. The primary structures of 19 rubredoxins of both mesophilic and thermophilic bacteria, coded with hydrophobicity values of amino acid residues, were considered as time series and were analyzed by 1) recurrence quantification analysis and 2) spectral analysis of the sequence major eigenfunctions. The results of the two methods agreed to a large extent and generated a classification consistent with known 3D structural characteristics of the studied proteins. This classification separated in a clearcut manner a thermophilic protein from mesophilic proteins. The classification of primary structures given by the two dynamical methods was demonstrated to be basically different from classification stemming from classical sequence homology metrics. Moreover, on a more detailed scale, the method was able to discriminate between thermophilic and mesophilic proteins from a set of chimeric sequences generated from the mixing of a mesophilic (Rubr Clopa) and a thermophilic (Rubr Pyrfu) protein. Overall, our results point to a new way of looking at protein sequence comparisons.
Scikit-Dimension: A Python Package for Intrinsic Dimension Estimation.
Bac J, Mirkes E, Gorban A, Tyukin I, Zinovyev A Entropy (Basel). 2021; 23(10).
PMID: 34682092 PMC: 8534554. DOI: 10.3390/e23101368.
Karain W BMC Bioinformatics. 2017; 18(1):525.
PMID: 29179670 PMC: 5704401. DOI: 10.1186/s12859-017-1943-y.
Quantiprot - a Python package for quantitative analysis of protein sequences.
Konopka B, Marciniak M, Dyrka W BMC Bioinformatics. 2017; 18(1):339.
PMID: 28716000 PMC: 5512976. DOI: 10.1186/s12859-017-1751-4.
Andrissi L, Petraglia F, Giuliani A, Severi F, Angioni S, Valensise H PLoS One. 2015; 10(4):e0124353.
PMID: 25905494 PMC: 4408047. DOI: 10.1371/journal.pone.0124353.
Response, use and habituation to a mouse house in C57BL/6J and BALB/c mice.
Wirz A, Mandillo S, DAmato F, Giuliani A, Riviello M Exp Anim. 2015; 64(3):281-93.
PMID: 25854626 PMC: 4548001. DOI: 10.1538/expanim.14-0104.