ISulfoTyr-PseAAC: Identify Tyrosine Sulfation Sites by Incorporating Statistical Moments Chou's 5-steps Rule and Pseudo Components
Overview
Affiliations
Background: The amino acid residues, in protein, undergo post-translation modification (PTM) during protein synthesis, a process of chemical and physical change in an amino acid that in turn alters behavioral properties of proteins. Tyrosine sulfation is a ubiquitous posttranslational modification which is known to be associated with regulation of various biological functions and pathological pro-cesses. Thus its identification is necessary to understand its mechanism. Experimental determination through site-directed mutagenesis and high throughput mass spectrometry is a costly and time taking process, thus, the reliable computational model is required for identification of sulfotyrosine sites.
Methodology: In this paper, we present a computational model for the prediction of the sulfotyrosine sites named iSulfoTyr-PseAAC in which feature vectors are constructed using statistical moments of protein amino acid sequences and various position/composition relative features. These features are in-corporated into PseAAC. The model is validated by jackknife, cross-validation, self-consistency and in-dependent testing.
Results: Accuracy determined through validation was 93.93% for jackknife test, 95.16% for cross-validation, 94.3% for self-consistency and 94.3% for independent testing.
Conclusion: The proposed model has better performance as compared to the existing predictors, how-ever, the accuracy can be improved further, in future, due to increasing number of sulfotyrosine sites in proteins.
m1A-Ensem: accurate identification of 1-methyladenosine sites through ensemble models.
Suleman M, Alturise F, Alkhalifah T, Khan Y BioData Min. 2024; 17(1):4.
PMID: 38360720 PMC: 10868122. DOI: 10.1186/s13040-023-00353-x.
Naseem A, Alturise F, Alkhalifah T, Khan Y J Cheminform. 2023; 15(1):110.
PMID: 37980534 PMC: 10656963. DOI: 10.1186/s13321-023-00773-1.
Alotaibi F, Khan Y Diagnostics (Basel). 2023; 13(13).
PMID: 37443684 PMC: 10340236. DOI: 10.3390/diagnostics13132291.
Hassan Butt A, Alkhalifah T, Alturise F, Khan Y Diagnostics (Basel). 2023; 13(11).
PMID: 37296792 PMC: 10252793. DOI: 10.3390/diagnostics13111940.
iDHU-Ensem: Identification of dihydrouridine sites through ensemble learning models.
Suleman M, Alturise F, Alkhalifah T, Khan Y Digit Health. 2023; 9:20552076231165963.
PMID: 37009307 PMC: 10064468. DOI: 10.1177/20552076231165963.