» Articles » PMID: 21541071

ModEnzA: Accurate Identification of Metabolic Enzymes Using Function Specific Profile HMMs with Optimised Discrimination Threshold and Modified Emission Probabilities

Overview
Specialty Biology
Date 2011 May 5
PMID 21541071
Citations 11
Authors
Affiliations
Soon will be listed here.
Abstract

Various enzyme identification protocols involving homology transfer by sequence-sequence or profile-sequence comparisons have been devised which utilise Swiss-Prot sequences associated with EC numbers as the training set. A profile HMM constructed for a particular EC number might select sequences which perform a different enzymatic function due to the presence of certain fold-specific residues which are conserved in enzymes sharing a common fold. We describe a protocol, ModEnzA (HMM-ModE Enzyme Annotation), which generates profile HMMs highly specific at a functional level as defined by the EC numbers by incorporating information from negative training sequences. We enrich the training dataset by mining sequences from the NCBI Non-Redundant database for increased sensitivity. We compare our method with other enzyme identification methods, both for assigning EC numbers to a genome as well as identifying protein sequences associated with an enzymatic activity. We report a sensitivity of 88% and specificity of 95% in identifying EC numbers and annotating enzymatic sequences from the E. coli genome which is higher than any other method. With the next-generation sequencing methods producing a huge amount of sequence data, the development and use of fully automated yet accurate protocols such as ModEnzA is warranted for rapid annotation of newly sequenced genomes and metagenomic sequences.

Citing Articles

Accurately predicting enzyme functions through geometric graph learning on ESMFold-predicted structures.

Song Y, Yuan Q, Chen S, Zeng Y, Zhao H, Yang Y Nat Commun. 2024; 15(1):8180.

PMID: 39294165 PMC: 11411130. DOI: 10.1038/s41467-024-52533-w.


DeepTM: A deep learning algorithm for prediction of melting temperature of thermophilic proteins directly from sequences.

Li M, Wang H, Yang Z, Zhang L, Zhu Y Comput Struct Biotechnol J. 2023; 21:5544-5560.

PMID: 38034401 PMC: 10681957. DOI: 10.1016/j.csbj.2023.11.006.


Evidential deep learning for trustworthy prediction of enzyme commission number.

Han S, Park M, Kosaraju S, Lee J, Lee H, Lee J Brief Bioinform. 2023; 25(1).

PMID: 37991247 PMC: 10664415. DOI: 10.1093/bib/bbad401.


Implementation of homology based and non-homology based computational methods for the identification and annotation of orphan enzymes: using Mycobacterium tuberculosis H37Rv as a case study.

Sinha S, Lynn A, Desai D BMC Bioinformatics. 2020; 21(1):466.

PMID: 33076816 PMC: 7574302. DOI: 10.1186/s12859-020-03794-x.


Genomics-driven discovery of a biosynthetic gene cluster required for the synthesis of BII-Rafflesfungin from the fungus Phoma sp. F3723.

Sinha S, Nge C, Leong C, Ng V, Crasta S, Alfatah M BMC Genomics. 2019; 20(1):374.

PMID: 31088369 PMC: 6518819. DOI: 10.1186/s12864-019-5762-6.


References
1.
Kelley B, Yuan B, Lewitter F, Sharan R, Stockwell B, Ideker T . PathBLAST: a tool for alignment of protein interaction networks. Nucleic Acids Res. 2004; 32(Web Server issue):W83-8. PMC: 441549. DOI: 10.1093/nar/gkh411. View

2.
Kanehisa M, Goto S, Hattori M, Aoki-Kinoshita K, Itoh M, Kawashima S . From genomics to chemical genomics: new developments in KEGG. Nucleic Acids Res. 2005; 34(Database issue):D354-7. PMC: 1347464. DOI: 10.1093/nar/gkj102. View

3.
Gianoulis T, Raes J, Patel P, Bjornson R, Korbel J, Letunic I . Quantifying environmental adaptation of metabolic pathways in metagenomics. Proc Natl Acad Sci U S A. 2009; 106(5):1374-9. PMC: 2629784. DOI: 10.1073/pnas.0808022106. View

4.
Bahl A, Brunk B, Crabtree J, Fraunholz M, Gajria B, Grant G . PlasmoDB: the Plasmodium genome resource. A database integrating experimental and computational data. Nucleic Acids Res. 2003; 31(1):212-5. PMC: 165528. DOI: 10.1093/nar/gkg081. View

5.
Anishetty S, Pulimi M, Pennathur G . Potential drug targets in Mycobacterium tuberculosis through metabolic pathway analysis. Comput Biol Chem. 2005; 29(5):368-78. DOI: 10.1016/j.compbiolchem.2005.07.001. View