» Articles » PMID: 38364094

Annotating Microbial Functions with ProkFunFind

Overview
Journal mSystems
Specialty Microbiology
Date 2024 Feb 16
PMID 38364094
Authors
Affiliations
Soon will be listed here.
Abstract

Analyzing microbial genomes has become an essential part of microbiology research, giving valuable insights into the functions and evolution of microbial species. Identifying genes of interest and assigning putative annotations to those genes is a central task in genome analysis, and a plethora of tools and approaches have been developed for this task. The ProkFunFind tool was developed to bridge the gap between these various annotation approaches, providing a flexible and customizable search approach to annotate microbial functions. ProkFunFind is designed around hierarchical definitions of biological functions, where individual genes can be identified using heterogeneous search terms consisting of sequences, profile hidden Markov models, protein domains, and orthology groups. This flexible and customizable search approach allows for searches to be tailored to specific biological functions, and the search results are output in multiple formats to facilitate downstream analyses. The utility of the ProkFunFind search tool was demonstrated through its application in searching for bacterial flagella, which are complex organelles composed of multiple genes. Overall, ProkFunFind provides an accessible and flexible way to integrate multiple types of annotation and sequence data while annotating biological functions in microbial genomes.IMPORTANCEGenome sequencing and analysis are increasingly important parts of microbiology, providing a way to predict metabolic functions, identify virulence factors, and understand the evolution of microbes. The expanded use of genome sequencing has also brought an abundance of search and annotation methods, but integrating the information from these different methods can be challenging and is often done through approaches. To bridge the gap between different types of annotations, we developed ProkFunFind, a flexible and customizable search tool incorporating multiple search approaches and annotation types to annotate microbial functions. We demonstrated the utility of ProkFunFind by searching for gene clusters encoding flagellar genes using a combination of different annotation types and searches. Overall, ProkFunFind provides a reproducible and flexible way to identify gene clusters of interest, facilitating the meaningful analysis of new and existing microbial genomes.

Citing Articles

Convergent evolution of oxidized sugar metabolism in commensal and pathogenic microbes in the inflamed gut.

Levy S, Jiang A, Grant M, Arp G, Minabou Ndjite G, Jiang X Nat Commun. 2025; 16(1):1121.

PMID: 39875389 PMC: 11775122. DOI: 10.1038/s41467-025-56332-9.


Comprehensive Genomic and Evolutionary Analysis of Biofilm Matrix Clusters and Proteins in the Genus.

Yang Y, Yan J, Olson R, Jiang X bioRxiv. 2024; .

PMID: 39372729 PMC: 11451748. DOI: 10.1101/2024.08.19.608685.

References
1.
Cantalapiedra C, Hernandez-Plaza A, Letunic I, Bork P, Huerta-Cepas J . eggNOG-mapper v2: Functional Annotation, Orthology Assignments, and Domain Prediction at the Metagenomic Scale. Mol Biol Evol. 2021; 38(12):5825-5829. PMC: 8662613. DOI: 10.1093/molbev/msab293. View

2.
Cury J, Abby S, Doppelt-Azeroual O, Neron B, Rocha E . Identifying Conjugative Plasmids and Integrative Conjugative Elements with CONJscan. Methods Mol Biol. 2019; 2075:265-283. DOI: 10.1007/978-1-4939-9877-7_19. View

3.
Schwengers O, Jelonek L, Dieckmann M, Beyvers S, Blom J, Goesmann A . Bakta: rapid and standardized annotation of bacterial genomes via alignment-free sequence identification. Microb Genom. 2021; 7(11). PMC: 8743544. DOI: 10.1099/mgen.0.000685. View

4.
Kashyap P, Chia N, Nelson H, Segal E, Elinav E . Microbiome at the Frontier of Personalized Medicine. Mayo Clin Proc. 2017; 92(12):1855-1864. PMC: 5730337. DOI: 10.1016/j.mayocp.2017.10.004. View

5.
Letunic I, Bork P . Interactive Tree Of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 2021; 49(W1):W293-W296. PMC: 8265157. DOI: 10.1093/nar/gkab301. View