» Articles » PMID: 35650262

Flexible Protein Database Based on Amino Acid K-mers

Overview
Journal Sci Rep
Specialty Science
Date 2022 Jun 1
PMID 35650262
Authors
Affiliations
Soon will be listed here.
Abstract

Identification of proteins is one of the most computationally intensive steps in genomics studies. It usually relies on aligners that do not accommodate rich information on proteins and require additional pipelining steps for protein identification. We introduce kAAmer, a protein database engine based on amino-acid k-mers that provides efficient identification of proteins while supporting the incorporation of flexible annotations on these proteins. Moreover, the database is built to be used as a microservice, to be hosted and queried remotely.

Citing Articles

Missing microbial eukaryotes and misleading meta-omic conclusions.

Krinos A, Mars Brisbin M, Hu S, Cohen N, Rynearson T, Follows M Nat Commun. 2024; 15(1):9873.

PMID: 39543100 PMC: 11564645. DOI: 10.1038/s41467-024-52212-w.


aaHash: recursive amino acid sequence hashing.

Wong J, Kazemi P, Coombe L, Warren R, Birol I Bioinform Adv. 2023; 3(1):vbad162.

PMID: 38023332 PMC: 10660294. DOI: 10.1093/bioadv/vbad162.

References
1.
Priyam A, Woodcroft B, Rai V, Moghul I, Munagala A, Ter F . Sequenceserver: A Modern Graphical User Interface for Custom BLAST Databases. Mol Biol Evol. 2019; 36(12):2922-2924. PMC: 6878946. DOI: 10.1093/molbev/msz185. View

2.
Steinegger M, Soding J . MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets. Nat Biotechnol. 2017; 35(11):1026-1028. DOI: 10.1038/nbt.3988. View

3.
Mitchell A, Almeida A, Beracochea M, Boland M, Burgin J, Cochrane G . MGnify: the microbiome analysis resource in 2020. Nucleic Acids Res. 2019; 48(D1):D570-D578. PMC: 7145632. DOI: 10.1093/nar/gkz1035. View

4.
Alcock B, Raphenya A, Lau T, Tsang K, Bouchard M, Edalatmand A . CARD 2020: antibiotic resistome surveillance with the comprehensive antibiotic resistance database. Nucleic Acids Res. 2019; 48(D1):D517-D525. PMC: 7145624. DOI: 10.1093/nar/gkz935. View

5.
Zankari E, Hasman H, Cosentino S, Vestergaard M, Rasmussen S, Lund O . Identification of acquired antimicrobial resistance genes. J Antimicrob Chemother. 2012; 67(11):2640-4. PMC: 3468078. DOI: 10.1093/jac/dks261. View