Flexible Protein Database Based on Amino Acid K-mers
Protein identification is one of the most computationally intensive steps in genomics studies. It usually relies on aligners that cannot store rich annotations alongside the proteins they search, so additional pipeline steps are needed to turn alignment hits into identified, annotated proteins. We introduce kAAmer, a protein database engine based on amino-acid k-mers that identifies proteins efficiently while supporting flexible annotations on those proteins. The database is also designed to run as a microservice that can be hosted and queried remotely.
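To make the idea of a k-mer based protein database concrete, here is a minimal Python sketch of an inverted index from amino-acid k-mers to protein identifiers, with free-form annotations kept next to each protein. It is an illustration under stated assumptions, not kAAmer's actual implementation: the k-mer length `K`, the function names `kmers`, `build_index`, and `query`, the toy sequences, and the annotation fields are all hypothetical, and ranking by the count of shared k-mers is a deliberately simple stand-in for whatever scoring the real engine uses.

```python
from collections import Counter, defaultdict

K = 7  # assumed k-mer length, chosen only for this illustration


def kmers(seq, k=K):
    """Yield every overlapping k-mer of an amino-acid sequence."""
    seq = seq.upper()
    for i in range(len(seq) - k + 1):
        yield seq[i:i + k]


def build_index(proteins, k=K):
    """Map each k-mer to the set of protein IDs that contain it."""
    index = defaultdict(set)
    for pid, seq in proteins.items():
        for kmer in kmers(seq, k):
            index[kmer].add(pid)
    return index


def query(index, seq, k=K):
    """Rank proteins by how many distinct query k-mers they share."""
    hits = Counter()
    for kmer in set(kmers(seq, k)):
        # .get avoids creating empty entries in the defaultdict
        for pid in index.get(kmer, ()):
            hits[pid] += 1
    return hits.most_common()


# Toy database: made-up sequences and made-up annotations.
proteins = {
    "protA": "MKTAYIAKQRQISFVKSHFSRQ",
    "protB": "MSDNGPQNQRNAPRITFGGPSD",
}
annotations = {
    "protA": {"function": "hypothetical example A"},
    "protB": {"function": "hypothetical example B"},
}

index = build_index(proteins)
results = query(index, "AYIAKQRQISF")
for pid, shared in results:
    print(pid, shared, annotations[pid])
```

Because annotations live in the same store as the sequences, a hit can be returned together with its metadata in one lookup, which is the property the abstract contrasts with aligner-based pipelines. Exposing `query` behind an HTTP endpoint would be one way to approximate the microservice usage described above.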