» Articles » PMID: 36960290

The -mer Antibiotic Resistance Gene Variant Analyzer (KARGVA)

Overview
Journal Front Microbiol
Specialty Microbiology
Date 2023 Mar 24
PMID 36960290
Authors
Affiliations
Soon will be listed here.
Abstract

Characterization of antibiotic resistance genes (ARGs) from high-throughput sequencing data of metagenomics and cultured bacterial samples is a challenging task, with the need to account for both computational (e.g., string algorithms) and biological (e.g., gene transfers, rearrangements) aspects. Curated ARG databases exist together with assorted ARG classification approaches (e.g., database alignment, machine learning). Besides ARGs that naturally occur in bacterial strains or are acquired through mobile elements, there are chromosomal genes that can render a bacterium resistant to antibiotics through point mutations, i.e., ARG variants (ARGVs). While ARG repositories also collect ARGVs, there are only a few tools that are able to identify ARGVs from metagenomics and high throughput sequencing data, with a number of limitations (e.g., pre-assembly, verification of mutations, or specification of species). In this work we present the -mer, i.e., strings of fixed length , ARGV analyzer - KARGVA - an open-source, multi-platform tool that provides: (i) an , large ARGV database derived from multiple sources; (ii) input capability for various types of high-throughput sequencing data; (iii) a three-way, hash-based, -mer search setup to process data efficiently, linking -mers to ARGVs, -mers to point mutations, and ARGVs to -mers, respectively; (iv) a statistical filter on sequence classification to reduce type I and II errors. On semi-synthetic data, KARGVA provides very high accuracy even in presence of high sequencing errors or mutations (99.2 and 86.6% accuracy within 1 and 5% base change rates, respectively), and genome rearrangements (98.2% accuracy), with robust performance on false positive sets. On data from the worldwide MetaSUB consortium, comprising 3,700+ metagenomics experiments, KARGVA identifies more ARGVs than Resistance Gene Identifier (4.8x) and PointFinder (6.8x), yet all predictions are below the expected false positive estimates. The prevalence of ARGVs is correlated to ARGs but ecological characteristics do not explain well ARGV variance. KARGVA is publicly available at https://github.com/DataIntellSystLab/KARGVA under MIT license.

Citing Articles

Forecasting dominance of SARS-CoV-2 lineages by anomaly detection using deep AutoEncoders.

Rancati S, Nicora G, Prosperi M, Bellazzi R, Salemi M, Marini S Brief Bioinform. 2024; 25(6).

PMID: 39446192 PMC: 11500442. DOI: 10.1093/bib/bbae535.


OCTOPUS: Disk-based, Multiplatform, Mobile-friendly Metagenomics Classifier.

Marini S, Barquero A, Wadhwani A, Bian J, Ruiz J, Boucher C bioRxiv. 2024; .

PMID: 38559026 PMC: 10979967. DOI: 10.1101/2024.03.15.585215.


Blanket antimicrobial resistance gene database with structural information, BOARDS, provides insights on historical landscape of resistance prevalence and effects of mutations in enzyme structure.

Ko S, Kim J, Lim J, Lee S, Park J, Woo J mSystems. 2023; 9(1):e0094323.

PMID: 38085058 PMC: 10871167. DOI: 10.1128/msystems.00943-23.


Forecasting dominance of SARS-CoV-2 lineages by anomaly detection using deep AutoEncoders.

Rancati S, Nicora G, Prosperi M, Bellazzi R, Salemi M, Marini S bioRxiv. 2023; .

PMID: 37961168 PMC: 10634784. DOI: 10.1101/2023.10.24.563721.


MEGARes and AMR++, v3.0: an updated comprehensive database of antimicrobial resistance determinants and an improved software pipeline for classification using high-throughput sequencing.

Bonin N, Doster E, Worley H, Pinnell L, Bravo J, Ferm P Nucleic Acids Res. 2022; 51(D1):D744-D752.

PMID: 36382407 PMC: 9825433. DOI: 10.1093/nar/gkac1047.

References
1.
Bonin N, Doster E, Worley H, Pinnell L, Bravo J, Ferm P . MEGARes and AMR++, v3.0: an updated comprehensive database of antimicrobial resistance determinants and an improved software pipeline for classification using high-throughput sequencing. Nucleic Acids Res. 2022; 51(D1):D744-D752. PMC: 9825433. DOI: 10.1093/nar/gkac1047. View

2.
de Abreu V, Perdigao J, Almeida S . Metagenomic Approaches to Analyze Antimicrobial Resistance: An Overview. Front Genet. 2021; 11:575592. PMC: 7848172. DOI: 10.3389/fgene.2020.575592. View

3.
Arango-Argoty G, Garner E, Pruden A, Heath L, Vikesland P, Zhang L . DeepARG: a deep learning approach for predicting antibiotic resistance genes from metagenomic data. Microbiome. 2018; 6(1):23. PMC: 5796597. DOI: 10.1186/s40168-018-0401-z. View

4.
Alcock B, Raphenya A, Lau T, Tsang K, Bouchard M, Edalatmand A . CARD 2020: antibiotic resistome surveillance with the comprehensive antibiotic resistance database. Nucleic Acids Res. 2019; 48(D1):D517-D525. PMC: 7145624. DOI: 10.1093/nar/gkz935. View

5.
Sayers E, Beck J, Brister J, Bolton E, Canese K, Comeau D . Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. 2019; 48(D1):D9-D16. PMC: 6943063. DOI: 10.1093/nar/gkz899. View