» Articles » PMID: 30345391

Open-access Bacterial Population Genomics: BIGSdb Software, the PubMLST.org Website and Their Applications

Overview
Date 2018 Oct 23
PMID 30345391
Citations 1359
Authors
Affiliations
Soon will be listed here.
Abstract

The PubMLST.org website hosts a collection of open-access, curated databases that integrate population sequence data with provenance and phenotype information for over 100 different microbial species and genera.  Although the PubMLST website was conceived as part of the development of the first multi-locus sequence typing (MLST) scheme in 1998 the software it uses, the Bacterial Isolate Genome Sequence database (BIGSdb, published in 2010), enables PubMLST to include all levels of sequence data, from single gene sequences up to and including complete, finished genomes.  Here we describe developments in the BIGSdb software made from publication to June 2018 and show how the platform realises microbial population genomics for a wide range of applications.  The system is based on the gene-by-gene analysis of microbial genomes, with each deposited sequence annotated and curated to identify the genes present and systematically catalogue their variation.  Originally intended as a means of characterising isolates with typing schemes, the synthesis of sequences and records of genetic variation with provenance and phenotype data permits highly scalable (whole genome sequence data for tens of thousands of isolates) means of addressing a wide range of functional questions, including: the prediction of antimicrobial resistance; likely cross-reactivity with vaccine antigens; and the functional activities of different variants that lead to key phenotypes.  There are no limitations to the number of sequences, genetic loci, allelic variants or schemes (combinations of loci) that can be included, enabling each database to represent an expanding catalogue of the genetic variation of the population in question.  In addition to providing web-accessible analyses and links to third-party analysis and visualisation tools, the BIGSdb software includes a RESTful application programming interface (API) that enables access to all the underlying data for third-party applications and data analysis pipelines.

Citing Articles

A secure visualization platform for pathogenic genome analysis with an accurate reference database.

Fan G, Guo C, Zhang Q, Liu D, Sun Q, Cui Z Biosaf Health. 2025; 6(4):235-243.

PMID: 40078665 PMC: 11894998. DOI: 10.1016/j.bsheal.2024.07.003.


Molecular Epidemiology of Serotype 1: A Systematic Review of Circulating Clones and Clonal Clusters.

Ntim O, Donkor E Int J Mol Sci. 2025; 26(5).

PMID: 40076900 PMC: 11900055. DOI: 10.3390/ijms26052266.


Screening and genomic evaluation of keratinolytic protease producing Chryseobacterium sp. from tannery waste and its potential application in dehairing of goat skin.

Akter T, Sarkar M, Sarker S, Tarannum N, Naser S, Chowdhury S J Genet Eng Biotechnol. 2025; 23(1):100458.

PMID: 40074432 PMC: 11787649. DOI: 10.1016/j.jgeb.2025.100458.


Identification of novel inhibitors targeting serine acetyltransferase from Neisseria gonorrhoeae.

Oldham K, Jiao W, Prentice E, Hicks J Comput Struct Biotechnol J. 2025; 27:682-691.

PMID: 40070520 PMC: 11894326. DOI: 10.1016/j.csbj.2025.02.015.


Genomic and phenotypic characterisation of isolates from canine otitis externa reveals high-risk sequence types identical to those found in human nosocomial infections.

Secker B, Shaw S, Hobley L, Atterbury R Front Microbiol. 2025; 16:1526843.

PMID: 40066269 PMC: 11891389. DOI: 10.3389/fmicb.2025.1526843.


References
1.
Yu Y, Hu W, Wu B, Zhang P, Chen J, Wang S . Vibrio parahaemolyticus isolates from southeastern Chinese coast are genetically diverse with circulation of clonal complex 3 strains since 2002. Foodborne Pathog Dis. 2011; 8(11):1169-76. DOI: 10.1089/fpd.2011.0865. View

2.
Bujan N, Balboa S, Romalde J, Toranzo A, Magarinos B . Population genetic and evolution analysis of controversial genus Edwardsiella by multilocus sequence typing. Mol Phylogenet Evol. 2018; 127:513-521. DOI: 10.1016/j.ympev.2018.05.006. View

3.
Pearce M, Alikhan N, Dallman T, Zhou Z, Grant K, Maiden M . Comparative analysis of core genome MLST and SNP typing within a European Salmonella serovar Enteritidis outbreak. Int J Food Microbiol. 2018; 274:1-11. PMC: 5899760. DOI: 10.1016/j.ijfoodmicro.2018.02.023. View

4.
Yang Y, Yu X, Zhan L, Chen J, Zhang Y, Zhang J . Multilocus sequence type profiles of Bacillus cereus isolates from infant formula in China. Food Microbiol. 2016; 62:46-50. DOI: 10.1016/j.fm.2016.09.007. View

5.
Jolley K, Maiden M . AgdbNet - antigen sequence database software for bacterial typing. BMC Bioinformatics. 2006; 7:314. PMC: 1543660. DOI: 10.1186/1471-2105-7-314. View