
DFAST: a Flexible Prokaryotic Genome Annotation Pipeline for Faster Genome Publication

Overview
Journal: Bioinformatics
Specialty: Biology
Date: 2017 Nov 7
PMID: 29106469
Citations: 615
Abstract

Summary: We developed a prokaryotic genome annotation pipeline, DFAST, that also supports genome submission to public sequence databases. DFAST started as an on-line annotation server and has processed over 7000 jobs since its first launch in 2016. Here, we present a newly implemented background annotation engine for DFAST, which is also available as a standalone command-line program. The new engine can annotate a typical-sized bacterial genome within 10 min and provides rich information such as pseudogenes, translation exceptions and orthologous gene assignment between given reference genomes. In addition, the modular framework of DFAST allows users to customize the annotation workflow easily and will facilitate future extensions for new functions and the incorporation of new tools.
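The idea of a modular, customizable annotation workflow can be illustrated with a minimal Python sketch. This is not DFAST's actual code or API: the `Genome` class, the two step functions, `STEP_REGISTRY`, and `run_workflow` are all hypothetical names invented for illustration, and the "annotation" is a toy search for ATG triplets. It only shows how a workflow expressed as an ordered list of step names lets a user drop, reorder, or add tools without touching the pipeline driver.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Genome:
    """Toy container: a sequence plus the annotations accumulated so far."""
    sequence: str
    features: List[Dict[str, object]] = field(default_factory=list)


# A step takes a Genome, adds or edits features, and returns it.
Step = Callable[[Genome], Genome]


def find_start_codons(genome: Genome) -> Genome:
    """Hypothetical structural step: record every ATG position as a feature."""
    pos = genome.sequence.find("ATG")
    while pos != -1:
        genome.features.append({"type": "start_codon", "start": pos})
        pos = genome.sequence.find("ATG", pos + 1)
    return genome


def label_features(genome: Genome) -> Genome:
    """Hypothetical functional step: attach a placeholder label to each feature."""
    for index, feature in enumerate(genome.features, start=1):
        feature["label"] = f"feature_{index:04d}"
    return genome


# Registry of available steps; a new tool is added by registering one function.
STEP_REGISTRY: Dict[str, Step] = {
    "find_start_codons": find_start_codons,
    "label_features": label_features,
}


def run_workflow(genome: Genome, step_names: List[str]) -> Genome:
    """Run the named steps in order; the list acts as the workflow configuration."""
    for name in step_names:
        genome = STEP_REGISTRY[name](genome)
    return genome


if __name__ == "__main__":
    result = run_workflow(Genome("CCATGAAATGTT"), ["find_start_codons", "label_features"])
    for feature in result.features:
        print(feature)
```

In this sketch, customizing the workflow amounts to editing the list of step names passed to `run_workflow`; omitting `"label_features"`, for example, leaves the features unlabeled.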

Availability and Implementation: The software is implemented in Python 3 and runs in both Python 2.7 and Python 3.4 or later on Macintosh and Linux systems. It is freely available at https://github.com/nigyta/dfast_core/ under the GPLv3 license, with external binaries bundled in the software distribution. An on-line version is also available at https://dfast.nig.ac.jp/.

Contact: yn@nig.ac.jp.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Citing Articles

Genomic and pathogenicity analyses to identify the causative agent from multiple serogroups of non-O1, non-O139 in foodborne outbreaks.

Morita M, Hiyoshi H, Arakawa E, Izumiya H, Ohnishi M, Ogata K Microb Genom. 2025; 11(2).

PMID: 40009544 PMC: 11865499. DOI: 10.1099/mgen.0.001364.


Beyond Low Prevalence: Exploring Antibiotic Resistance and Virulence Profiles in Sri Lankan Helicobacter pylori with Comparative Genomics.

Fauzia K, Rathnayake J, Doohan D, Lamawansa M, Alfaray R, Batsaikhan S Microorganisms. 2025; 13(2).

PMID: 40005785 PMC: 11858055. DOI: 10.3390/microorganisms13020420.


Honey-derived spp. with potential to affect bee brood development in : Are they a new threat to honey bees?

Nakamura K, Okamoto M, Mada T, Harada M, Okumura K, Takamatsu D Virulence. 2025; 16(1):2451170.

PMID: 39954288 PMC: 11834430. DOI: 10.1080/21505594.2025.2451170.


Complete genome sequence of : first isolate of a human clinical specimen from Japan.

Hayashi M, Yonetamari J, Muto Y, Tanaka K Microbiol Resour Announc. 2025; 14(3):e0125224.

PMID: 39918336 PMC: 11895464. DOI: 10.1128/mra.01252-24.


sp. nov., a pathogen causing red stripe of sugarcane in Japan.

Sawada H, Shinohara H, Takashima Y, Naito K, Satou M Int J Syst Evol Microbiol. 2025; 75(2).

PMID: 39907557 PMC: 11797039. DOI: 10.1099/ijsem.0.006575.

