» Articles » PMID: 23657089

DDBJ Read Annotation Pipeline: a Cloud Computing-based Pipeline for High-throughput Analysis of Next-generation Sequencing Data

Overview
Journal DNA Res
Date 2013 May 10
PMID 23657089
Citations 34
Authors
Affiliations
Soon will be listed here.
Abstract

High-performance next-generation sequencing (NGS) technologies are advancing genomics and molecular biological research. However, the immense amount of sequence data requires computational skills and suitable hardware resources that are a challenge to molecular biologists. The DNA Data Bank of Japan (DDBJ) of the National Institute of Genetics (NIG) has initiated a cloud computing-based analytical pipeline, the DDBJ Read Annotation Pipeline (DDBJ Pipeline), for a high-throughput annotation of NGS reads. The DDBJ Pipeline offers a user-friendly graphical web interface and processes massive NGS datasets using decentralized processing by NIG supercomputers currently free of charge. The proposed pipeline consists of two analysis components: basic analysis for reference genome mapping and de novo assembly and subsequent high-level analysis of structural and functional annotations. Users may smoothly switch between the two components in the pipeline, facilitating web-based operations on a supercomputer for high-throughput data analysis. Moreover, public NGS reads of the DDBJ Sequence Read Archive located on the same supercomputer can be imported into the pipeline through the input of only an accession number. This proposed pipeline will facilitate research by utilizing unified analytical workflows applied to the NGS data. The DDBJ Pipeline is accessible at http://p.ddbj.nig.ac.jp/.

Citing Articles

Relationship between the Rod complex and peptidoglycan structure in Escherichia coli.

Ago R, Tahara Y, Yamaguchi H, Saito M, Ito W, Yamasaki K Microbiologyopen. 2023; 12(5):e1385.

PMID: 37877652 PMC: 10561026. DOI: 10.1002/mbo3.1385.


Lineage-specific, fast-evolving GATA-like gene regulates zygotic gene activation to promote endoderm specification and pattern formation in the Theridiidae spider.

Iwasaki-Yokozawa S, Nanjo R, Akiyama-Oda Y, Oda H BMC Biol. 2022; 20(1):223.

PMID: 36203191 PMC: 9535882. DOI: 10.1186/s12915-022-01421-0.


Metatranscriptomic Analysis of Corals Inoculated With Tolerant and Non-Tolerant Symbiont Exposed to High Temperature and Light Stress.

Yuyama I, Higuchi T, Mezaki T, Tashiro H, Ikeo K Front Physiol. 2022; 13:806171.

PMID: 35480050 PMC: 9037784. DOI: 10.3389/fphys.2022.806171.


Golgi-localized membrane protein AtTMN1/EMP12 functions in the deposition of rhamnogalacturonan II and I for cell growth in Arabidopsis.

Hiroguchi A, Sakamoto S, Mitsuda N, Miwa K J Exp Bot. 2021; 72(10):3611-3629.

PMID: 33587102 PMC: 8096605. DOI: 10.1093/jxb/erab065.


Expression of , the Flowering Inducer of Asiatic Hybrid Lily, in the Bulb Scales.

Kurokawa K, Kobayashi J, Nemoto K, Nozawa A, Sawasaki T, Nakatsuka T Front Plant Sci. 2020; 11:570915.

PMID: 33304361 PMC: 7693649. DOI: 10.3389/fpls.2020.570915.


References
1.
Li H, Durbin R . Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009; 25(14):1754-60. PMC: 2705234. DOI: 10.1093/bioinformatics/btp324. View

2.
Cochrane G, Karsch-Mizrachi I, Nakamura Y . The International Nucleotide Sequence Database Collaboration. Nucleic Acids Res. 2010; 39(Database issue):D15-8. PMC: 3013722. DOI: 10.1093/nar/gkq1150. View

3.
Grabherr M, Haas B, Yassour M, Levin J, Thompson D, Amit I . Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol. 2011; 29(7):644-52. PMC: 3571712. DOI: 10.1038/nbt.1883. View

4.
Li H, Durbin R . Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics. 2010; 26(5):589-95. PMC: 2828108. DOI: 10.1093/bioinformatics/btp698. View

5.
Li R, Yu C, Li Y, Lam T, Yiu S, Kristiansen K . SOAP2: an improved ultrafast tool for short read alignment. Bioinformatics. 2009; 25(15):1966-7. DOI: 10.1093/bioinformatics/btp336. View