PMID: 21474551

CycADS: an Annotation Database System to Ease the Development and Update of BioCyc Databases

Abstract

In recent years, genomes from an increasing number of organisms have been sequenced, but their annotation remains a time-consuming process. The BioCyc databases offer a framework for the integrated analysis of metabolic networks. The Pathway Tools software suite allows the automated construction of a database starting from an annotated genome, but it requires prior integration of all annotations into a specific summary file or into a GenBank file. To allow the easy creation and update of a BioCyc database starting from the multiple genome annotation resources available over time, we have developed an ad hoc data management system that we called Cyc Annotation Database System (CycADS). CycADS is centred on a specific database model and on a set of Java programs to import, filter and export relevant information. Data from GenBank and other annotation sources (including, for example, KAAS, PRIAM, Blast2GO and PhylomeDB) are collected into a database and subsequently filtered and extracted to generate a complete annotation file. This file is then used to build an enriched BioCyc database using the PathoLogic program of Pathway Tools. The CycADS pipeline for annotation management was used to build the AcypiCyc database for the pea aphid (Acyrthosiphon pisum), whose genome was recently sequenced. For comparative analyses, the AcypiCyc database webpage also includes two other metabolic reconstruction BioCyc databases generated using CycADS: TricaCyc for Tribolium castaneum and DromeCyc for Drosophila melanogaster. Owing to its flexible design, CycADS offers a powerful software tool for the generation and regular updating of enriched BioCyc databases. The CycADS system is particularly suited for metabolic gene annotation and network reconstruction in newly sequenced genomes. Because it applies a uniform annotation for metabolic network reconstruction, CycADS is particularly useful for comparative analysis of the metabolism of different organisms.
Database URL: http://www.cycadsys.org.
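
The collect, filter and export steps described in the abstract can be illustrated with a minimal Python sketch. This is not the CycADS implementation (which the abstract describes as a database model plus a set of Java programs); it only shows the general shape of such a pipeline. The gene identifiers, the EC numbers assigned to them, the "at least two agreeing sources" rule and the function names are all assumptions made for illustration, and the tab-separated record layout ending in `//` is only loosely modelled on an attribute-value annotation file.

```python
from collections import defaultdict

# Hypothetical annotation rows: (gene_id, source, ec_number).
# The gene IDs and EC assignments are invented for this example.
ANNOTATIONS = [
    ("GENE0001", "KAAS", "2.7.1.1"),
    ("GENE0001", "PRIAM", "2.7.1.1"),
    ("GENE0001", "Blast2GO", "2.7.1.2"),
    ("GENE0002", "PRIAM", "1.1.1.1"),
]


def collect(rows):
    """Group annotations by gene, recording which sources support each EC number."""
    by_gene = defaultdict(lambda: defaultdict(set))
    for gene_id, source, ec in rows:
        by_gene[gene_id][ec].add(source)
    return by_gene


def filter_ecs(by_gene, min_sources=2):
    """Keep EC numbers supported by at least `min_sources` distinct sources.

    The threshold rule is an assumption, not the filter used by CycADS.
    """
    kept = {}
    for gene_id, ecs in by_gene.items():
        supported = sorted(
            ec for ec, sources in ecs.items() if len(sources) >= min_sources
        )
        if supported:
            kept[gene_id] = supported
    return kept


def export(kept):
    """Render one attribute-value record per gene, each terminated by '//'."""
    lines = []
    for gene_id in sorted(kept):
        lines.append(f"ID\t{gene_id}")
        lines.append("PRODUCT-TYPE\tP")
        for ec in kept[gene_id]:
            lines.append(f"EC\t{ec}")
        lines.append("//")
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(export(filter_ecs(collect(ANNOTATIONS))), end="")
```

Keeping the three stages as separate functions mirrors the import, filter and export split mentioned in the abstract: a new annotation source only adds rows to the collection step, and the filter can be changed without touching the export format.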

Citing Articles

PacBio Hi-Fi genome assembly of Sipha maydis, a model for the study of multipartite mutualism in insects.

Renoz F, Parisot N, Baa-Puyoulet P, Gerlin L, Fakhour S, Charles H. Sci Data. 2024; 11(1):450.

PMID: 38704391 PMC: 11069519. DOI: 10.1038/s41597-024-03297-x.


Bacteriocyte plasticity in pea aphids facing amino acid stress or starvation during development.

Ribeiro Lopes M, Gaget K, Renoz F, Duport G, Balmand S, Charles H. Front Physiol. 2022; 13:982920.

PMID: 36439244 PMC: 9685537. DOI: 10.3389/fphys.2022.982920.


The transposable element-rich genome of the cereal pest Sitophilus oryzae.

Parisot N, Vargas-Chavez C, Goubert C, Baa-Puyoulet P, Balmand S, Beranger L. BMC Biol. 2021; 19(1):241.

PMID: 34749730 PMC: 8576890. DOI: 10.1186/s12915-021-01158-2.


The genome sequence of the grape phylloxera provides insights into the evolution, adaptation, and invasion routes of an iconic pest.

Rispe C, Legeai F, Nabity P, Fernandez R, Arora A, Baa-Puyoulet P. BMC Biol. 2020; 18(1):90.

PMID: 32698880 PMC: 7376646. DOI: 10.1186/s12915-020-00820-5.


Sawfly Genomes Reveal Evolutionary Acquisitions That Fostered the Mega-Radiation of Parasitoid and Eusocial Hymenoptera.

Oeyen J, Baa-Puyoulet P, Benoit J, Beukeboom L, Bornberg-Bauer E, Buttstedt A. Genome Biol Evol. 2020; 12(7):1099-1188.

PMID: 32442304 PMC: 7455281. DOI: 10.1093/gbe/evaa106.

