The Co-regulation Data Harvester: Automating Gene Annotation Starting from a Transcriptome Database
Overview
Authors
Affiliations
Identifying co-regulated genes provides a useful approach for defining pathway-specific machinery in an organism. To be efficient, this approach relies on thorough genome annotation, a process much slower than genome sequencing , a unicellular eukaryote, has been a useful model organism and has a fully sequenced but sparsely annotated genome. One important resource for studying this organism has been an online transcriptomic database. We have developed an automated approach to gene annotation in the context of transcriptome data in , called the Co-regulation Data Harvester (CDH). Beginning with a gene of interest, the CDH identifies co-regulated genes by accessing the transcriptome database. It then identifies their closely related genes (orthologs) in other organisms by using reciprocal BLAST searches. Finally, it collates the annotations of those orthologs' functions, which provides the user with information to help predict the cellular role of the initial query. The CDH, which is freely available, represents a powerful new tool for analyzing cell biological pathways in . Moreover, to the extent that genes and pathways are conserved between organisms, the inferences obtained the CDH should be relevant, and can be explored, in many other systems.
Bertagna M, Bright L, Ye F, Jiang Y, Sarkar D, Pradhan A bioRxiv. 2024; .
PMID: 39713406 PMC: 11661410. DOI: 10.1101/2024.12.12.627356.
An apical membrane complex for triggering rhoptry exocytosis and invasion in Toxoplasma.
Sparvoli D, Delabre J, Penarete-Vargas D, Mageswaran S, Tsypin L, Heckendorn J EMBO J. 2022; 41(22):e111158.
PMID: 36245278 PMC: 9670195. DOI: 10.15252/embj.2022111158.
Proteins that control the geometry of microtubules at the ends of cilia.
Louka P, Vasudevan K, Guha M, Joachimiak E, Wloga D, Tomasi R J Cell Biol. 2018; 217(12):4298-4313.
PMID: 30217954 PMC: 6279374. DOI: 10.1083/jcb.201804141.