
Adaptive Classifiers, Topic Drifts and GO Annotations

Overview
Date 2008 Aug 13
PMID 18693923
Citations 3
Abstract

Annotating genes with Gene Ontology (GO) codes gives scientists an important resource for studying genes and their functions. Automatic GO annotation methods have the potential to supplement the labor-intensive manual annotation process. Annotation approaches that use MEDLINE documents generally have two phases: the first annotates documents with GO codes, and the second annotates gene products via those documents. In this paper we study document annotation with GO codes from a temporal perspective. Specifically, we build adaptive, code-specific classifiers. We also study topic drift, i.e., changes in the contextual characteristics of annotations over time. We show that topic drift is significant, especially in the biological process GO hierarchy. This at least partially explains the particular challenges faced with codes of this hierarchy.
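
The abstract does not say how topic drift is measured, so the following is only an illustrative sketch, not the authors' method. It assumes drift for a single GO code can be approximated as one minus the cosine similarity between the aggregated term counts of documents annotated with that code in consecutive time periods. The function names (`term_profile`, `cosine_similarity`, `drift_by_period`) and the toy documents are hypothetical.

```python
from collections import Counter
from math import sqrt


def term_profile(docs):
    """Aggregate lowercase whitespace tokens of all documents into one term-count vector."""
    counts = Counter()
    for text in docs:
        counts.update(text.lower().split())
    return counts


def cosine_similarity(a, b):
    """Cosine similarity between two term-count vectors (0.0 if either is empty)."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def drift_by_period(docs_by_period):
    """Map each pair of consecutive periods to 1 - cosine similarity of their term profiles."""
    periods = sorted(docs_by_period)
    profiles = {p: term_profile(docs_by_period[p]) for p in periods}
    return {
        (prev, cur): 1.0 - cosine_similarity(profiles[prev], profiles[cur])
        for prev, cur in zip(periods, periods[1:])
    }


# Toy, invented documents for one hypothetical GO code, grouped by year.
toy = {
    2001: ["kinase signaling pathway", "kinase cascade signaling"],
    2002: ["kinase signaling pathway", "kinase cascade signaling"],
    2003: ["apoptosis caspase activation", "caspase cleavage apoptosis"],
}
drift = drift_by_period(toy)
```

In this toy data the vocabulary is identical in 2001 and 2002 and entirely different in 2003, so the drift score is near zero for the first pair and at its maximum for the second. Under this assumed measure, a classifier trained once on early data would be expected to degrade after such a shift, which is the motivation for retraining adaptively over time.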

Citing Articles

Studying the potential impact of automated document classification on scheduling a systematic review update.

Cohen A, Ambert K, McDonagh M. BMC Med Inform Decis Mak. 2012; 12:33.

PMID: 22515596 PMC: 3420236. DOI: 10.1186/1472-6947-12-33.


A Prospective Evaluation of an Automated Classification System to Support Evidence-based Medicine and Systematic Review.

Cohen A, Ambert K, McDonagh M. AMIA Annu Symp Proc. 2011; 2010:121-5.

PMID: 21346953 PMC: 3041348.


Cross-topic learning for work prioritization in systematic review creation and update.

Cohen A, Ambert K, McDonagh M. J Am Med Inform Assoc. 2009; 16(5):690-704.

PMID: 19567792 PMC: 2744720. DOI: 10.1197/jamia.M3162.
