Adaptive Classifiers, Topic Drifts and GO Annotations
Overview
Annotating genes with Gene Ontology (GO) codes gives scientists important options for studying genes and their functions. Automatic GO annotation methods have the potential to supplement the labor-intensive manual annotation process. Annotation approaches that use MEDLINE documents generally have two phases: the first annotates documents with GO codes, and the second annotates gene products via those documents. In this paper we study document annotation with GO codes from a temporal perspective. Specifically, we build adaptive code-specific classifiers. We also study topic drift, i.e., changes in the contextual characteristics of annotations over time. We show that topic drift is significant, especially in the biological process GO hierarchy. This at least partially explains the particular challenges faced with codes of this hierarchy.
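To make the notion of topic drift concrete, the following is a minimal sketch of one way such drift could be quantified for a single GO code. It is not the method used in the paper: the bag-of-words centroid, the cosine distance, and the names `centroid`, `cosine_distance`, and `drift_by_period` are all assumptions introduced here for illustration. The idea is to compare the average term profile of the documents annotated with a code in one time period against the profile from the preceding period.

```python
import math
from collections import Counter


def centroid(docs):
    """Average term-frequency vector over a list of tokenised documents.

    Hypothetical helper: each document is a list of string tokens.
    """
    if not docs:
        return {}
    total = Counter()
    for tokens in docs:
        total.update(tokens)
    return {term: count / len(docs) for term, count in total.items()}


def cosine_distance(a, b):
    """1 - cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(value * b.get(term, 0.0) for term, value in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        # Treat an empty period as maximally different (an assumption).
        return 1.0
    return 1.0 - dot / (norm_a * norm_b)


def drift_by_period(docs_by_period):
    """Map each period (except the first) to its distance from the previous one.

    docs_by_period: dict of sortable period key (e.g. a year) to a list of
    tokenised documents annotated with one GO code in that period.
    """
    periods = sorted(docs_by_period)
    centroids = [centroid(docs_by_period[p]) for p in periods]
    return {
        periods[i]: cosine_distance(centroids[i - 1], centroids[i])
        for i in range(1, len(periods))
    }
```

Under these assumptions, a value near 0 means the vocabulary surrounding a code is stable between consecutive periods, while a value near 1 means it has changed almost completely. A classifier that is retrained or reweighted on recent periods would be expected to matter most for codes with consistently high values.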