» Articles » PMID: 25115450

Of Text and Gene--using Text Mining Methods to Uncover Hidden Knowledge in Toxicogenomics

Overview
Journal BMC Syst Biol
Publisher Biomed Central
Specialty Biology
Date 2014 Aug 14
PMID 25115450
Citations 6
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Toxicogenomics studies often profile gene expression from assays involving multiple doses and time points. The dose- and time-dependent pattern is of great importance to assess toxicity but computational approaches are lacking to effectively utilize this characteristic in toxicity assessment. Topic modeling is a text mining approach, but may be used analogously in toxicogenomics due to the similar data structures between text and gene dysregulation.

Results: Topic modeling was applied to a very large toxicogenomics dataset containing microarray gene expression data from >15,000 samples associated with 131 drugs tested in three different assay platforms (i.e., in vitro assay, in vivo repeated dose study and in vivo single dose experiment) with a design including multiple doses and time points. A set of "topics" which each consist of a set of genes was determined, by which the varying sensitivity of three assay systems was observed. We found that the drug-dependent effect was more pronounced in the two in vivo systems than the in vitro system, while the time-dependent effect was most strongly reflected in the in vitro system followed by the single dose study and lastly the repeated dose experiment. The dose-dependent effect was similar across three assay systems. Although the results indicated a challenge to extrapolate the in vitro results to the in vivo situation, we did notice that, for some drugs but not for all the drugs, the similarity in gene expression patterns was observed across all three assay systems, indicating a possibility of using in vitro systems with careful designs (such as the choice of dose and time point), to replace the in vivo testing strategy. Nonetheless, a potential to replace the repeated dose study by the single-dose short-term methodology was strongly implied.

Conclusions: The study demonstrated that text mining methodologies such as topic modeling provide an alternative method compared to traditional means for data reduction in toxicogenomics, enhancing researchers' capabilities to interpret biological information.

Citing Articles

A Novel Open Access Web Portal for Integrating Mechanistic and Toxicogenomic Study Results.

Sutherland J, Stevens J, Johnson K, Elango N, Webster Y, Mills B Toxicol Sci. 2019; 170(2):296-309.

PMID: 31020328 PMC: 6657575. DOI: 10.1093/toxsci/kfz101.


Transcriptional Responses Reveal Similarities Between Preclinical Rat Liver Testing Systems.

Liu Z, Delavan B, Roberts R, Tong W Front Genet. 2018; 9:74.

PMID: 29616076 PMC: 5870427. DOI: 10.3389/fgene.2018.00074.


Text mining for improved exposure assessment.

Larsson K, Baker S, Silins I, Guo Y, Stenius U, Korhonen A PLoS One. 2017; 12(3):e0173132.

PMID: 28257498 PMC: 5336247. DOI: 10.1371/journal.pone.0173132.


Application of dynamic topic models to toxicogenomics data.

Lee M, Liu Z, Huang R, Tong W BMC Bioinformatics. 2016; 17(Suppl 13):368.

PMID: 27766956 PMC: 5073961. DOI: 10.1186/s12859-016-1225-0.


An overview of topic modeling and its current applications in bioinformatics.

Liu L, Tang L, Dong W, Yao S, Zhou W Springerplus. 2016; 5(1):1608.

PMID: 27652181 PMC: 5028368. DOI: 10.1186/s40064-016-3252-8.


References
1.
Bisgin H, Liu Z, Fang H, Xu X, Tong W . Mining FDA drug labels using an unsupervised learning technique--topic modeling. BMC Bioinformatics. 2011; 12 Suppl 10:S11. PMC: 3236833. DOI: 10.1186/1471-2105-12-S10-S11. View

2.
Bisgin H, Liu Z, Kelly R, Fang H, Xu X, Tong W . Investigating drug repositioning opportunities in FDA drug labels through topic modeling. BMC Bioinformatics. 2012; 13 Suppl 15:S6. PMC: 3439728. DOI: 10.1186/1471-2105-13-S15-S6. View

3.
Chen M, Zhang M, Borlak J, Tong W . A decade of toxicogenomic research and its contribution to toxicological science. Toxicol Sci. 2012; 130(2):217-28. DOI: 10.1093/toxsci/kfs223. View

4.
Dai M, Wang P, Boyd A, Kostov G, Athey B, Jones E . Evolving gene/transcript definitions significantly alter the interpretation of GeneChip data. Nucleic Acids Res. 2005; 33(20):e175. PMC: 1283542. DOI: 10.1093/nar/gni179. View

5.
Uehara T, Ono A, Maruyama T, Kato I, Yamada H, Ohno Y . The Japanese toxicogenomics project: application of toxicogenomics. Mol Nutr Food Res. 2009; 54(2):218-27. DOI: 10.1002/mnfr.200900169. View