PREDOSE: a Semantic Web Platform for Drug Abuse Epidemiology Using Social Media

Overview
Journal: J Biomed Inform
Publisher: Elsevier
Date: 2013 Jul 30
PMID: 23892295
Citations: 40
Abstract

Objectives: The role of social media in biomedical knowledge mining, including clinical, medical and healthcare informatics, prescription drug abuse epidemiology and drug pharmacology, has become increasingly significant in recent years. Social media offers opportunities for people to share opinions and experiences freely in online communities, which may contribute information beyond the knowledge of domain professionals. This paper describes the development of a novel semantic web platform called PREDOSE (PREscription Drug abuse Online Surveillance and Epidemiology), which is designed to facilitate the epidemiologic study of prescription (and related) drug abuse practices using social media. PREDOSE uses web forum posts and domain knowledge, modeled in a manually created Drug Abuse Ontology (DAO, pronounced "dow"), to facilitate the extraction of semantic information from User Generated Content (UGC) through a combination of lexical, pattern-based and semantics-based techniques. In a previous study, PREDOSE was used to obtain the datasets from which new knowledge in drug abuse research was derived. Here, we report on various platform enhancements, including an updated DAO, new components for relationship and triple extraction, and tools for content analysis, trend detection and emerging pattern exploration. Given these enhancements, PREDOSE is now better equipped to impact drug abuse research by alleviating traditional labor-intensive content analysis tasks.

Methods: Using custom web crawlers that scrape UGC from publicly available web forums, PREDOSE first automates the collection of web-based social media content for subsequent semantic annotation. The annotation scheme is modeled in the DAO, and includes domain-specific knowledge such as prescription (and related) drugs, methods of preparation, side effects, and routes of administration. The DAO is also used to help recognize three types of data, namely: (1) entities, (2) relationships and (3) triples. PREDOSE then uses a combination of lexical and semantics-based techniques to extract entities and relationships from the scraped content, and a top-down approach for triple extraction that uses patterns expressed in the DAO. In addition, PREDOSE uses publicly available lexicons to identify initial sentiment expressions in text, and then a probabilistic optimization algorithm (from related research) to extract the final sentiment expressions. Together, these techniques enable the capture of fine-grained semantic information, which facilitates search, trend analysis and overall content analysis of social media on prescription drug abuse. Moreover, extracted data are made available to domain experts for the creation of training and test sets for use in evaluating and refining the information extraction techniques.
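The entity and triple extraction steps described in the Methods can be illustrated with a small Python sketch. It is not the PREDOSE implementation: the lexicon entries, class names, predicate names (`administered_via`, `causes`) and the sentence-level co-occurrence rule are all assumptions made for illustration, standing in for the much richer DAO and its patterns. The sketch shows only the general idea of mapping slang terms to ontology concepts and then instantiating ontology-level patterns top-down.

```python
import re
from typing import NamedTuple

# Hypothetical mini-lexicon: surface form (including slang) -> (concept, class).
# These entries are illustrative and are NOT taken from the actual DAO.
LEXICON = {
    "buprenorphine": ("Buprenorphine", "Drug"),
    "bupe": ("Buprenorphine", "Drug"),
    "oxycodone": ("Oxycodone", "Drug"),
    "oxy": ("Oxycodone", "Drug"),
    "snorted": ("Insufflation", "RouteOfAdministration"),
    "injected": ("Injection", "RouteOfAdministration"),
    "nausea": ("Nausea", "SideEffect"),
}

# Hypothetical ontology-level patterns: (subject class, predicate, object class).
TRIPLE_PATTERNS = [
    ("Drug", "administered_via", "RouteOfAdministration"),
    ("Drug", "causes", "SideEffect"),
]


class Entity(NamedTuple):
    text: str     # surface form found in the post
    concept: str  # normalized concept name
    cls: str      # ontology class of the concept
    start: int    # character offset in the input string


def extract_entities(text):
    """Lexical spotting: look up each lowercase word token in the lexicon."""
    entities = []
    for match in re.finditer(r"[a-z]+", text.lower()):
        hit = LEXICON.get(match.group())
        if hit:
            entities.append(Entity(match.group(), hit[0], hit[1], match.start()))
    return entities


def extract_triples(post):
    """Top-down matching: for each pattern, pair entities of the required
    classes that co-occur in one sentence (a naive stand-in rule)."""
    triples = set()
    for sentence in re.split(r"[.!?]+", post):
        entities = extract_entities(sentence)
        for subj_cls, predicate, obj_cls in TRIPLE_PATTERNS:
            for subj in entities:
                for obj in entities:
                    if subj.cls == subj_cls and obj.cls == obj_cls:
                        triples.add((subj.concept, predicate, obj.concept))
    return sorted(triples)


post = "I snorted bupe last night. The oxy gave me nausea."
print(extract_triples(post))
```

Sentence-level co-occurrence over-generates on real forum text, since any drug mentioned near any side effect yields a triple. That is one plausible reason relationship and triple precision are far harder to raise than entity precision.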

Results: A recent evaluation of the information extraction techniques applied in the PREDOSE platform indicates 85% precision and 72% recall in entity identification on a manually created gold standard dataset. In another study, PREDOSE achieved 36% precision in relationship identification and 33% precision in triple extraction, through manual evaluation by domain experts. Given the complexity of the relationship and triple extraction tasks and the abstruse nature of social media texts, we interpret these as favorable initial results. Extracted semantic information is currently in use in an online discovery support system by prescription drug abuse researchers at the Center for Interventions, Treatment and Addictions Research (CITAR) at Wright State University.
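For readers less familiar with the metrics reported above, the following Python sketch shows how precision and recall are typically computed against a gold standard. The function names and the toy annotations are assumptions for illustration, not the authors' evaluation code or data. The F1 value at the end is derived here from the reported entity precision and recall; it is not a figure stated in the abstract.

```python
def precision_recall(predicted, gold):
    """Return (precision, recall) of predicted items against gold items."""
    predicted, gold = set(predicted), set(gold)
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    return precision, recall


def f1_score(precision, recall):
    """Harmonic mean of precision and recall."""
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0


# Toy annotations (hypothetical): (post id, concept) pairs.
gold = {(1, "Buprenorphine"), (1, "Insufflation"), (2, "Oxycodone"), (2, "Nausea")}
predicted = {(1, "Buprenorphine"), (2, "Oxycodone"), (2, "Injection")}

p, r = precision_recall(predicted, gold)
print(f"precision={p:.2f} recall={r:.2f}")

# F1 derived from the reported entity-identification figures (85% / 72%).
print(f"derived entity F1 = {f1_score(0.85, 0.72):.2f}")
```

Recall requires an exhaustively annotated gold standard. The abstract reports only precision for relationships and triples, which were judged manually by domain experts.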

Conclusion: A comprehensive platform for entity, relationship, triple and sentiment extraction from such abstruse texts has never been developed for drug abuse research. PREDOSE has already demonstrated the importance of mining social media by providing data from which new findings in drug abuse research were uncovered. Given the recent platform enhancements, including the refined DAO, components for relationship and triple extraction, and tools for content, trend and emerging pattern analysis, PREDOSE is expected to play a significant role in advancing drug abuse epidemiology in the future.

Citing Articles

Triangulating evidence in health sciences with Annotated Semantic Queries.

Liu Y, Gaunt T. Bioinformatics. 2024; 40(9).

PMID: 39171832 PMC: 11377847. DOI: 10.1093/bioinformatics/btae519.


Monitoring Adverse Drug Events in Web Forums: Evaluation of a Pipeline and Use Case Study.

Karapetiantz P, Audeh B, Redjdal A, Tiffet T, Bousquet C, Jaulent M. J Med Internet Res. 2024; 26:e46176.

PMID: 38888956 PMC: 11220433. DOI: 10.2196/46176.


Detecting Substance Use Disorder Using Social Media Data and the Dark Web: Time- and Knowledge-Aware Study.

Lokala U, Phukan O, Dastidar T, Lamy F, Daniulaityte R, Sheth A. JMIRx Med. 2024; 5:e48519.

PMID: 38717384 PMC: 11084118. DOI: 10.2196/48519.


Identifying drivers of COVID-19 vaccine sentiments for effective vaccination policy.

Sufi F, Alsulami M. Heliyon. 2023; 9(9):e19195.

PMID: 37681141 PMC: 10481186. DOI: 10.1016/j.heliyon.2023.e19195.


Methods for Analyzing the Contents of Social Media for Health Care: Scoping Review.

Fu J, Li C, Zhou C, Li W, Lai J, Deng S. J Med Internet Res. 2023; 25:e43349.

PMID: 37358900 PMC: 10337469. DOI: 10.2196/43349.

