» Articles » PMID: 31199298

A Machine Learning Approach for the Detection and Characterization of Illicit Drug Dealers on Instagram: Model Evaluation Study

Overview
Publisher JMIR Publications
Date 2019 Jun 15
PMID 31199298
Citations 21
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Social media use is now ubiquitous, but the growth in social media communications has also made it a convenient digital platform for drug dealers selling controlled substances, opioids, and other illicit drugs. Previous studies and news investigations have reported the use of popular social media platforms as conduits for opioid sales. This study uses deep learning to detect illicit drug dealing on the image and video sharing platform Instagram.

Objective: The aim of this study was to develop and evaluate a machine learning approach to detect Instagram posts related to illegal internet drug dealing.

Methods: In this paper, we describe an approach to detect drug dealers by using a deep learning model on Instagram. We collected Instagram posts using a Web scraper between July 2018 and October 2018 and then compared our deep learning model against 3 different machine learning models (eg, random forest, decision tree, and support vector machine) to assess the performance and accuracy of the model. For our deep learning model, we used the long short-term memory unit in the recurrent neural network to learn the pattern of the text of drug dealing posts. We also manually annotated all posts collected to evaluate our model performance and to characterize drug selling conversations.

Results: From the 12,857 posts we collected, we detected 1228 drug dealer posts comprising 267 unique users. We used cross-validation to evaluate the 4 models, with our deep learning model reaching 95% on F1 score and performing better than the other 3 models. We also found that by removing the hashtags in the text, the model had better performance. Detected posts contained hashtags related to several drugs, including the controlled substance Xanax (1078/1228, 87.78%), oxycodone/OxyContin (321/1228, 26.14%), and illicit drugs lysergic acid diethylamide (213/1228, 17.34%) and 3,4-methylenedioxy-methamphetamine (94/1228, 7.65%). We also observed the use of communication applications for suspected drug trading through user comments.

Conclusions: Our approach using a combination of Web scraping and deep learning was able to detect illegal online drug sellers on Instagram, with high accuracy. Despite increased scrutiny by regulators and policymakers, the Instagram platform continues to host posts from drug dealers, in violation of federal law. Further action needs to be taken to ensure the safety of social media communities and help put an end to this illicit digital channel of sourcing.

Citing Articles

Insights into the Experiences of Persons with Substance Use Disorders During COVID-19 Lockdown in Lagos, Nigeria: A Qualitative Investigation.

Adejoh S, Osazuwa P, Busari-Akinbode S, Gborogen R, Awodein A, Adisa W Subst Use. 2024; 18:29768357241307752.

PMID: 39691942 PMC: 11650566. DOI: 10.1177/29768357241307752.


Is This Safe? Examining Safety Assessments of Illicit Drug Purchasing on Social Media Using Conjoint Analysis.

Haupt M, Cuomo R, Cui M, Mackey T Subst Use Misuse. 2024; 59(7):999-1011.

PMID: 38319039 PMC: 11019931. DOI: 10.1080/10826084.2024.2310507.


Generating Contextual Variables From Web-Based Data for Health Research: Tutorial on Web Scraping, Text Mining, and Spatial Overlay Analysis.

Galvez-Hernandez P, Gonzalez-Viana A, Gonzalez-De Paz L, Shankardass K, Muntaner C JMIR Public Health Surveill. 2024; 10:e50379.

PMID: 38190245 PMC: 10804251. DOI: 10.2196/50379.


The Adverse Effects and Nonmedical Use of Methylphenidate Before and After the Outbreak of COVID-19: Machine Learning Analysis.

Shin H, Yuniar C, Oh S, Purja S, Park S, Lee H J Med Internet Res. 2023; 25:e45146.

PMID: 37585250 PMC: 10468706. DOI: 10.2196/45146.


Illicit Online Pharmacies: A Scoping Review.

Limbu Y, Huhmann B Int J Environ Res Public Health. 2023; 20(9).

PMID: 37174265 PMC: 10178756. DOI: 10.3390/ijerph20095748.


References
1.
Orizio G, Merla A, Schulz P, Gelatti U . Quality of online pharmacies and websites selling prescription drugs: a systematic review. J Med Internet Res. 2011; 13(3):e74. PMC: 3222188. DOI: 10.2196/jmir.1795. View

2.
Cavazos-Rehg P, Krauss M, Sowles S, Bierut L . Marijuana-Related Posts on Instagram. Prev Sci. 2016; 17(6):710-20. PMC: 4939096. DOI: 10.1007/s11121-016-0669-9. View

3.
Mackey T, Kalyanam J, Klugman J, Kuzmenko E, Gupta R . Solution to Detect, Classify, and Report Illicit Online Marketing and Sales of Controlled Substances via Twitter: Using Machine Learning and Web Forensics to Combat Digital Opioid Access. J Med Internet Res. 2018; 20(4):e10029. PMC: 5948414. DOI: 10.2196/10029. View

4.
Pergolizzi Jr J, LeQuang Ba J, Taylor Jr R, Raffa R . The "Darknet": The new street for street drugs. J Clin Pharm Ther. 2017; 42(6):790-792. DOI: 10.1111/jcpt.12628. View

5.
Katsuki T, Mackey T, Cuomo R . Establishing a Link Between Prescription Drug Abuse and Illicit Online Pharmacies: Analysis of Twitter Data. J Med Internet Res. 2015; 17(12):e280. PMC: 4704982. DOI: 10.2196/jmir.5144. View