» Articles » PMID: 39220951

From Hate to Harmony: Leveraging Large Language Models for Safer Speech in Times of COVID-19 Crisis

Overview
Journal Heliyon
Specialty Social Sciences
Date 2024 Sep 2
PMID 39220951
Authors
Affiliations
Soon will be listed here.
Abstract

This study investigates the rampant spread of offensive and derogatory language during the COVID-19 pandemic and aims to mitigate it through machine learning. Employing advanced Large Language Models (LLMs), the research develops a sophisticated framework adept at detecting and transforming abusive and hateful speech. The project begins by meticulously compiling a dataset, focusing specifically on Chinese language abuse and hate speech. It incorporates an extensive list of 30 pandemic-related terms, significantly enriching the resources available for this type of research. A two-tier detection model is then introduced, achieving a remarkable accuracy of 94.42 % in its first phase and an impressive 81.48 % in the second. Furthermore, the study enhances paraphrasing efficiency by integrating generative AI techniques, primarily Large Language Models, with a Latent Dirichlet Allocation (LDA) topic model. This combination allows for a thorough analysis of language before and after modification. The results highlight the transformative power of these methods. They show that the rephrased statements not only reduce the initial hostility but also preserve the essential themes and meanings. This breakthrough offers users effective rephrasing suggestions to prevent the spread of hate speech, contributing to more positive and constructive public discourse.

References
1.
Vidgen B, Derczynski L . Directions in abusive language training data, a systematic review: Garbage in, garbage out. PLoS One. 2020; 15(12):e0243300. PMC: 7769249. DOI: 10.1371/journal.pone.0243300. View

2.
Hochreiter S, Schmidhuber J . Long short-term memory. Neural Comput. 1997; 9(8):1735-80. DOI: 10.1162/neco.1997.9.8.1735. View

3.
Wang J, Zhang X, Liu W, Li P . Spatiotemporal pattern evolution and influencing factors of online public opinion--Evidence from the early-stage of COVID-19 in China. Heliyon. 2023; 9(9):e20080. PMC: 10559807. DOI: 10.1016/j.heliyon.2023.e20080. View

4.
Graves A, Schmidhuber J . Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural Netw. 2005; 18(5-6):602-10. DOI: 10.1016/j.neunet.2005.06.042. View

5.
Haman M . The use of Twitter by state leaders and its impact on the public during the COVID-19 pandemic. Heliyon. 2020; 6(11):e05540. PMC: 7695954. DOI: 10.1016/j.heliyon.2020.e05540. View