Inducing Domain-Specific Sentiment Lexicons from Unlabeled Corpora
Overview
Affiliations
A word's sentiment depends on the domain in which it is used. Computational social science research thus requires sentiment lexicons that are specific to the domains being studied. We combine domain-specific word embeddings with a label propagation framework to induce accurate domain-specific sentiment lexicons using small sets of seed words. We show that our approach achieves state-of-the-art performance on inducing sentiment lexicons from domain-specific corpora and that our purely corpus-based approach outperforms methods that rely on hand-curated resources (e.g., WordNet). Using our framework, we induce and release historical sentiment lexicons for 150 years of English and community-specific sentiment lexicons for 250 online communities from the social media forum Reddit. The historical lexicons we induce show that more than 5% of sentiment-bearing (non-neutral) English words completely switched polarity during the last 150 years, and the community-specific lexicons highlight how sentiment varies drastically between different communities.
The linguistic and emotional effects of weather on UK social media users.
Young J, Arthur R, Williams H Sci Rep. 2025; 15(1):8009.
PMID: 40055332 PMC: 11889188. DOI: 10.1038/s41598-024-82384-w.
Moral Association Graph: A Cognitive Model for Automated Moral Inference.
Ramezani A, Xu Y Top Cogn Sci. 2024; 17(1):120-138.
PMID: 39585761 PMC: 11792775. DOI: 10.1111/tops.12774.
CIDER: Context-sensitive polarity measurement for short-form text.
Young J, Arthur R, Williams H PLoS One. 2024; 19(4):e0299490.
PMID: 38635650 PMC: 11025856. DOI: 10.1371/journal.pone.0299490.
Evaluating criminal justice reform during COVID-19: The need for a novel sentiment analysis package.
Ramjee D, Smith L, Doanvo A, Charpignon M, McNulty-Nebel A, Lett E PLOS Digit Health. 2023; 1(7):e0000063.
PMID: 36812565 PMC: 9931240. DOI: 10.1371/journal.pdig.0000063.
Text Mining Oral Histories in Historical Archaeology.
Brown M, Shackel P Int J Hist Archaeol. 2023; :1-17.
PMID: 36686603 PMC: 9838340. DOI: 10.1007/s10761-022-00680-5.