Granular Emotion Detection for Multi-class Sentiment Analysis in Social Media

Frye, Robert

Granular Emotion Detection for Multi-class Sentiment Analysis in Social Media

Search for this publication on Google Scholar

Frye, R. (2022). Granular Emotion Detection for Multi-class Sentiment Analysis in Social Media. Unc Charlotte Electronic Theses And Dissertations.

Download PDF

Analytics

18 views ◎
5 downloads ⇓

Abstract

Sentiment analysis for text classification generally refers to assessing the polarity of the emotional context of written text, whether in a binary (e.g. positive or negative) or trinary (e.g. positive, neutral, or negative) state. Granular emotion detection is a more specialized form of sentiment analysis, wherein we move from predicting sentiment polarity to detecting specific classes of emotions within text (e.g. happy, sad, anger, love, hate, etc.), whether that context is a reflection of the author's own emotional state or the emotional state the author intended to convey. Granular emotion detection is broadly applicable to the business world, with common applications in customer satisfaction and retention, as well as studies of marketing effectiveness. Other applications include attempting to identify angry people based on their social media posts and prevent them from committing acts of violence. Current approaches to multi-class emotion classification show mixed or limited results, and improving accuracy for multiple classes of emotions is an open research challenge. Moreover, many modern application contexts align more directly with social media content or have a shorter format more akin to social media, where texts often bend or violate standard language conventions. Overall, understanding emotion detection in social media (EMDISM) contexts is an open challenge.To address the challenge of granular emotion detection in social media text, I have investigated ensemble approaches that combine a variety of individual classifiers to address tradeoffs in performance. This involved first investigating EMDISM performance for individual traditional machine learning (ML), deep learning (DL), and transformer learning (TL) classifiers. Based on this analysis, the second stage investigated the creation of ensembles of the most accurate classifiers across these general classes which offer comparatively improved performance. The approaches were evaluated based on a large Twitter dataset with more than 1.2M samples and encompassing seven discrete emotions. I provide results and analysis for each classifier I considered as well as the most accurate ensembles I created from the most accurate singleton classifiers. Results show that the proposed ensemble approaches - simple voting, weighted voting, cascading, and cascading/switching - improve upon the state of the art for average accuracy, weighted precision, weighted recall, and weighted f-measure as compared to the most accurate single classifier for EMDISM.

Details

Author: Frye, Robert
Title: Granular Emotion Detection for Multi-class Sentiment Analysis in Social Media
Physical Description: 1 online resource (153 pages) : PDF
Date: 2022
Degree Granting Institution: University of North Carolina at Charlotte
Abstract: Sentiment analysis for text classification generally refers to assessing the polarity of the emotional context of written text, whether in a binary (e.g. positive or negative) or trinary (e.g. positive, neutral, or negative) state. Granular emotion detection is a more specialized form of sentiment analysis, wherein we move from predicting sentiment polarity to detecting specific classes of emotions within text (e.g. happy, sad, anger, love, hate, etc.), whether that context is a reflection of the author's own emotional state or the emotional state the author intended to convey. Granular emotion detection is broadly applicable to the business world, with common applications in customer satisfaction and retention, as well as studies of marketing effectiveness. Other applications include attempting to identify angry people based on their social media posts and prevent them from committing acts of violence. Current approaches to multi-class emotion classification show mixed or limited results, and improving accuracy for multiple classes of emotions is an open research challenge. Moreover, many modern application contexts align more directly with social media content or have a shorter format more akin to social media, where texts often bend or violate standard language conventions. Overall, understanding emotion detection in social media (EMDISM) contexts is an open challenge.To address the challenge of granular emotion detection in social media text, I have investigated ensemble approaches that combine a variety of individual classifiers to address tradeoffs in performance. This involved first investigating EMDISM performance for individual traditional machine learning (ML), deep learning (DL), and transformer learning (TL) classifiers. Based on this analysis, the second stage investigated the creation of ensembles of the most accurate classifiers across these general classes which offer comparatively improved performance. The approaches were evaluated based on a large Twitter dataset with more than 1.2M samples and encompassing seven discrete emotions. I provide results and analysis for each classifier I considered as well as the most accurate ensembles I created from the most accurate singleton classifiers. Results show that the proposed ensemble approaches - simple voting, weighted voting, cascading, and cascading/switching - improve upon the state of the art for average accuracy, weighted precision, weighted recall, and weighted f-measure as compared to the most accurate single classifier for EMDISM.
Genre: doctoral dissertations
Subjects--Topics: Computer science
Degree: Ph.D.
Keywords: Emdism
Emotion Detection
Ensemble
Natural Language Processing
Sentiment Analysis
Social Media
Subject Area: Computer Science
Advisor(s): Wilson, David
Committee Members: Niu, Xi
Najjar, Nadia
Ge, Yaorong
Degree Note: Thesis (Ph.D.)--University of North Carolina at Charlotte, 2022.
Rights Statement: This Item is protected by copyright and/or related rights. You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s). For additional information, see http://rightsstatements.org/page/InC/1.0/.
Rights Holder Information: Copyright is held by the author unless otherwise indicated.
Identifier: Frye_uncc_0694D_13183
Permalink: http://hdl.handle.net/20.500.13093/etd:3045

J. Murrey Atkins Library

J. Murrey Atkins Library