Stylometric characteristics of code-switched offensive language in social media

Zhou, Lina; Fu, Zhe

doi:http://doi.org/10.1016/j.im.2025.104153

Stylometric characteristics of code-switched offensive language in social media

Zhou, Lina; Fu, Zhe

2025

Download

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Add to Basket

Cite

Files

Abstract

Offensive language is a significant detriment to social media environments. Existing research predominantly assumes monolingual expression, overlooking the prevalent behavior of code-switching (CS). To address this critical knowledge gap, this study identifies and empirically validates the distinct stylometric characteristics of code-switched (CSed) offensive language. Additionally, we developed methods to construct the first social media dataset specifically for CSed offensive content. Our analysis of this dataset reveals that CSed offensive language exhibits unique stylometric characteristics; moreover, these characteristics vary between the language segments involved in the CS. Furthermore, incorporating these features significantly enhances the performance of offensive language detection models. These findings offer significant research and practical implications for social media researchers, platforms, moderators, and users.

Details

Title

Stylometric characteristics of code-switched offensive language in social media

Author

Zhou, Lina (Department of Business Information Systems and Operations Management)
Fu, Zhe (Department of Software and Information Systems)

Date

4/24/2025

Subjects

Social media
Code switching (Linguistics)

Published Version (Please cite this version)

http://doi.org/10.1016/j.im.2025.104153

Link to This Page

Handle: http://hdl.handle.net/20.500.13093/ir:5267

Publication Type

articles

File Format

application/pdf

Language

English

Usage Statement

This item may be protected by copyright and other related rights. Atkins Library provides access to this item for educational and research purposes only; other uses require the permission of the copyright holder.

Record Appears in

Departments and Institutes > Department of Business Information Systems and Operations Management
Departments and Institutes > Department of Software and Information Systems
Types > Articles
Faculty and Staff Works

PDF

Statistics

Download Full History

Stylometric characteristics of code-switched offensive language in social media

Files

Abstract

Details

Related Items

PDF

Statistics