Deep learning for religious and continent-based toxic content detection and classification

Ahmed Abbasi; Abdul Rehman Javed; Farkhund Iqbal; Natalia Kryvinska; Zunera Jalil

doi:10.1038/s41598-022-22523-3

Sci Rep. 2022 Oct 19;12(1):17478. doi: 10.1038/s41598-022-22523-3.

Authors

Ahmed Abbasi¹, Abdul Rehman Javed²,³, Farkhund Iqbal⁴, Natalia Kryvinska⁵, Zunera Jalil¹

Affiliations

¹ Department of Creative Technologies, PAF Complex, E-9, Air University, Islamabad, Pakistan.
² Department of Cyber Security, PAF Complex, E-9, Air University, Islamabad, Pakistan. abdulrehman.cs@au.edu.pk.
³ Department of Electrical and Computer Engineering, Lebanese American University, Byblos, Lebanon. abdulrehman.cs@au.edu.pk.
⁴ College of Technological Innovation, Zayed University, Abu Dhabi, United Arab Emirates.
⁵ Information Systems Department, Faculty of Management, Comenius University in Bratislava, Odbojárov 10, 82005, Bratislava, 25, Slovakia. natalia.kryvinska@fm.uniba.sk.

Abstract

Over time, numerous online communication platforms have emerged that allow people to express themselves, which has increased the dissemination of toxic language such as racism, sexual harassment, and other negative behaviors that are not accepted in polite society. As a result, toxic language identification in online communication has emerged as a critical application of natural language processing. Numerous academic and industrial researchers have recently studied toxic language identification using machine learning algorithms. However, several machine learning models assigned unrealistically high toxicity ratings to nontoxic comments that include particular identity descriptors, such as Muslim, Jewish, White, and Black. This research analyzes and compares modern deep learning algorithms for multilabel toxic comment classification. We explore two scenarios: the first is a multilabel classification of religious toxic comments, and the second is a multilabel classification of toxic comments about race or ethnicity, with various word embeddings (GloVe, Word2vec, and FastText) and without them, using an ordinary embedding layer. Experiments show that the CNN model produced the best results for classifying multilabel toxic comments in both scenarios. We compared the performance of these deep learning models in terms of multilabel evaluation metrics.
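
The abstract does not say which multilabel evaluation metrics were used. As an illustration only, the sketch below computes three common ones (Hamming loss, subset accuracy, and micro-averaged F1) in plain Python. The function names, the three-label layout, and the toy data are assumptions made for this example and are not taken from the paper.

```python
# Illustrative multilabel metrics; not the authors' evaluation code.
# Each row is one comment; each column is one binary label (hypothetical labels).

def hamming_loss(y_true, y_pred):
    """Fraction of individual label slots that were predicted incorrectly."""
    total = sum(len(row) for row in y_true)
    wrong = sum(
        t != p
        for true_row, pred_row in zip(y_true, y_pred)
        for t, p in zip(true_row, pred_row)
    )
    return wrong / total


def subset_accuracy(y_true, y_pred):
    """Fraction of comments whose full label vector was predicted exactly."""
    exact = sum(list(t) == list(p) for t, p in zip(y_true, y_pred))
    return exact / len(y_true)


def micro_f1(y_true, y_pred):
    """F1 computed from true/false positives pooled over all labels."""
    tp = fp = fn = 0
    for true_row, pred_row in zip(y_true, y_pred):
        for t, p in zip(true_row, pred_row):
            if t == 1 and p == 1:
                tp += 1
            elif t == 0 and p == 1:
                fp += 1
            elif t == 1 and p == 0:
                fn += 1
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


# Toy data: 4 comments, 3 hypothetical labels.
y_true = [[1, 0, 0], [0, 1, 1], [0, 0, 0], [1, 1, 0]]
y_pred = [[1, 0, 0], [0, 1, 0], [0, 0, 0], [1, 0, 1]]

print(hamming_loss(y_true, y_pred))     # 3 wrong slots out of 12
print(subset_accuracy(y_true, y_pred))  # 2 exact rows out of 4
print(micro_f1(y_true, y_pred))         # tp=3, fp=1, fn=2
```

Hamming loss rewards partially correct label sets, while subset accuracy counts a comment as correct only when every label matches, so the two can rank models differently.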

MeSH terms

Algorithms
Deep Learning*
Humans
Language
Machine Learning
Natural Language Processing