A Fine-Tuned BERT-Based Transfer Learning Approach for Text Classification

Rukhma Qasim; Waqas Haider Bangyal; Mohammed A Alqarni; Abdulwahab Ali Almazroi

doi:10.1155/2022/3498123

A Fine-Tuned BERT-Based Transfer Learning Approach for Text Classification

J Healthc Eng. 2022 Jan 7:2022:3498123. doi: 10.1155/2022/3498123. eCollection 2022.

Authors

Rukhma Qasim¹, Waqas Haider Bangyal¹, Mohammed A Alqarni², Abdulwahab Ali Almazroi³

Affiliations

¹ Dept. of Computer Science, University of Gujrat, Pakistan.
² University of Jeddah, College of Computer Science and Engineering, Department of Software Engineering, Jeddah, Saudi Arabia.
³ University of Jeddah, College of Computing and Information Technology at Khulais, Department of Information Technology, Jeddah, Saudi Arabia.

Abstract

Text Classification problem has been thoroughly studied in information retrieval problems and data mining tasks. It is beneficial in multiple tasks including medical diagnose health and care department, targeted marketing, entertainment industry, and group filtering processes. A recent innovation in both data mining and natural language processing gained the attention of researchers from all over the world to develop automated systems for text classification. NLP allows categorizing documents containing different texts. A huge amount of data is generated on social media sites through social media users. Three datasets have been used for experimental purposes including the COVID-19 fake news dataset, COVID-19 English tweet dataset, and extremist-non-extremist dataset which contain news blogs, posts, and tweets related to coronavirus and hate speech. Transfer learning approaches do not experiment on COVID-19 fake news and extremist-non-extremist datasets. Therefore, the proposed work applied transfer learning classification models on both these datasets to check the performance of transfer learning models. Models are trained and evaluated on the accuracy, precision, recall, and F1-score. Heat maps are also generated for every model. In the end, future directions are proposed.

MeSH terms

COVID-19*
Disinformation*
Humans
Machine Learning
Natural Language Processing
SARS-CoV-2