Transfer Learning for Sentiment Analysis Using BERT Based Supervised Fine-Tuning

Sensors (Basel). 2022 May 30;22(11):4157. doi: 10.3390/s22114157.

Abstract

The growth of the Internet has expanded the amount of data expressed by users across multiple platforms. The availability of these diverse worldviews and individual emotions empowers sentiment analysis. However, sentiment analysis becomes even more challenging due to the scarcity of standardized labeled data in the Bangla NLP domain. The majority of existing Bangla research has relied on deep learning models built on context-independent word embeddings, such as Word2Vec, GloVe, and fastText, in which each word has a fixed representation irrespective of its context. Meanwhile, context-based pre-trained language models such as BERT have recently advanced the state of the art in natural language processing. In this work, we apply BERT's transfer learning ability to an integrated deep CNN-BiLSTM model to enhance decision-making performance in sentiment analysis. We also apply transfer learning to classical machine learning algorithms for comparison against the CNN-BiLSTM model. Additionally, we explore various word embedding techniques, such as Word2Vec, GloVe, and fastText, and compare their performance to the BERT transfer learning strategy. As a result, we demonstrate state-of-the-art binary classification performance for Bangla sentiment analysis that significantly outperforms all other embeddings and algorithms considered.
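The architecture described in the abstract, contextual token embeddings from a BERT encoder feeding an integrated CNN-BiLSTM classifier, can be sketched roughly as follows. This is a minimal PyTorch illustration, not the paper's implementation: the layer sizes are illustrative assumptions, and a random tensor stands in for the `last_hidden_state` a Bangla-BERT model would produce.

```python
import torch
import torch.nn as nn

class CNNBiLSTMHead(nn.Module):
    """Sketch of a CNN-BiLSTM classification head over BERT token embeddings."""
    def __init__(self, embed_dim=768, conv_channels=128, lstm_hidden=64, num_classes=2):
        super().__init__()
        # 1D convolution over the token axis extracts local n-gram-like features
        self.conv = nn.Conv1d(embed_dim, conv_channels, kernel_size=3, padding=1)
        # BiLSTM captures sequential context in both directions
        self.bilstm = nn.LSTM(conv_channels, lstm_hidden,
                              batch_first=True, bidirectional=True)
        self.fc = nn.Linear(2 * lstm_hidden, num_classes)

    def forward(self, x):
        # x: (batch, seq_len, embed_dim), e.g. a BERT encoder's last_hidden_state
        h = torch.relu(self.conv(x.transpose(1, 2))).transpose(1, 2)
        h, _ = self.bilstm(h)            # (batch, seq_len, 2 * lstm_hidden)
        return self.fc(h[:, -1, :])      # classify from the final timestep

# Stand-in for BERT output: 4 sequences of 32 tokens with 768-dim embeddings
embeddings = torch.randn(4, 32, 768)
logits = CNNBiLSTMHead()(embeddings)
print(logits.shape)  # torch.Size([4, 2]) -- one binary-sentiment logit pair per sequence
```

In practice the embeddings would come from a frozen or fine-tuned BERT encoder (transfer learning), with the CNN-BiLSTM head trained on the labeled Bangla sentiment data.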

Keywords: Bangla NLP; Bangla-BERT; sentiment analysis; transfer learning; transformer; word embedding.

MeSH terms

  • Algorithms
  • Humans
  • Language
  • Machine Learning
  • Natural Language Processing*
  • Sentiment Analysis*

Grants and funding

This research was funded by Taif University Researchers Supporting Project number (TURSP-2020/239), Taif University, Taif, Saudi Arabia.