A deep learning approach in predicting products' sentiment ratings: a comparative analysis

Vimala Balakrishnan; Zhongliang Shi; Chuan Liang Law; Regine Lim; Lee Leng Teh; Yue Fan

doi:10.1007/s11227-021-04169-6

J Supercomput. 2022;78(5):7206-7226. doi: 10.1007/s11227-021-04169-6. Epub 2021 Nov 5.

Authors

Vimala Balakrishnan¹, Zhongliang Shi¹, Chuan Liang Law², Regine Lim¹, Lee Leng Teh³, Yue Fan¹

Affiliations

¹ Faculty of Computer Science and Information Technology, Universiti Malaya, 50603 Kuala Lumpur, Malaysia.
² Malayan Banking Berhad, 50050 Kuala Lumpur, Malaysia.
³ Datium Insights, 59200 Kuala Lumpur, Malaysia.

Abstract

We present a benchmark comparison of several deep learning models, including Convolutional Neural Networks (CNN), Recurrent Neural Networks (RNN) and Bi-directional Long Short-Term Memory (Bi-LSTM), assessed using various word embedding approaches, including Bidirectional Encoder Representations from Transformers (BERT) and its variants, FastText and Word2Vec. Data augmentation was performed using the Easy Data Augmentation approach, resulting in two datasets (original versus augmented). All the models were assessed in two setups, namely 5-class versus 3-class (i.e., a compressed version). Findings show that the best prediction models were neural network-based using Word2Vec, with the combined CNN-RNN-Bi-LSTM model producing the highest accuracy (96%) and F-score (91.1%). Individually, RNN was the best model with an accuracy of 87.5% and F-score of 83.5%, while RoBERTa had the best F-score of 73.1%. The study shows that deep learning is better suited than supervised machine learning for analyzing sentiment in text, and it provides a direction for future research.

Keywords: Customer reviews; Deep learning; Ensemble models; Sentiment rating; Word embeddings.
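
As an illustration of two steps named in the abstract, the following is a minimal Python sketch, not the authors' implementation. It shows two of the four Easy Data Augmentation operations (random swap and random deletion; synonym replacement and random insertion are omitted because they need a thesaurus) and one possible way to compress 5-class ratings into 3 classes. The function names, the `p` and `n_swaps` parameters, and in particular the rating-to-class mapping (1-2 negative, 3 neutral, 4-5 positive) are assumptions made for illustration; the abstract does not specify them.

```python
import random


def random_swap(tokens, n_swaps, rng):
    """EDA 'random swap': exchange two randomly chosen positions, n_swaps times."""
    tokens = list(tokens)
    if len(tokens) < 2:
        return tokens
    for _ in range(n_swaps):
        i, j = rng.sample(range(len(tokens)), 2)
        tokens[i], tokens[j] = tokens[j], tokens[i]
    return tokens


def random_deletion(tokens, p, rng):
    """EDA 'random deletion': drop each token independently with probability p."""
    if len(tokens) <= 1:
        return list(tokens)
    kept = [t for t in tokens if rng.random() >= p]
    # Never return an empty review: fall back to a single random token.
    return kept if kept else [rng.choice(tokens)]


def compress_rating(stars):
    """Map a 1-5 rating to 3 classes.

    ASSUMPTION: this mapping is hypothetical; the abstract only says the
    3-class setup is a compressed version of the 5-class one.
    """
    if stars not in (1, 2, 3, 4, 5):
        raise ValueError(f"expected a rating from 1 to 5, got {stars!r}")
    if stars <= 2:
        return "negative"
    if stars == 3:
        return "neutral"
    return "positive"


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so the augmentation is reproducible
    review = "the battery life is great but the screen is dim".split()
    print(" ".join(random_swap(review, n_swaps=2, rng=rng)))
    print(" ".join(random_deletion(review, p=0.1, rng=rng)))
    print(compress_rating(4))
```

Augmented copies would keep the label of the review they were derived from, so the augmented dataset can be evaluated under either the 5-class or the 3-class setup.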