Adaptive sentiment analysis using multioutput classification: a performance comparison

PeerJ Comput Sci. 2023 May 9:9:e1378. doi: 10.7717/peerj-cs.1378. eCollection 2023.

Abstract

The primary objective of this research is to create a multi-output classification model for sentiment analysis through the combination of 10 algorithms: BernoulliNB, Decision Tree, K-nearest neighbor, Logistic Regression, LinearSVC, Bagging, Stacking, Random Forest, AdaBoost, and ExtraTrees. In doing so, we aim to identify the optimal algorithm performance and role within the model. The data utilized in this study is derived from customer reviews of cryptocurrencies in Indonesia. Our results indicate that LinearSVC and Stacking exhibit a high accuracy (90%) compared to the other eight algorithms. The resulting multi-output model demonstrates an average accuracy of 88%, which can be considered satisfactory. This research endeavors to innovate in adaptive sentiment analysis classification by developing a multi-output model that utilizes a combination of 10 classification algorithms.

Keywords: AdaBoost and ExtraTrees; Bagging and Stacking; BernoulliNB; Comparation; Decision Tree; K-nearest neighbor; LinearSVC; Logistic Regression; Multioutput; Random Forest.

Grants and funding

The authors received no funding for this work.