A machine learning model to predict privacy fatigued users from social media personalized advertisements

Sci Rep. 2024 Feb 14;14(1):3685. doi: 10.1038/s41598-024-54078-w.

Abstract

The increasing use of social media platforms as personalized advertising channels is a double-edged sword. A high level of personalization on these platforms increases users' sense of losing control over personal data: This could trigger the privacy fatigue phenomenon manifested in emotional exhaustion and cynicism toward privacy, which leads to a lack of privacy-protective behavior. Machine learning has shown its effectiveness in the early prediction of people's psychological state to avoid such consequences. Therefore, this study aims to classify users with low and medium-to-high levels of privacy fatigue, based on their information privacy awareness and big-five personality traits. A dataset was collected from 538 participants via an online questionnaire. The prediction models were built using the Support Vector Machine, Naïve Bayes, K-Nearest Neighbors, Decision Tree, and Random Forest classifiers, based on the literature. The results showed that awareness and conscientiousness trait have a significant relationship with privacy fatigue. Support Vector Machine and Naïve Bayes classifiers outperformed the other classifiers by attaining a classification accuracy of 78%, F1 of 87%, recall of 100% and 98%, and precision of 78% and 79% respectively, using five-fold cross-validation.

MeSH terms

  • Advertising*
  • Bayes Theorem
  • Fatigue
  • Humans
  • Machine Learning
  • Privacy
  • Social Media*
  • Support Vector Machine