Stress modelling and prediction in presence of scarce data

J Biomed Inform. 2016 Oct:63:344-356. doi: 10.1016/j.jbi.2016.08.023. Epub 2016 Aug 31.

Abstract

Objective: Stress at work is a significant occupational health concern. Recent studies have used various sensing modalities to model stress behaviour based on non-obtrusive data obtained from smartphones. However, when the data for a subject is scarce it becomes a challenge to obtain a good model.

Methods: We propose an approach based on a combination of techniques: semi-supervised learning, ensemble methods and transfer learning to build a model of a subject with scarce data. Our approach is based on the comparison of decision trees to select the closest subject for knowledge transfer.

Results: We present a real-life, unconstrained study carried out with 30 employees within two organisations. The results show that using information (instances or model) from similar subjects can improve the accuracy of the subjects with scarce data. However, using transfer learning from dissimilar subjects can have a detrimental effect on the accuracy. Our proposed ensemble approach increased the accuracy by ≈10% to 71.58% compared to not using any transfer learning technique.

Conclusions: In contrast to high precision but highly obtrusive sensors, using smartphone sensors for measuring daily behaviours allowed us to quantify behaviour changes, relevant to occupational stress. Furthermore, we have shown that use of transfer learning to select data from close models is a useful approach to improve accuracy in presence of scarce data.

Keywords: Ensemble methods; Semi-supervised learning; Stress modelling; Transfer learning.

MeSH terms

  • Algorithms*
  • Decision Trees*
  • Humans
  • Statistics as Topic
  • Stress, Psychological*
  • Supervised Machine Learning
  • Workplace