Depression screening using mobile phone usage metadata: a machine learning approach

J Am Med Inform Assoc. 2020 Apr 1;27(4):522-530. doi: 10.1093/jamia/ocz221.

Abstract

Objective: Depression is currently the second most significant contributor to non-fatal disease burdens globally. While it is treatable, depression remains undiagnosed in many cases. As mobile phones have now become an integral part of daily life, this study examines the possibility of screening for depressive symptoms continuously based on patients' mobile usage patterns.

Materials and methods: 412 research participants reported a range of their mobile usage statistics. Beck Depression Inventory-2nd ed (BDI-II) was used to measure the severity of depression among participants. A wide array of machine learning classification algorithms was trained to detect participants with depression symptoms (ie, BDI-II score ≥ 14). The relative importance of individual variables was additionally quantified.

Results: Participants with depression were found to have fewer saved contacts on their devices, spend more time on their mobile devices to make and receive fewer and shorter calls, and send more text messages than participants without depression. The best model was a random forest classifier, which had an out-of-sample balanced accuracy of 0.768. The balanced accuracy increased to 0.811 when participants' age and gender were included.

Discussions/conclusion: The significant predictive power of mobile usage attributes implies that, by collecting mobile usage statistics, mental health mobile applications can continuously screen for depressive symptoms for initial diagnosis or for monitoring the progress of ongoing treatments. Moreover, the input variables used in this study were aggregated mobile usage metadata attributes, which has low privacy sensitivity making it more likely for patients to grant required application permissions.

Keywords: depression; machine learning; mobile health; mobile usage.

MeSH terms

  • Adult
  • Algorithms*
  • Area Under Curve
  • Cell Phone Use / statistics & numerical data*
  • Depression / classification
  • Depression / diagnosis*
  • Depressive Disorder / diagnosis
  • Humans
  • Logistic Models
  • Machine Learning*
  • Mobile Applications*
  • Neural Networks, Computer
  • Sensitivity and Specificity
  • Severity of Illness Index
  • Surveys and Questionnaires
  • Telemedicine*