Characterisation of Temporal Patterns in Step Count Behaviour from Smartphone App Data: An Unsupervised Machine Learning Approach

Int J Environ Res Public Health. 2021 Oct 31;18(21):11476. doi: 10.3390/ijerph182111476.

Abstract

The increasing ubiquity of smartphone data, with greater spatial and temporal coverage than achieved by traditional study designs, have the potential to provide insight into habitual physical activity patterns. This study implements and evaluates the utility of both K-means clustering and agglomerative hierarchical clustering methods in identifying weekly and yearlong physical activity behaviour trends. Characterising the demographics and choice of activity type within the identified clusters of behaviour. Across all seven clusters of seasonal activity behaviour identified, daylight saving was shown to play a key role in influencing behaviour, with increased activity in summer months. Investigation into weekly behaviours identified six clusters with varied roles, of weekday versus weekend, on the likelihood of meeting physical activity guidelines. Preferred type of physical activity likewise varied between clusters, with gender and age strongly associated with cluster membership. Key relationships are identified between weekly clusters and seasonal activity behaviour clusters, demonstrating how short-term behaviours contribute to longer-term activity patterns. Utilising unsupervised machine learning, this study demonstrates how the volume and richness of secondary app data can allow us to move away from aggregate measures of physical activity to better understand temporal variations in habitual physical activity behaviour.

Keywords: big data; cluster analysis; data science; physical activity; secondary data; self-recorded health data; smartphone; unsupervised machine learning.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cluster Analysis
  • Exercise
  • Mobile Applications*
  • Smartphone
  • Unsupervised Machine Learning*