Archetype analysis and the PHATE algorithm as methods to describe and visualize pregnant women's levels of physical activity knowledge

BMC Public Health. 2024 Apr 15;24(1):1054. doi: 10.1186/s12889-024-18355-7.

Abstract

The knowledge of physical activity (PA) recommended for pregnant women and practical application of it has positive impact on the outcome. Nevertheless, it is estimated that in high-income countries over 40% of pregnant women are insufficiently physically active. One of the reasons is insufficient knowledge pregnant women have about allowed effort during pregnancy and both recommended and not recommended physical activities. Description of knowledge about physical activity the women have and distinguishing patterns of their knowledge is becoming an increasingly important issue. A common approach to handle survey data that reflect knowledge involves clustering methods or Principal Component Analysis (PCA). Nevertheless, new procedures of data analysis are still being sought. Using survey data collected by the Institute of Mother and Child Archetypal analysis has been applied to detect levels of knowledge reflected by answers given in a questionnaire and to derive patterns of knowledge contained in the data. Next, PHATE (Potential of Heat-diffusion for Affinity-based Trajectory Embedding) algorithm has been used to visualize the results and to get a deeper insight into the data structure. The results were compared with picture derived from PCA. Three archetypes representing three patterns of knowledge have been distinguished and described. The presentation of complex data in a low dimension was obtained with help of PHATE. The formations revealed by PHATE have been successfully described in terms of knowledge levels reflected by the survey. Finally, comparison of PHATE with PCA has been shown. Archetype analysis combined with PHATE provides novel opportunities in examining nonlinear structure of survey data and allows for visualization that captures complex relations in the data. PHATE has made it possible to distinguish sets of objects that have common features but were captured neither by Archetypal analysis nor PCA. Moreover, for our data, PHATE provides an image of data structure which is more detailed than interpretation of PCA.

Keywords: Archetypal analysis; Knowledge of physical activity in pregnant women; PCA; PHATE; Patterns of knowledge.

MeSH terms

  • Algorithms
  • Child
  • Exercise*
  • Female
  • Humans
  • Income
  • Pregnancy
  • Pregnant Women*
  • Surveys and Questionnaires