New approach for the identification of implausible values and outliers in longitudinal childhood anthropometric data

Ann Epidemiol. 2018 Mar;28(3):204-211.e3. doi: 10.1016/j.annepidem.2018.01.007. Epub 2018 Jan 11.

Abstract

Purpose: We aimed to demonstrate the use of jackknife residuals to take advantage of the longitudinal nature of available growth data in assessing potential biologically implausible values and outliers.

Methods: Artificial errors were induced in 5% of length, weight, and head circumference measurements, measured on 1211 participants from the Maternal Vitamin D for Infant Growth (MDIG) trial from birth to 24 months of age. Each child's sex- and age-standardized z-score or raw measurements were regressed as a function of age in child-specific models. Each error responsible for a biologically implausible decrease between a consecutive pair of measurements was identified based on the higher of the two absolute values of jackknife residuals in each pair. In further analyses, outliers were identified as those values beyond fixed cutoffs of the jackknife residuals (e.g., greater than +5 or less than -5 in primary analyses). Kappa, sensitivity, and specificity were calculated over 1000 simulations to assess the ability of the jackknife residual method to detect induced errors and to compare these methods with the use of conditional growth percentiles and conventional cross-sectional methods.

Results: Among the induced errors that resulted in a biologically implausible decrease in measurement between two consecutive values, the jackknife residual method identified the correct value in 84.3%-91.5% of these instances when applied to the sex- and age-standardized z-scores, with kappa values ranging from 0.685 to 0.795. Sensitivity and specificity of the jackknife method were higher than those of the conditional growth percentile method, but specificity was lower than for conventional cross-sectional methods.

Conclusions: Using jackknife residuals provides a simple method to identify biologically implausible values and outliers in longitudinal child growth data sets in which each child contributes at least 4 serial measurements.

Keywords: Biologically implausible values; Jackknife residuals; Longitudinal growth data; Outliers.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Child
  • Child Development / physiology*
  • Child, Preschool
  • Data Interpretation, Statistical*
  • Female
  • Growth Charts*
  • Humans
  • Infant
  • Infant, Newborn
  • Longitudinal Studies
  • Male
  • Models, Biological*
  • Weight Gain / physiology*