Discretizing continuous variables in nutrition and obesity research: a practice that needs to be cut short

Nutr Diabetes. 2023 Nov 8;13(1):20. doi: 10.1038/s41387-023-00248-0.

Abstract

Background/objectives: Nutrition and obesity researchers often dichotomize or discretize continuous independent variables to conduct an analysis of variance to examine group differences. We describe consequences associated with dichotomizing and discretizing continuous variables using two cross-sectional studies related to nutrition.

Subjects/methods: Study 1 investigated the effects of health literacy and nutrition knowledge on nutrition label accuracy (n = 612). Study 2 investigated the effects of cognitive restraint and BMI on fruit and vegetable (F/V) intake (n = 586). We compare analytic approaches where continuous independent variables were either discretized/dichotomized or analyzed as continuous variables.

Results: In Study 1, dichotomization of health literacy and nutrition knowledge for 2 × 2 ANOVA revealed health literacy had an effect on nutrition label accuracy. Nutrition knowledge has an effect on nutrition label accuracy, but the health literacy by nutrition knowledge interaction was not significant. When analyzed using regression, the nutrition knowledge effect was significant. The simple effect of health literacy was also significant when health literacy equals zero. Finally, the quadratic effect of health literacy was negative and significant. In Study 2, dichotomization and discretization of cognitive restraint and BMI were used for three ANOVAs, which discretized BMI in three ways. For all ANOVAs, the BMI main effect for predicting fruit and vegetable intake was significant, the interaction between BMI and cognitive restraint was non-significant, and cognitive restraint was only significant when both variables were dichotomized. When analyzed using regression, the continuous mean-centered variables, and their interaction each significantly predicted F/V intake.

Conclusions: Dichotomizing continuous independent variables resulted in distortions of effect sizes across studies, an inability to assess the quadratic effect of health literacy, and an inability to detect the moderating effect of BMI. We discourage researchers from dichotomizing and discretizing continuous independent variables and instead use multiple regression to examine relationships between continuous independent and dependent variables.

MeSH terms

  • Cross-Sectional Studies
  • Eating*
  • Humans
  • Nutritional Status*
  • Obesity