A distributional approach to obtain adjusted comparisons of proportions of a population at risk

Emerg Themes Epidemiol. 2016 Jun 7:13:8. doi: 10.1186/s12982-016-0050-2. eCollection 2016.

Abstract

Background: Dichotomisation of continuous data has statistical drawbacks such as loss of power but may be useful in epidemiological research to define high risk individuals.

Methods: We extend a methodology for the presentation of comparison of proportions derived from a comparison of means for a continuous outcome to reflect the relationship between a continuous outcome and covariates in a linear (mixed) model without losing statistical power. The so called "distributional method" is described and using perinatal data for illustration, results from the distributional method are compared to those of logistic regression and to quantile regression for three different outcomes.

Results: Estimates obtained using the distributional method for the comparison of proportions are consistently more precise than those obtained using logistic regression. For one of the three outcomes the estimates obtained from the distributional method and from logistic regression disagreed highlighting that the relationships between outcome and covariate differ conceptually between the two models.

Conclusion: When an outcome follows the required condition of distribution shift between exposure groups, the results of a linear regression model can be followed by the corresponding comparison of proportions at risk. This dual approach provides more precise estimates than logistic regression thus avoiding the drawback of the usual dichotomisation of continuous outcomes.

Keywords: Dichotomisation; Linear model; Logistic model; Quantile regression.