Decomposing Pearson's χ2 test: A linear regression and its departure from linearity

Ann Hum Genet. 2018 Sep;82(5):318-324. doi: 10.1111/ahg.12257. Epub 2018 May 31.

Abstract

In case-control genetic association studies, a standard practice is to perform the Cochran-Armitage (CA) trend test under the assumption of the additive model because of its robustness. We could even identify situations in which it outperformed the analysis model consistent with the underlying inheritance mode. In this article, we analytically reveal the statistical basis that leads to the phenomenon. By elucidating the origin of the CA trend test as a linear regression model, we decompose Pearson's χ2 -test statistic into two components-one is the CA trend test statistic that measures the goodness of fit of the linear regression model, and the other measures the discrepancy between data and the linear regression model. Under this framework, we show that the additive coding scheme, as well as the multiplicative coding scheme, increases the coefficient of determination of the regression model by increasing the spread of data points. We also obtain the conditions under which the CA trend test statistic equals the MAX statistic and Pearson's χ2 -test statistic.

Keywords: Pearson's chi-squared test; linear regression; ordinary least squares; trend test.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Case-Control Studies
  • Chi-Square Distribution*
  • Genetic Association Studies
  • Humans
  • Linear Models*
  • Models, Genetic*