Exact Inference for Hardy-Weinberg Proportions with Missing Genotypes: Single and Multiple Imputation

G3 (Bethesda). 2015 Sep 15;5(11):2365-73. doi: 10.1534/g3.115.022111.

Abstract

This paper addresses the issue of exact-test based statistical inference for Hardy-Weinberg equilibrium in the presence of missing genotype data. Missing genotypes often are discarded when markers are tested for Hardy-Weinberg equilibrium, which can lead to bias in the statistical inference about equilibrium. Single and multiple imputation can improve inference on equilibrium. We develop tests for equilibrium in the presence of missingness by using both inbreeding coefficients (or, equivalently, χ(2) statistics) and exact p-values. The analysis of a set of markers with a high missing rate from the GENEVA project on prematurity shows that exact inference on equilibrium can be altered considerably when missingness is taken into account. For markers with a high missing rate (>5%), we found that both single and multiple imputation tend to diminish evidence for Hardy-Weinberg disequilibrium. Depending on the imputation method used, 6-13% of the test results changed qualitatively at the 5% level.

Keywords: Hardy−Weinberg equilibrium; exact test; imputation; missing data.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Data Accuracy
  • Genetics, Population / methods
  • Inbreeding
  • Linkage Disequilibrium*
  • Models, Genetic*