Genome-Wide Association Study Statistical Models: A Review

Methods Mol Biol. 2022:2481:43-62. doi: 10.1007/978-1-0716-2237-7_4.

Abstract

Statistical models are at the core of the genome-wide association study (GWAS). In this chapter, we provide an overview of single- and multilocus statistical models, Bayesian, and machine learning approaches for association studies in plants. These models are discussed based on their basic methodology, cofactors adjustment accounted for, statistical power and computational efficiency. New statistical models and machine learning algorithms are both showing improved performance in detecting missed signals, rare mutations and prioritizing causal genetic variants; nevertheless, further optimization and validation studies are required to maximize the power of GWAS.

Keywords: Computational efficiency; GWAS; Significance threshold; Statistical models; Statistical power.

Publication types

  • Review

MeSH terms

  • Algorithms
  • Bayes Theorem
  • Genome-Wide Association Study* / methods
  • Machine Learning
  • Models, Statistical*