Statistical Inference in the Wright-Fisher Model Using Allele Frequency Data

Syst Biol. 2017 Jan 1;66(1):e30-e46. doi: 10.1093/sysbio/syw056.

Abstract

The Wright–Fisher model provides an elegant mathematical framework for understanding allele frequency data. In particular, the model can be used to infer the demographic history of species and identify loci under selection. A crucial quantity for inference under the Wright–Fisher model is the distribution of allele frequencies (DAF). Despite the apparent simplicity of the model, the calculation of the DAF is challenging. We review and discuss strategies for approximating the DAF, and how these are used in methods that perform inference from allele frequency data. Various evolutionary forces can be incorporated in the Wright–Fisher model, and we consider these in turn. We begin our review with the basic bi-allelic Wright–Fisher model where random genetic drift is the only evolutionary force. We then consider mutation, migration, and selection. In particular, we compare diffusion-based and moment-based methods in terms of accuracy, computational efficiency, and analytical tractability. We conclude with a brief overview of the multi-allelic process with a general mutation model.

Keywords: Allele frequency; diffusion; inference; moments; selection; Wright–Fisher.

Publication types

  • Review
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Evolution, Molecular
  • Gene Frequency / genetics*
  • Genetic Drift
  • Models, Genetic*
  • Mutation