Exact Distribution of Linkage Disequilibrium in the Presence of Mutation, Selection, or Minor Allele Frequency Filtering

Front Genet. 2020 Apr 21:11:362. doi: 10.3389/fgene.2020.00362. eCollection 2020.

Abstract

Linkage disequilibrium (LD), often expressed in terms of the squared correlation (r 2) between allelic values at two loci, is an important concept in many branches of genetics and genomics. Genetic drift and recombination have opposite effects on LD, and thus r 2 will keep changing until the effects of these two forces are counterbalanced. Several approximations have been used to determine the expected value of r 2 at equilibrium in the presence or absence of mutation. In this paper, we propose a probability-based approach to compute the exact distribution of allele frequencies at two loci in a finite population at any generation t conditional on the distribution at generation t - 1. As r 2 is a function of this distribution of allele frequencies, this approach can be used to examine the distribution of r 2 over generations as it approaches equilibrium. The exact distribution of LD from our method is used to describe, quantify, and compare LD at different equilibria, including equilibrium in the absence or presence of mutation, selection, and filtering by minor allele frequency. We also propose a deterministic formula for expected LD in the presence of mutation at equilibrium based on the exact distribution of LD.

Keywords: effective population size; linkage disequilibrium; minor allele frequency filtering; mutation rate; selection.