Mutation Rate Variation is a Primary Determinant of the Distribution of Allele Frequencies in Humans

PLoS Genet. 2016 Dec 15;12(12):e1006489. doi: 10.1371/journal.pgen.1006489. eCollection 2016 Dec.

Abstract

The site frequency spectrum (SFS) has long been used to study demographic history and natural selection. Here, we extend this summary by examining the SFS conditional on the alleles found at the same site in other species. We refer to this extension as the "phylogenetically-conditioned SFS" or cSFS. Using recent large-sample data from the Exome Aggregation Consortium (ExAC), combined with primate genome sequences, we find that human variants that occurred independently in closely related primate lineages are at higher frequencies in humans than variants with parallel substitutions in more distant primates. We show that this effect is largely due to sites with elevated mutation rates causing significant departures from the widely-used infinite sites mutation model. Our analysis also suggests substantial variation in mutation rates even among mutations involving the same nucleotide changes. In summary, we show that variable mutation rates are key determinants of the SFS in humans.

MeSH terms

  • Alleles
  • Amino Acid Substitution / genetics
  • Animals
  • Base Sequence
  • Chromosome Mapping
  • DNA Methylation / genetics
  • Exome / genetics
  • Gene Frequency / genetics
  • Genetics, Population*
  • Humans
  • Mutation
  • Mutation Rate*
  • Phylogeny*
  • Pongo / genetics
  • Primates / genetics
  • Selection, Genetic / genetics*