An estimator of first coalescent time reveals selection on young variants and large heterogeneity in rare allele ages among human populations

PLoS Genet. 2019 Aug 19;15(8):e1008340. doi: 10.1371/journal.pgen.1008340. eCollection 2019 Aug.

Abstract

Allele age has long been a focus of population genetic research, primarily because it can be an important clue to the fitness effects of an allele. By virtue of their effects on fitness, alleles under directional selection are expected to be younger than neutral alleles of the same frequency. We developed a new coalescent-based estimator of a close proxy for allele age, the time when a copy of an allele first shares common ancestry with other chromosomes in a sample not carrying that allele. The estimator performs well, including for the very rarest of alleles that occur just once in a sample, with a bias that is typically negative. The estimator is mostly insensitive to population demography and to factors that can arise in population genomic pipelines, including the statistical phasing of chromosomes. Applications to 1000 Genomes Data and UK10K genome data confirm predictions that singleton alleles that alter proteins are significantly younger than those that do not, with a greater difference in the larger UK10K dataset, as expected. The 1000 Genomes populations varied markedly in their distributions for singleton allele ages, suggesting that these distributions can be used to inform models of demographic history, including recent events that are only revealed by their impacts on the ages of very rare alleles.

Publication types

  • Evaluation Study
  • Research Support, N.I.H., Extramural

MeSH terms

  • Alleles
  • Biological Variation, Population / genetics
  • Datasets as Topic
  • Evolution, Molecular*
  • Female
  • Gene Frequency
  • Genetic Heterogeneity
  • Genetics, Population / methods*
  • Genome, Human*
  • Humans
  • Male
  • Models, Genetic*
  • Selection, Genetic*
  • Time Factors