A Computational Approach for Modeling the Allele Frequency Spectrum of Populations with Arbitrarily Varying Size

Genomics Proteomics Bioinformatics. 2019 Dec;17(6):635-644. doi: 10.1016/j.gpb.2019.06.002. Epub 2020 Mar 13.

Abstract

The allele frequency spectrum (AFS), or site frequency spectrum, is commonly used to summarize the genomic polymorphism pattern of a sample and is informative for inferring population history and detecting natural selection. In 2013, Chen and Chen developed a method for analytically deriving the AFS for populations with temporally varying size through the coalescence time-scaling function. However, their approach is applicable only to population history scenarios in which the analytical form of the time-scaling function is tractable. In this paper, we propose a computational approach that extends the method to populations with arbitrarily complex size changes by numerically approximating the time-scaling function. We demonstrate the performance of the approach by constructing the AFS for two population history scenarios, the logistic growth model and the Gompertz growth model, for which the AFS is unavailable with existing approaches. Software implementing the algorithm can be downloaded at http://chenlab.big.ac.cn/software/.
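To illustrate what numerically approximating the time-scaling function can look like, here is a minimal Python sketch. It is not the authors' software. It assumes the standard coalescent definition Λ(t) = ∫₀ᵗ N(0)/N(s) ds, which the abstract does not spell out, and it uses a plain trapezoidal rule. The function names (`time_scaling`, `logistic_relative_size`) and the logistic parameterization, with time measured backward from the present, are hypothetical choices made only for this example.

```python
import math


def logistic_relative_size(t, ancestral_ratio=0.1, rate=1.0, midpoint=5.0):
    """Hypothetical relative size nu(t) = N(t)/N(0), with t measured backward in time.

    Assumed form: nu(0) = 1 at present, declining toward `ancestral_ratio`
    in the distant past along a logistic curve.
    """
    scale = 1.0 + math.exp(-rate * midpoint)  # normalizes nu(0) to exactly 1
    return ancestral_ratio + (1.0 - ancestral_ratio) * scale / (
        1.0 + math.exp(rate * (t - midpoint))
    )


def time_scaling(nu, t_max, steps=1000):
    """Approximate Lambda(t) = integral_0^t ds / nu(s) on a uniform grid.

    Uses the cumulative trapezoidal rule; returns (grid, lam) where
    lam[i] approximates Lambda(grid[i]).
    """
    h = t_max / steps
    grid = [i * h for i in range(steps + 1)]
    integrand = [1.0 / nu(t) for t in grid]
    lam = [0.0]
    for i in range(steps):
        lam.append(lam[-1] + 0.5 * h * (integrand[i] + integrand[i + 1]))
    return grid, lam


if __name__ == "__main__":
    grid, lam = time_scaling(logistic_relative_size, t_max=20.0)
    print(f"Lambda({grid[-1]:.1f}) is approximately {lam[-1]:.4f}")
```

Any callable returning a strictly positive relative size can be passed as `nu`, so a model without a closed-form integral is handled the same way as one with it. For a constant-size population the sketch reduces to Λ(t) = t, which gives a quick sanity check.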

Keywords: Allele frequency spectrum; Coalescent; Complex demography; Population genetic inference; Population history.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Computational Biology / methods*
  • Gene Frequency*
  • Genetics, Population / methods*
  • Humans
  • Models, Statistical
  • Polymorphism, Genetic
  • Sample Size