Expected Shannon Entropy and Shannon Differentiation between Subpopulations for Neutral Genes under the Finite Island Model

PLoS One. 2015 Jun 11;10(6):e0125471. doi: 10.1371/journal.pone.0125471. eCollection 2015.

Abstract

Shannon entropy H and related measures are increasingly used in molecular ecology and population genetics because (1) unlike measures based on heterozygosity or allele number, these measures weigh alleles in proportion to their population fraction, thus capturing a previously ignored aspect of allele frequency distributions that may be important in many applications; (2) these measures connect directly to the rich predictive mathematics of information theory; (3) Shannon entropy is completely additive and has an explicitly hierarchical nature; and (4) Shannon entropy-based differentiation measures obey strong monotonicity properties that heterozygosity-based measures lack. We derive simple new expressions for the expected values of the Shannon entropy of the equilibrium allele distribution at a neutral locus in a single isolated population under two models of mutation: the infinite allele model and the stepwise mutation model. Surprisingly, this complex stochastic system for each model has an entropy expressible as a simple combination of well-known mathematical functions. Moreover, entropy- and heterozygosity-based measures for each model are linked by simple relationships that are shown by simulations to be approximately valid even far from equilibrium. We also identify a bridge between the two models of mutation. We apply our approach to subdivided populations which follow the finite island model, obtaining the Shannon entropy of the equilibrium allele distributions of the subpopulations and of the total population. We also derive the expected mutual information and normalized mutual information ("Shannon differentiation") between subpopulations at equilibrium, and identify the model parameters that determine them. We apply our measures to data from the common starling (Sturnus vulgaris) in Australia. Our measures provide a test for neutrality that is robust to violations of equilibrium assumptions, as verified on real-world data from starlings.
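
The quantities named in the abstract can be illustrated with a minimal Python sketch that computes them from observed allele frequencies. It does not reproduce the paper's expected-value formulas under the infinite allele, stepwise mutation, or finite island models. All function names are hypothetical, natural logarithms are assumed, and the normalization used for "Shannon differentiation" (mutual information divided by the entropy of the subpopulation weights) is an assumption about the definition rather than something stated in the abstract.

```python
import math


def shannon_entropy(freqs):
    """Shannon entropy (natural log) of a frequency vector.

    Alleles with zero frequency contribute nothing to the sum.
    """
    return -sum(p * math.log(p) for p in freqs if p > 0)


def heterozygosity(freqs):
    """Expected heterozygosity: one minus the sum of squared frequencies."""
    return 1.0 - sum(p * p for p in freqs)


def mutual_information(subpops, weights=None):
    """Mutual information between allele identity and subpopulation identity.

    subpops: list of allele-frequency vectors, one per subpopulation,
             all listing the same alleles in the same order.
    weights: relative subpopulation sizes (assumed equal if omitted).
    Computed as the entropy of the pooled population minus the weighted
    mean entropy within subpopulations.
    """
    k = len(subpops)
    if weights is None:
        weights = [1.0 / k] * k
    n_alleles = len(subpops[0])
    pooled = [
        sum(w * sp[i] for w, sp in zip(weights, subpops))
        for i in range(n_alleles)
    ]
    h_total = shannon_entropy(pooled)
    h_within = sum(w * shannon_entropy(sp) for w, sp in zip(weights, subpops))
    return h_total - h_within


def shannon_differentiation(subpops, weights=None):
    """Normalized mutual information (assumed normalization, see lead-in).

    Divides the mutual information by the entropy of the weights, which
    equals ln(k) for k equally weighted subpopulations.
    """
    k = len(subpops)
    if weights is None:
        weights = [1.0 / k] * k
    return mutual_information(subpops, weights) / shannon_entropy(weights)
```

Under these assumptions, two subpopulations with identical allele frequencies give a differentiation of 0, and two equally weighted subpopulations that share no alleles give a differentiation of 1.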

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alleles
  • Animals
  • Avian Proteins / genetics*
  • Entropy
  • Finite Element Analysis
  • Heterozygote
  • Starlings / genetics*
  • Stochastic Processes

Substances

  • Avian Proteins

Grants and funding

Funding was provided by the Mathematics Research Center (of the Taiwan Ministry of Science and Technology), the Population Biology Foundation, and the Ministry of Science and Technology, Taiwan, Contract 100-2118-M007-006-MY3 (http://www.most.gov.tw/). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.