Amount of information needed for model choice in Approximate Bayesian Computation

PLoS One. 2014 Jun 24;9(6):e99581. doi: 10.1371/journal.pone.0099581. eCollection 2014.

Abstract

Approximate Bayesian Computation (ABC) has become a popular technique in evolutionary genetics for elucidating population structure and history due to its flexibility. The statistical inference framework has benefited from significant progress in recent years. In population genetics, however, its outcome depends heavily on the amount of information in the dataset, whether that be the level of genetic variation or the number of samples and loci. Here we look at the power to reject a simple constant population size coalescent model in favor of a bottleneck model in datasets of varying quality. Not only is this power dependent on the number of samples and loci, but it also depends strongly on the level of nucleotide diversity in the observed dataset. Whilst overall model choice in an ABC setting is fairly powerful and quite conservative with regard to false positives, detecting weaker bottlenecks is problematic in smaller or less genetically diverse datasets and limits the inferences possible in non-model organism where the amount of information regarding the two models is often limited. Our results show it is important to consider these limitations when performing an ABC analysis and that studies should perform simulations based on the size and nature of the dataset in order to fully assess the power of the study.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bayes Theorem
  • Computational Biology / methods*
  • Genetic Variation
  • Models, Statistical*
  • Population Density
  • Principal Component Analysis

Grants and funding

This work was supported by the European Community's Seventh Framework Programme (FP7/20072013), under grant agreement 211868 (Project Noveltree) and the Eranet Biodiversa LINKTREE and TIPTREE projects. M. Siol and S. De Mita were funded by Agropolis Fondation. The research trip of M. Stocks to IRD was funded by the EBC graduate school on Genomes and Phenotypes at Uppsala University. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.