Likelihood-Free Inference in High-Dimensional Models

Genetics. 2016 Jun;203(2):893-904. doi: 10.1534/genetics.116.187567. Epub 2016 Apr 6.

Abstract

Methods that bypass analytical evaluations of the likelihood function have become an indispensable tool for statistical inference in many fields of science. These so-called likelihood-free methods rely on accepting and rejecting simulations based on summary statistics, which limits them to low-dimensional models for which the value of the likelihood is large enough to result in manageable acceptance rates. To get around these issues, we introduce a novel, likelihood-free Markov chain Monte Carlo (MCMC) method combining two key innovations: updating only one parameter per iteration and accepting or rejecting this update based on subsets of statistics approximately sufficient for this parameter. This increases acceptance rates dramatically, rendering this approach suitable even for models of very high dimensionality. We further derive that for linear models, a one-dimensional combination of statistics per parameter is sufficient and can be found empirically with simulations. Finally, we demonstrate that our method readily scales to models of very high dimensionality, using toy models as well as by jointly inferring the effective population size, the distribution of fitness effects (DFE) of segregating mutations, and selection coefficients for each locus from data of a recent experiment on the evolution of drug resistance in influenza.

Keywords: Markov chain Monte Carlo; approximate Bayesian computation; distribution of fitness effects; hierarchical models; high dimensions.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Drug Resistance, Viral / genetics*
  • Genetic Fitness
  • Genetic Loci
  • Models, Genetic*
  • Mutation
  • Orthomyxoviridae / drug effects
  • Orthomyxoviridae / genetics
  • Probability
  • Selection, Genetic