Accounting for spatial sampling patterns in Bayesian phylogeography

Proc Natl Acad Sci U S A. 2021 Dec 28;118(52):e2105273118. doi: 10.1073/pnas.2105273118.

Abstract

Statistical phylogeography provides useful tools to characterize and quantify the spread of organisms during the course of evolution. Analyzing georeferenced genetic data often relies on the assumption that samples are preferentially collected in densely populated areas of the habitat. Deviation from this assumption negatively impacts the inference of the spatial and demographic dynamics. This issue is pervasive in phylogeography. It affects analyses that approximate the habitat as a set of discrete demes as well as those that treat it as a continuum. The present study introduces a Bayesian modeling approach that explicitly accommodates for spatial sampling strategies. An original inference technique, based on recent advances in statistical computing, is then described that is most suited to modeling data where sequences are preferentially collected at certain locations, independently of the outcome of the evolutionary process. The analysis of georeferenced genetic sequences from the West Nile virus in North America along with simulated data shows how assumptions about spatial sampling may impact our understanding of the forces shaping biodiversity across time and space.

Keywords: Bayesian inference; West Nile virus; phylogeography; sampling design; statistical modeling.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Bayes Theorem
  • Ecosystem
  • Evolution, Molecular
  • Humans
  • Models, Statistical*
  • North America
  • Phylogeography / methods*
  • Population Dynamics*
  • Spatial Analysis
  • West Nile Fever / epidemiology
  • West Nile Fever / virology
  • West Nile virus / genetics