Novel three-step pseudo-absence selection technique for improved species distribution modelling

PLoS One. 2013 Aug 13;8(8):e71218. doi: 10.1371/journal.pone.0071218. eCollection 2013.

Abstract

Pseudo-absence selection for spatial distribution models (SDMs) is the subject of ongoing investigation. Numerous techniques continue to be developed, and reports of their effectiveness vary. Because the quality of presence and absence data is key for acceptable accuracy of correlative SDM predictions, determining an appropriate method to characterise pseudo-absences for SDM's is vital. The main methods that are currently used to generate pseudo-absence points are: 1) randomly generated pseudo-absence locations from background data; 2) pseudo-absence locations generated within a delimited geographical distance from recorded presence points; and 3) pseudo-absence locations selected in areas that are environmentally dissimilar from presence points. There is a need for a method that considers both geographical extent and environmental requirements to produce pseudo-absence points that are spatially and ecologically balanced. We use a novel three-step approach that satisfies both spatial and ecological reasons why the target species is likely to find a particular geo-location unsuitable. Step 1 comprises establishing a geographical extent around species presence points from which pseudo-absence points are selected based on analyses of environmental variable importance at different distances. This step gives an ecologically meaningful explanation to the spatial range of background data, as opposed to using an arbitrary radius. Step 2 determines locations that are environmentally dissimilar to the presence points within the distance specified in step one. Step 3 performs K-means clustering to reduce the number of potential pseudo-absences to the desired set by taking the centroids of clusters in the most environmentally dissimilar class identified in step 2. By considering spatial, ecological and environmental aspects, the three-step method identifies appropriate pseudo-absence points for correlative SDMs. We illustrate this method by predicting the New Zealand potential distribution of the Asian tiger mosquito (Aedes albopictus) and the Western corn rootworm (Diabrotica virgifera virgifera).

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Culicidae
  • Ecosystem
  • Environment
  • Models, Statistical*
  • New Zealand
  • Population Dynamics
  • Reproducibility of Results
  • Spatial Analysis*

Grants and funding

This research was supported by a PhD and writing scholarship award to SS in Lincoln University by the Bio-Protection Research Centre. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.