Using machine learning to model nontraditional spatial dependence in occupancy data

Narmadha M Mohankumar; Trevor J Hefley

doi:10.1002/ecy.3563

Ecology. 2022 Feb;103(2):e03563. doi: 10.1002/ecy.3563. Epub 2021 Dec 22.

Authors

Narmadha M Mohankumar¹, Trevor J Hefley¹

Affiliation

¹ Department of Statistics, Kansas State University, Manhattan, Kansas, USA.

PMID: 34694631
DOI: 10.1002/ecy.3563

Abstract

Spatial models for occupancy data are used to estimate and map the true presence of a species, which may depend on biotic and abiotic factors as well as spatial autocorrelation. Traditionally, researchers have accounted for spatial autocorrelation in occupancy data by using a correlated normally distributed site-level random effect, which might be incapable of modeling nontraditional spatial dependence such as discontinuities and abrupt transitions. Machine learning approaches have the potential to model nontraditional spatial dependence, but these approaches do not account for observer errors such as false absences. By combining the flexibility of Bayesian hierarchical modeling and machine learning approaches, we present a general framework to model occupancy data that accounts for both traditional and nontraditional spatial dependence as well as false absences. We demonstrate our framework using six synthetic occupancy data sets and two real data sets. Our results demonstrate how to model both traditional and nontraditional spatial dependence in occupancy data, which enables a broader class of spatial occupancy models that can be used to improve predictive accuracy and model adequacy.
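
To illustrate what "false absences" means here, the following is a minimal Python sketch of a non-spatial zero-inflated binomial occupancy likelihood, the model class named in the keywords. It is not the authors' framework: it omits the spatial dependence, the machine learning component, and the Bayesian hierarchy. The function names, the parameter values (psi = 0.6, p = 0.3, four visits, 2000 sites), and the grid-search fit are all assumptions chosen for illustration.

```python
import math
import random


def zib_pmf(y, n_visits, psi, p):
    """P(y detections in n_visits) under a zero-inflated binomial model.

    psi: probability a site is truly occupied (assumed constant across sites).
    p:   per-visit detection probability at an occupied site (assumed constant).
    """
    binom = math.comb(n_visits, y) * p**y * (1.0 - p) ** (n_visits - y)
    # An unoccupied site can only ever produce zero detections.
    structural_zero = (1.0 - psi) if y == 0 else 0.0
    return psi * binom + structural_zero


def simulate_detections(n_sites, n_visits, psi, p, seed=0):
    """Simulate detection counts; occupied sites may still yield zero detections."""
    rng = random.Random(seed)
    counts = []
    for _ in range(n_sites):
        occupied = rng.random() < psi
        detections = sum(rng.random() < p for _ in range(n_visits)) if occupied else 0
        counts.append(detections)
    return counts


def log_likelihood(freq, n_visits, psi, p):
    """Log-likelihood from a frequency table: freq[y] = number of sites with y detections."""
    return sum(f * math.log(zib_pmf(y, n_visits, psi, p)) for y, f in enumerate(freq) if f)


n_visits = 4
counts = simulate_detections(n_sites=2000, n_visits=n_visits, psi=0.6, p=0.3)
freq = [counts.count(y) for y in range(n_visits + 1)]

# Naive estimate: treat every site with no detections as unoccupied.
naive_occupancy = sum(freq[1:]) / len(counts)

# Crude maximum-likelihood fit by grid search over (psi, p) in steps of 0.01.
grid = [i / 100 for i in range(1, 100)]
psi_hat, p_hat = max(
    ((psi, p) for psi in grid for p in grid),
    key=lambda pair: log_likelihood(freq, n_visits, pair[0], pair[1]),
)

print(f"naive occupancy: {naive_occupancy:.3f}")
print(f"ZIB estimate:    psi = {psi_hat:.2f}, p = {p_hat:.2f}")
```

Because some occupied sites produce no detections in any visit, the naive proportion of sites with at least one detection tends to understate occupancy, whereas the zero-inflated likelihood separates occupancy from detection. A spatial version along the lines the abstract describes would let the occupancy probability vary by site.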

Keywords: hierarchical Bayesian model; machine learning; occupancy model; presence-absence data; site occupancy; spatial dependence; zero-inflated binomial model.

Publication types

Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Bayes Theorem
Machine Learning*
Spatial Analysis