Geostatistical estimation and prediction for censored responses

Spat Stat. 2018 Mar:23:109-123. doi: 10.1016/j.spasta.2017.12.001. Epub 2017 Dec 12.

Abstract

Spatially-referenced geostatistical responses that are collected in environmental sciences research are often subject to detection limits, where the measures are not fully quantifiable. This leads to censoring (left, right, interval, etc), and various ad hoc statistical methods (such as choosing arbitrary detection limits, or data augmentation) are routinely employed during subsequent statistical analysis for inference and prediction. However, inference may be imprecise and sensitive to the assumptions and approximations involved in those arbitrary choices. To circumvent this, we propose an exact maximum likelihood estimation framework of the fixed effects and variance components and related prediction via a novel application of the Stochastic Approximation of the Expectation Maximization (SAEM) algorithm, allowing for easy and elegant estimation of model parameters under censoring. Both simulation studies and application to a real dataset on arsenic concentration collected by the Michigan Department of Environmental Quality demonstrate the advantages of our method over the available naïve techniques in terms of finite sample properties of the estimates, prediction, and robustness. The proposed methods can be implemented using the R package CensSpatial.

Keywords: Censored geostatistical data; Kriging; Limit of detection (LOD); SAEM algorithm.