Robust Inference from Conditional Logistic Regression Applied to Movement and Habitat Selection Analysis

PLoS One. 2017 Jan 12;12(1):e0169779. doi: 10.1371/journal.pone.0169779. eCollection 2017.

Abstract

Conditional logistic regression (CLR) is widely used to analyze habitat selection and movement of animals when resource availability changes over space and time. Observations used for these analyses are typically autocorrelated, which biases model-based variance estimation of CLR parameters. This bias can be corrected using generalized estimating equations (GEE), an approach that requires partitioning the data into independent clusters. Here we establish the link between clustering rules in GEE and their effectiveness to remove statistical biases in variance estimation of CLR parameters. The current lack of guidelines is such that broad variation in clustering rules can be found among studies (e.g., 14-450 clusters) with unknown consequences on the robustness of statistical inference. We simulated datasets reflecting conditions typical of field studies. Longitudinal data were generated based on several parameters of habitat selection with varying strength of autocorrelation and some individuals having more observations than others. We then evaluated how changing the number of clusters impacted the effectiveness of variance estimators. Simulations revealed that 30 clusters were sufficient to get unbiased and relatively precise estimates of variance of parameter estimates. The use of destructive sampling to increase the number of independent clusters was successful at removing statistical bias, but only when observations were temporally autocorrelated and the strength of inter-individual heterogeneity was weak. GEE also provided robust estimates of variance for different magnitudes of unbalanced datasets. Our simulations demonstrate that GEE should be estimated by assigning each individual to a cluster when at least 30 animals are followed, or by using destructive sampling for studies with fewer individuals having intermediate level of behavioural plasticity in selection and temporally autocorrelated observations. The simulations provide valuable information to build reliable habitat selection and movement models that allow for robustness of statistical inference without removing excessive amounts of ecological information.

MeSH terms

  • Animal Migration / physiology*
  • Animals
  • Ecosystem*
  • Logistic Models
  • Models, Theoretical*

Grants and funding

Funding was provided by the Natural Sciences and Engineering Research Council of Canada (M-CP) http://www.nserc-crsng.gc.ca and the Fonds de recherche du Québec - Nature et Technologies (TD DF) http://www.frqnt.gouv.qc.ca. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.