Inventory statistics meet big data: complications for estimating numbers of species

PeerJ. 2020 May 13:8:e8872. doi: 10.7717/peerj.8872. eCollection 2020.

Abstract

We point out complications inherent in biodiversity inventory metrics when applied to large-scale datasets. The number of units of inventory effort (e.g., days of inventory effort) in which a species is detected saturates, such that crucial numbers of detections of rare species approach zero. Any rare errors can then come to dominate species richness estimates, creating upward biases in estimates of species numbers. We document the problem via simulations of sampling from virtual biotas, illustrate its potential using a large empirical dataset (bird records from Cape May, NJ, USA), and outline the circumstances under which these problems may be expected to emerge.

Keywords: Chao estimator; Data quality; Species richness; Virtual biotas.

Grants and funding

The authors received no funding for this work.