Estimating the variance of Shannon entropy

Phys Rev E. 2021 Aug;104(2-1):024220. doi: 10.1103/PhysRevE.104.024220.

Abstract

The statistical analysis of data stemming from dynamical systems, including, but not limited to, time series, routinely relies on the estimation of information theoretical quantities, most notably Shannon entropy. For this purpose, possibly the most widespread tool is the so-called plug-in estimator, whose statistical properties in terms of bias and variance have been investigated since the first decade after the publication of Shannon's seminal works. In the case of an underlying multinomial distribution, while the bias can be evaluated from the support and the data set size, the variance is far more elusive. The aim of the present work is to investigate, in the multinomial case, the statistical properties of an estimator of a parameter that describes the variance of the plug-in estimator of Shannon entropy. We then exactly determine the probability distributions that maximize that parameter. The results presented here allow one to set upper limits on the uncertainty of entropy assessments under the hypothesis of a memoryless underlying stochastic process.
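As background to the quantities discussed above, the following is a minimal sketch (not the paper's own estimator) of the plug-in entropy estimate for multinomial counts together with the standard first-order, delta-method approximation of its variance, Var[H] ≈ (1/N)(Σ p_i (ln p_i)² − H²); the function and variable names are illustrative only.

```python
import numpy as np

def plugin_entropy(counts):
    """Plug-in (maximum-likelihood) estimate of Shannon entropy, in nats."""
    counts = np.asarray(counts, dtype=float)
    n = counts.sum()
    p = counts[counts > 0] / n          # empirical probabilities; empty bins dropped
    return -np.sum(p * np.log(p))

def plugin_entropy_variance(counts):
    """First-order approximation of the plug-in estimator's variance under
    i.i.d. multinomial sampling, evaluated at the empirical probabilities:
    (1/N) * ( sum_i p_i (ln p_i)^2 - H^2 )."""
    counts = np.asarray(counts, dtype=float)
    n = counts.sum()
    p = counts[counts > 0] / n
    h = -np.sum(p * np.log(p))
    return (np.sum(p * np.log(p) ** 2) - h ** 2) / n

# Example: counts drawn from a memoryless (i.i.d.) source over four symbols.
rng = np.random.default_rng(0)
counts = rng.multinomial(1000, [0.4, 0.3, 0.2, 0.1])
print(plugin_entropy(counts), plugin_entropy_variance(counts))
```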