Robust estimation of risks from small samples

Philos Trans A Math Phys Eng Sci. 2017 Aug 13;375(2100):20160299. doi: 10.1098/rsta.2016.0299.

Abstract

Data-driven risk analysis involves the inference of probability distributions from measured or simulated data. In the case of a highly reliable system, such as the electricity grid, the amount of relevant data is often exceedingly limited, but the impact of estimation errors may be very large. This paper presents a robust non-parametric Bayesian method to infer possible underlying distributions. The method obtains rigorous error bounds even for small samples taken from ill-behaved distributions. The approach taken has a natural interpretation in terms of the intervals between ordered observations, where allocation of probability mass across intervals is well specified, but the location of that mass within each interval is unconstrained. This formulation gives rise to a straightforward computational resampling method: Bayesian interval sampling. In a comparison with common alternative approaches, it is shown to satisfy strict error bounds even for ill-behaved distributions.This article is part of the themed issue 'Energy management: flexibility, risk and optimization'.

Keywords: Bayesian inference; Dirichlet process; imprecise probabilities; non-parametric methods; rare event analysis; resampling methods.