Bayesian3 Active Learning for the Gaussian Process Emulator Using Information Theory

Sergey Oladyshkin; Farid Mohammadi; Ilja Kroeker; Wolfgang Nowak

doi:10.3390/e22080890

Bayesian³ Active Learning for the Gaussian Process Emulator Using Information Theory

Entropy (Basel). 2020 Aug 13;22(8):890. doi: 10.3390/e22080890.

Authors

Sergey Oladyshkin¹, Farid Mohammadi², Ilja Kroeker¹, Wolfgang Nowak¹

Affiliations

¹ Department of Stochastic Simulation and Safety Research for Hydrosystems, Institute for Modelling Hydraulic and Environmental Systems/SC SimTech, University of Stuttgart, Pfaffenwaldring 5a, 70569 Stuttgart, Germany.
² Department of Hydromechanics and Modelling of Hydrosystems, Institute for Modelling Hydraulic and Environmental Systems/SC SimTech, University of Stuttgart, Pfaffenwaldring 61, 70569 Stuttgart, Germany.

Abstract

Gaussian process emulators (GPE) are a machine learning approach that replicates computational demanding models using training runs of that model. Constructing such a surrogate is very challenging and, in the context of Bayesian inference, the training runs should be well invested. The current paper offers a fully Bayesian view on GPEs for Bayesian inference accompanied by Bayesian active learning (BAL). We introduce three BAL strategies that adaptively identify training sets for the GPE using information-theoretic arguments. The first strategy relies on Bayesian model evidence that indicates the GPE's quality of matching the measurement data, the second strategy is based on relative entropy that indicates the relative information gain for the GPE, and the third is founded on information entropy that indicates the missing information in the GPE. We illustrate the performance of our three strategies using analytical- and carbon-dioxide benchmarks. The paper shows evidence of convergence against a reference solution and demonstrates quantification of post-calibration uncertainty by comparing the introduced three strategies. We conclude that Bayesian model evidence-based and relative entropy-based strategies outperform the entropy-based strategy because the latter can be misleading during the BAL. The relative entropy-based strategy demonstrates superior performance to the Bayesian model evidence-based strategy.

Keywords: Bayesian inference; Bayesian model evidence; Gaussian process emulator; Kullback–Leibler divergence; active learning; information entropy; machine learning; relative entropy.

Abstract

Grants and funding