Mathematical theory of Bayesian statistics for unknown information source

Philos Trans A Math Phys Eng Sci. 2023 May 15;381(2247):20220151. doi: 10.1098/rsta.2022.0151. Epub 2023 Mar 27.

Abstract

In statistical inference, uncertainty is unknown and all models are wrong. That is to say, a person who constructs a statistical model and a prior distribution is simultaneously aware that both are fictional candidates. To study such cases, statistical measures have been constructed, such as cross validation, information criteria and marginal likelihood; however, their mathematical properties have not yet been completely clarified when statistical models are under- or over-parametrized. We introduce a framework of mathematical theory of Bayesian statistics for unknown uncertainty, which clarifies the general properties of cross validation, information criteria and marginal likelihood, even if the unknown data-generating process is unrealizable by a model or even if the posterior distribution cannot be approximated by any normal distribution. Hence it provides a helpful standpoint for a person who cannot believe in any specific model or prior. This paper consists of three parts. The first is a new result, whereas the second and third are well-known previous results accompanied by new experiments. We show that there exists a more precise estimator of the generalization loss than leave-one-out cross validation, that there exists a more accurate approximation of the marginal likelihood than the Bayesian information criterion, and that the optimal hyperparameters for the generalization loss and the marginal likelihood are different. This article is part of the theme issue 'Bayesian inference: challenges, perspectives, and prospects'.
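The abstract compares estimators of the generalization loss (leave-one-out cross validation, information criteria such as WAIC) with approximations of the marginal likelihood (BIC, and tempered-posterior constructions such as WBIC). As a purely illustrative sketch, not a reproduction of the paper's new estimators, the snippet below computes these standard quantities from exact posterior samples in a toy conjugate normal model; the model, prior, data-generating distribution, sample sizes and all variable names are assumptions chosen only so the example is self-contained and runnable.

    import numpy as np

    rng = np.random.default_rng(0)

    # Toy setting (assumed for illustration): data from N(0.3, 1),
    # model N(mu, 1), prior mu ~ N(0, 1), so the posterior is analytic.
    n = 100
    x = rng.normal(0.3, 1.0, size=n)

    def posterior_params(x, beta=1.0):
        """Mean/variance of the tempered posterior, proportional to prod_i p(x_i|mu)^beta * p(mu)."""
        n = len(x)
        var = 1.0 / (beta * n + 1.0)
        mean = beta * x.sum() * var
        return mean, var

    # Exact draws from the ordinary posterior (beta = 1).
    m, v = posterior_params(x)
    mu = rng.normal(m, np.sqrt(v), size=4000)

    # Per-point log-likelihoods under each posterior draw: shape (num_draws, n).
    loglik = -0.5 * np.log(2 * np.pi) - 0.5 * (x[None, :] - mu[:, None]) ** 2

    # Generalization-loss estimators (per data point, in nats).
    train_loss = -np.mean(np.log(np.mean(np.exp(loglik), axis=0)))   # Bayes training loss T_n
    waic = train_loss + np.mean(np.var(loglik, axis=0))              # WAIC = T_n + functional variance / n
    loo_cv = np.mean(np.log(np.mean(np.exp(-loglik), axis=0)))       # importance-sampling LOO-CV loss

    # Free-energy (minus log marginal likelihood) approximations; the exact
    # value is available in closed form for this conjugate model.
    s, q = x.sum(), (x ** 2).sum()
    log_marginal = (-0.5 * n * np.log(2 * np.pi) - 0.5 * np.log(n + 1.0)
                    - 0.5 * q + 0.5 * s ** 2 / (n + 1.0))

    mu_hat = x.mean()                                                # maximum-likelihood estimate
    max_loglik = (-0.5 * np.log(2 * np.pi) - 0.5 * (x - mu_hat) ** 2).sum()
    bic = -max_loglik + 0.5 * 1 * np.log(n)                          # d = 1 parameter

    beta = 1.0 / np.log(n)                                           # WBIC inverse temperature
    mb, vb = posterior_params(x, beta=beta)
    mu_b = rng.normal(mb, np.sqrt(vb), size=4000)
    loglik_b = (-0.5 * np.log(2 * np.pi) - 0.5 * (x[None, :] - mu_b[:, None]) ** 2).sum(axis=1)
    wbic = -np.mean(loglik_b)                                        # posterior mean of -log likelihood at beta

    print(f"training loss  {train_loss:.4f}")
    print(f"WAIC           {waic:.4f}")
    print(f"IS-LOO-CV      {loo_cv:.4f}")
    print(f"-log marginal  {-log_marginal:.4f}")
    print(f"BIC            {bic:.4f}")
    print(f"WBIC           {wbic:.4f}")

The first three printed values are per-point generalization-loss estimates and can be compared with one another; the last three are total free-energy quantities and can be compared with the exact minus log marginal likelihood, which is available in closed form for this conjugate model. The under- and over-parametrized situations discussed in the paper, where normal approximations of the posterior fail, are not captured by this regular toy example.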

Keywords: cross validation; information criterion; marginal likelihood.