Scenario-Based Verification of Uncertain MDPs

Tools and Algorithms for the Construction and Analysis of Systems (TACAS 2020), Part I. Lecture Notes in Computer Science, vol. 12078, pp. 287-305. doi: 10.1007/978-3-030-45190-5_16. Published online: 17 April 2020.

Abstract

We consider Markov decision processes (MDPs) in which the transition probabilities and rewards belong to an uncertainty set parametrized by a collection of random variables. The probability distributions for these random parameters are unknown. The problem is to compute the probability of satisfying a temporal logic specification within any MDP that corresponds to a sample from these unknown distributions. In general, this problem is undecidable, and we resort to techniques from so-called scenario optimization. Based on a finite number of samples of the uncertain parameters, each of which induces an MDP, the proposed method estimates the probability of satisfying the specification by solving a finite-dimensional convex optimization problem. The number of samples required to obtain high confidence in this estimate is independent of the number of states and the number of random parameters. Experiments on a large set of benchmarks show that a few thousand samples suffice to obtain high-quality confidence bounds with high probability.

Keywords: MDP; Scenario optimisation; Uncertainty; Verification.
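
Illustrative sketch

The following is a minimal sketch of the sampling-and-bounding idea described in the abstract, not the paper's implementation: it assumes a toy MDP with a simple reachability specification solved by value iteration, a Dirichlet draw as a placeholder for the unknown parameter distribution, and the standard single-decision-variable scenario bound. All helper names and constants are hypothetical.

# Minimal sketch (not the paper's implementation): sample the uncertain
# parameters, build one concrete MDP per sample, compute the probability of
# satisfying a reachability specification by value iteration, and derive a
# scenario-optimization confidence bound. The toy MDP, the Dirichlet
# placeholder for the unknown parameter distribution, and all helper names
# are hypothetical.
import numpy as np

rng = np.random.default_rng(0)

N_STATES, N_ACTIONS, GOAL = 4, 2, 3   # toy MDP: reach state 3 from state 0
N_SAMPLES, BETA = 1000, 1e-3          # number of scenarios N, confidence parameter beta

def sample_parameters():
    # One sample of the uncertain transition probabilities (true distribution
    # unknown; a Dirichlet draw stands in as a placeholder).
    return rng.dirichlet(np.ones(N_STATES), size=(N_STATES, N_ACTIONS))

def reach_probability(P, goal=GOAL, iters=500, tol=1e-9):
    # Maximal probability of eventually reaching `goal`, by value iteration.
    v = np.zeros(N_STATES)
    v[goal] = 1.0
    for _ in range(iters):
        q = P @ v                 # shape (states, actions)
        new_v = q.max(axis=1)
        new_v[goal] = 1.0         # goal is absorbing for the specification
        if np.max(np.abs(new_v - v)) < tol:
            break
        v = new_v
    return v[0]

# Each sampled parameter vector induces one concrete MDP (a "scenario").
sat_probs = np.array([reach_probability(sample_parameters())
                      for _ in range(N_SAMPLES)])

# Standard scenario bound for a single decision variable: with confidence
# 1 - BETA, the minimum over the sampled MDPs lower-bounds the satisfaction
# probability of a freshly sampled MDP, except on a set of probability at
# most eps, where (1 - eps)^N <= BETA.
eps = 1.0 - BETA ** (1.0 / N_SAMPLES)
print(f"lower bound {sat_probs.min():.4f} violated with prob <= {eps:.4f} "
      f"(confidence {1 - BETA:.2%})")

The abstract's claim that the required number of samples is independent of the model size is visible here: eps depends only on N_SAMPLES and BETA, not on the number of states or random parameters.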