Learning-assisted theorem proving with millions of lemmas

Cezary Kaliszyk; Josef Urban

doi:10.1016/j.jsc.2014.09.032

Learning-assisted theorem proving with millions of lemmas

J Symb Comput. 2015 Jul:69:109-128. doi: 10.1016/j.jsc.2014.09.032.

Authors

Cezary Kaliszyk¹, Josef Urban²

Affiliations

¹ University of Innsbruck, Austria.
² Radboud University, Nijmegen, Netherlands.

Abstract

Large formal mathematical libraries consist of millions of atomic inference steps that give rise to a corresponding number of proved statements (lemmas). Analogously to the informal mathematical practice, only a tiny fraction of such statements is named and re-used in later proofs by formal mathematicians. In this work, we suggest and implement criteria defining the estimated usefulness of the HOL Light lemmas for proving further theorems. We use these criteria to mine the large inference graph of the lemmas in the HOL Light and Flyspeck libraries, adding up to millions of the best lemmas to the pool of statements that can be re-used in later proofs. We show that in combination with learning-based relevance filtering, such methods significantly strengthen automated theorem proving of new conjectures over large formal mathematical libraries such as Flyspeck.

Keywords: Artificial intelligence; Flyspeck; Lemma mining; Machine learning.