Inference and Learning for Generative Capsule Models

Alfredo Nazabal; Nikolaos Tsagkas; Christopher K I Williams

doi:10.1162/neco_a_01564

Inference and Learning for Generative Capsule Models

Neural Comput. 2023 Mar 18;35(4):727-761. doi: 10.1162/neco_a_01564.

Authors

Alfredo Nazabal¹, Nikolaos Tsagkas², Christopher K I Williams^{3

4}

Affiliations

¹ Amazon Development Centre Scotland, Edinburgh EH1 3EG, U.K. alfrena@amazon.com.
² School of Informatics, University of Edinburgh, Edinburgh EH8 9AB, U.K. n.tsagkas@ed.ac.uk.
³ School of Informatics, University of Edinburgh, Edinburgh EH8 9AB, U.K.
⁴ Alan Turing Institute, London NW1 2DB, U.K. c.k.i.williams@ed.ac.uk.

PMID: 36746140
DOI: 10.1162/neco_a_01564

Abstract

Capsule networks (see Hinton et al., 2018) aim to encode knowledge of and reason about the relationship between an object and its parts. In this letter, we specify a generative model for such data and derive a variational algorithm for inferring the transformation of each model object in a scene and the assignments of observed parts to the objects. We derive a learning algorithm for the object models, based on variational expectation maximization (Jordan et al., 1999). We also study an alternative inference algorithm based on the RANSAC method of Fischler and Bolles (1981). We apply these inference methods to data generated from multiple geometric objects like squares and triangles ("constellations") and data from a parts-based model of faces. Recent work by Kosiorek et al. (2019) has used amortized inference via stacked capsule autoencoders to tackle this problem; our results show that we significantly outperform them where we can make comparisons (on the constellations data).