SpatialSim: Recognizing Spatial Configurations of Objects With Graph Neural Networks

Front Artif Intell. 2022 Jan 26:4:782081. doi: 10.3389/frai.2021.782081. eCollection 2021.

Abstract

An embodied, autonomous agent able to set its own goals must possess geometrical reasoning abilities to judge whether those goals have been achieved: it should be able to identify and discriminate classes of configurations of objects, irrespective of its point of view on the scene. However, this problem has received little attention so far in the deep learning literature. In this paper we make two key contributions. First, we propose SpatialSim (Spatial Similarity), a novel geometrical reasoning diagnostic dataset, and argue that progress on this benchmark would help diagnose more principled approaches to this problem. The benchmark is composed of two tasks, "Identification" and "Discrimination," each instantiated at increasing levels of difficulty. Second, we validate that relational inductive biases, as exhibited by fully-connected message-passing Graph Neural Networks (MPGNNs), are instrumental in solving these tasks, and show their advantages over less relational baselines such as Deep Sets and unstructured models such as Multi-Layer Perceptrons. We additionally showcase the failure of high-capacity CNNs on the hard Discrimination task. Finally, we highlight the current limits of GNNs in both tasks.

Keywords: artificial intelligence; graph neural net; machine learning; neural networks; similarity learning; spatial reasoning; structured representation.
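
To make the contrast drawn in the abstract concrete, the following is a minimal NumPy sketch of the structural difference between a Deep Sets model and one round of message passing on a fully connected graph. It is an illustration under stated assumptions, not the architecture evaluated in the paper: the feature size, hidden size, random weights, single message-passing round, and the function names (deep_sets, mpgnn_layer, mpgnn) are all hypothetical choices made for this example.

```python
import numpy as np

rng = np.random.default_rng(0)

def relu(x):
    return np.maximum(x, 0.0)

def deep_sets(objects, w_phi, w_rho):
    """Deep Sets: encode each object independently, sum, then read out."""
    encoded = relu(objects @ w_phi)      # (n, h), no interaction between objects
    pooled = encoded.sum(axis=0)         # (h,), permutation-invariant pooling
    return pooled @ w_rho                # (out,)

def mpgnn_layer(objects, w_msg, w_upd):
    """One round of message passing over all ordered pairs of objects."""
    n = objects.shape[0]
    # Row i*n + j holds (receiver i, sender j); self-pairs are kept for simplicity.
    receivers = np.repeat(objects, n, axis=0)            # (n*n, d)
    senders = np.tile(objects, (n, 1))                   # (n*n, d)
    pairs = np.concatenate([receivers, senders], axis=1) # (n*n, 2d)
    messages = relu(pairs @ w_msg)                       # (n*n, h)
    # Sum incoming messages for each receiver.
    aggregated = messages.reshape(n, n, -1).sum(axis=1)  # (n, h)
    # Update each node from its own features and its aggregated messages.
    return relu(np.concatenate([objects, aggregated], axis=1) @ w_upd)  # (n, h)

def mpgnn(objects, w_msg, w_upd, w_rho):
    """Message passing followed by a summed, permutation-invariant readout."""
    nodes = mpgnn_layer(objects, w_msg, w_upd)
    return nodes.sum(axis=0) @ w_rho

# Hypothetical sizes: 5 objects, 4 features each, hidden width 8, 2 outputs.
n, d, h, out = 5, 4, 8, 2
x = rng.normal(size=(n, d))
w_phi = rng.normal(size=(d, h))
w_msg = rng.normal(size=(2 * d, h))
w_upd = rng.normal(size=(d + h, h))
w_rho = rng.normal(size=(h, out))

perm = rng.permutation(n)
ds_out = deep_sets(x, w_phi, w_rho)
gnn_out = mpgnn(x, w_msg, w_upd, w_rho)
ds_out_perm = deep_sets(x[perm], w_phi, w_rho)
gnn_out_perm = mpgnn(x[perm], w_msg, w_upd, w_rho)
```

Both models are invariant to the order in which objects are listed, but only the message-passing variant computes features from pairs of objects before pooling. Pairwise computation of this kind is the relational inductive bias the abstract refers to; the Deep Sets model sees each object in isolation until the sum.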