Exploiting node metadata to predict interactions in bipartite networks using graph embedding and neural networks

R Soc Open Sci. 2022 Aug 24;9(8):220079. doi: 10.1098/rsos.220079. eCollection 2022 Aug.

Abstract

Networks are increasingly used in various fields to represent systems with the aim of understanding the underlying rules governing observed interactions, and hence predict how the system is likely to behave in the future. Recent developments in network science highlight that accounting for node metadata improves both our understanding of how nodes interact with one another, and the accuracy of link prediction. However, to predict interactions in a network within existing statistical and machine learning frameworks, we need to learn objects that rapidly grow in dimension with the number of nodes. Thus, the task becomes computationally and conceptually challenging for networks. Here, we present a new predictive procedure combining a statistical, low-rank graph embedding method with machine learning techniques which reduces substantially the complexity of the learning task and allows us to efficiently predict interactions from node metadata in bipartite networks. To illustrate its application on real-world data, we apply it to a large dataset of tourist visits across a country. We found that our procedure accurately reconstructs existing interactions and predicts new interactions in the network. Overall, both from a network science and data science perspective, our work offers a flexible and generalizable procedure for link prediction.

Keywords: Random Dot Product Graphs; graph embedding; link prediction; machine learning; metadata; predictive models.

Associated data

  • figshare/10.6084/m9.figshare.c.6161532