Graphlet correlation distance to compare small graphs

PLoS One. 2023 Feb 15;18(2):e0281646. doi: 10.1371/journal.pone.0281646. eCollection 2023.

Abstract

Graph models are standard for representing mutual relationships between sets of entities. Often, graphs involve a large number of entities with a small number of connections (e.g. social media relationships, infectious disease spread). Distances or similarities between such large graphs are known to be well captured by the Graphlet Correlation Distance (GCD). This paper deals with small graphs (with potentially high densities of connections), which have been somewhat neglected in the literature but are relevant to fields such as sociology, ecology and fisheries. First, based on numerical experiments, we study the conditions under which Erdős-Rényi, Fitness Scale-Free, Watts-Strogatz small-world and geometric graphs can be distinguished by a specific GCD measure based on 11 orbits, the GCD11. This is done with respect to the density and the order (i.e. the number of nodes) of the graphs, both when comparing graphs of the same order and when comparing graphs of different orders. Second, we develop a randomization statistical test based on the GCD11 to compare empirical graphs to the four null models used in this analysis, and apply it to a fishing case study where graphs represent pairwise proximity between fishing vessels. The statistical test rules out independent pairing within the fleet studied, which is a standard assumption in fisheries. It also illustrates the difficulty of identifying similarities between real-world small graphs and graph models.
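As an illustration of the kind of measure the abstract refers to, the sketch below follows the commonly cited definition of a graphlet correlation distance: for each graph, compute the Spearman correlation between the columns of a node-by-orbit count matrix, then take the Euclidean distance between the upper triangles of the two correlation matrices. It is a minimal sketch, not the authors' implementation. The orbit counts are assumed to have been computed elsewhere (for GCD11 each row would hold 11 orbit counts per node), the function names are hypothetical, and treating a constant column as having zero correlation is an assumption made here for simplicity.

```python
import math


def average_ranks(values):
    """Return 1-based ranks, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the average ranks."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    vx = sum((a - mx) ** 2 for a in rx)
    vy = sum((b - my) ** 2 for b in ry)
    if vx == 0 or vy == 0:
        return 0.0  # assumption: a constant column counts as uncorrelated
    return cov / math.sqrt(vx * vy)


def correlation_matrix(orbit_counts):
    """orbit_counts: one row per node, one column per orbit."""
    cols = list(zip(*orbit_counts))
    k = len(cols)
    return [[spearman(cols[i], cols[j]) for j in range(k)] for i in range(k)]


def graphlet_correlation_distance(counts_a, counts_b):
    """Euclidean distance between the upper triangles of the two matrices."""
    ga = correlation_matrix(counts_a)
    gb = correlation_matrix(counts_b)
    k = len(ga)
    return math.sqrt(
        sum((ga[i][j] - gb[i][j]) ** 2 for i in range(k) for j in range(i + 1, k))
    )


# Toy input with two made-up "orbit" columns per node (not real orbit counts).
toy_a = [[1, 1], [2, 2], [3, 3]]  # columns perfectly correlated
toy_b = [[1, 3], [2, 2], [3, 1]]  # columns perfectly anti-correlated
print(graphlet_correlation_distance(toy_a, toy_b))
```

Because the distance depends only on correlations between orbit columns, the two graphs may have different numbers of nodes, which matches the abstract's comparison of graphs with different orders.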

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Fisheries
  • Mathematics*

Grants and funding

JR, Conseil Regional Pays de la Loire. SM, National research project TRACFLO, National Institute for Ocean Science. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.