Identifying accurate link predictors based on assortativity of complex networks

Ahmad F Al Musawi; Satyaki Roy; Preetam Ghosh

doi:10.1038/s41598-022-22843-4

Identifying accurate link predictors based on assortativity of complex networks

Sci Rep. 2022 Oct 27;12(1):18107. doi: 10.1038/s41598-022-22843-4.

Authors

Ahmad F Al Musawi^{1

2}, Satyaki Roy³, Preetam Ghosh⁴

Affiliations

¹ Department of Information Technology, University of Thi Qar, Thi Qar, Iraq. almusawiaf@utq.edu.iq.
² Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA. almusawiaf@utq.edu.iq.
³ Department of Genetics, University of North Carolina, Chapel Hill, NC, USA.
⁴ Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA.

Abstract

Link prediction algorithms in complex networks, such as social networks, biological networks, drug-drug interactions, communication networks, and so on, assign scores to predict potential links between two nodes. Link prediction (LP) enables researchers to learn unknown, new as well as future interactions among the entities being modeled in the complex networks. In addition to measures like degree distribution, clustering coefficient, centrality, etc., another metric to characterize structural properties is network assortativity which measures the tendency of nodes to connect with similar nodes. In this paper, we explore metrics that effectively predict the links based on the assortativity profiles of the complex networks. To this end, we first propose an approach that generates networks of varying assortativity levels and utilize three sets of link prediction models combining the similarity of neighborhoods and preferential attachment. We carry out experiments to study the LP accuracy (measured in terms of area under the precision-recall curve) of the link predictors individually and in combination with other baseline measures. Our analysis shows that link prediction models that explore a large neighborhood around nodes of interest, such as CH2-L2 and CH2-L3, perform consistently for assortative as well as disassortative networks. While common neighbor-based local measures are effective for assortative networks, our proposed combination of common neighbors with node degree is a good choice for the LP metric in disassortative networks. We discuss how this analysis helps achieve the best-parameterized combination of link prediction models and its significance in the context of link prediction from incomplete social and biological network data.

Publication types

Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Algorithms*
Cluster Analysis