Statistical and Network-Based Analysis of Italian COVID-19 Data: Communities Detection and Temporal Evolution

Int J Environ Res Public Health. 2020 Jun 12;17(12):4182. doi: 10.3390/ijerph17124182.

Abstract

The coronavirus disease (COVID-19) outbreak started in Wuhan, China, and it has rapidly spread across the world. Italy is one of the European countries most affected by COVID-19, and it has registered high COVID-19 death rates and the death toll. In this article, we analyzed different Italian COVID-19 data at the regional level for the period 24 February to 29 March 2020. The analysis pipeline includes the following steps. After individuating groups of similar or dissimilar regions with respect to the ten types of available COVID-19 data using statistical test, we built several similarity matrices. Then, we mapped those similarity matrices into networks where nodes represent Italian regions and edges represent similarity relationships (edge length is inversely proportional to similarity). Then, network-based analysis was performed mainly discovering communities of regions that show similar behavior. In particular, network-based analysis was performed by running several community detection algorithms on those networks and by underlying communities of regions that show similar behavior. The network-based analysis of Italian COVID-19 data is able to elegantly show how regions form communities, i.e., how they join and leave them, along time and how community consistency changes along time and with respect to the different available data.

Keywords: COVID-19; community detection; network analysis.

MeSH terms

  • Betacoronavirus
  • COVID-19
  • Coronavirus Infections / epidemiology*
  • Data Interpretation, Statistical
  • Hospitalization / trends*
  • Humans
  • Italy / epidemiology
  • Pandemics
  • Pneumonia, Viral / epidemiology*
  • SARS-CoV-2
  • Spatio-Temporal Analysis