Clustering Cities over Features Extracted from Multiple Virtual Sensors Measuring Micro-Level Activity Patterns Allows One to Discriminate Large-Scale City Characteristics

Sensors (Basel). 2023 May 29;23(11):5165. doi: 10.3390/s23115165.

Abstract

The impact of micro-level people's activities on urban macro-level indicators is a complex question that has been the subject of much interest among researchers and policymakers. Transportation preferences, consumption habits, communication patterns and other individual-level activities can significantly impact large-scale urban characteristics, such as the potential for innovation generation of the city. Conversely, large-scale urban characteristics can also constrain and determine the activities of their inhabitants. Therefore, understanding the interdependence and mutual reinforcement between micro- and macro-level factors is critical to defining effective public policies. The increasing availability of digital data sources, such as social media and mobile phones, has opened up new opportunities for the quantitative study of this interdependency. This paper aims to detect meaningful city clusters on the basis of a detailed analysis of the spatiotemporal activity patterns for each city. The study is carried out on a worldwide city dataset of spatiotemporal activity patterns obtained from geotagged social media data. Clustering features are obtained from unsupervised topic analyses of activity patterns. Our study compares state-of-the-art clustering models, selecting the model achieving a 2.7% greater Silhouette Score than the next-best model. Three well-separated city clusters are identified. Additionally, the study of the distribution of the City Innovation Index over these three city clusters shows discrimination of low performing from high performing cities relative to innovation. Low performing cities are identified in one well-separated cluster. Therefore, it is possible to correlate micro-scale individual-level activities to large-scale urban characteristics.

Keywords: City Innovation Index; activity patterns; clustering models; geotagged data.

MeSH terms

  • Cities
  • Cluster Analysis
  • Humans
  • Time Factors
  • Transportation*

Grants and funding

This work would not have been accomplished without the financial support of CONICYT-PFCHA/DOCTORADO BECAS CHILE/2019-21190345. The last author received research funds from the Basque Government as the head of the Grupo de Inteligencia Computacional, Universidad del Pais Vasco, UPV/EHU, from 2007 until 2025. The current code for the grant is IT1689-22. Additionally, the author participates in Elkartek projects KK-2022/00051 and KK-2021/00070. The Spanish MCIN has also granted the author a research project under code PID2020-116346GB-I00.