A multi-robot deep Q-learning framework for priority-based sanitization of railway stations

Riccardo Caccavale; Mirko Ermini; Eugenio Fedeli; Alberto Finzi; Vincenzo Lippiello; Fabrizio Tavano

doi:10.1007/s10489-023-04529-0

A multi-robot deep Q-learning framework for priority-based sanitization of railway stations

Appl Intell (Dordr). 2023 Apr 18:1-19. doi: 10.1007/s10489-023-04529-0. Online ahead of print.

Authors

Riccardo Caccavale^#¹, Mirko Ermini^#², Eugenio Fedeli^#³, Alberto Finzi^#¹, Vincenzo Lippiello^#¹, Fabrizio Tavano^#^{1

4}

Affiliations

¹ Department DIETI, Università degli Study di Napoli "Federico II", via Claudio 21, Naples, 80125 Italy.
² Department Research and Development, Rete Ferroviaria Italiana, Via Curzio Malaparte 8, Firenze Osmannoro, 50145 Italy.
³ Department Research and Development, Rete Ferroviaria Italiana, Piazza della Croce Rossa 1, Roma, 00161 Italy.
⁴ Department Research and Development, Rete Ferroviaria Italiana, Via del Portonaccio 175, Roma, 00159 Italy.

^# Contributed equally.

Abstract

Sanitizing railway stations is a relevant issue, primarily due to the recent evolution of the Covid-19 pandemic. In this work, we propose a multi-robot approach to sanitize railway stations based on a distributed Deep Q-Learning technique. The proposed framework relies on anonymous data from existing WiFi networks to dynamically estimate crowded areas within the station and to develop a heatmap of prioritized areas to be sanitized. Such heatmap is then provided to a team of cleaning robots - each endowed with a robot-specific convolutional neural network - that learn how to effectively cooperate and sanitize the station's areas according to the associated priorities. The proposed approach is evaluated in a realistic simulation scenario provided by the Italian largest railways station: Roma Termini. In this setting, we consider different case studies to assess how the approach scales with the number of robots and how the trained system performs with a real dataset retrieved from a one-day data recording of the station's WiFi network.

Keywords: Convolutional neural network; Decentralized; Deep Q-network; Heatmap; Multi-agent; Sanitization.