Dynamic-Fusion-Based Federated Learning for COVID-19 Detection

IEEE Internet Things J. 2021 Feb 4;8(21):15884-15891. doi: 10.1109/JIOT.2021.3056185. eCollection 2021 Nov 1.

Abstract

Medical diagnostic image analysis (e.g., CT scan or X-Ray) using machine learning is an efficient and accurate way to detect COVID-19 infections. However, the sharing of diagnostic images across medical institutions is usually prohibited due to patients' privacy concerns. This causes the issue of insufficient data sets for training the image classification model. Federated learning is an emerging privacy-preserving machine learning paradigm that produces an unbiased global model based on the received local model updates trained by clients without exchanging clients' local data. Nevertheless, the default setting of federated learning introduces a huge communication cost of transferring model updates and can hardly ensure model performance when severe data heterogeneity of clients exists. To improve communication efficiency and model performance, in this article, we propose a novel dynamic fusion-based federated learning approach for medical diagnostic image analysis to detect COVID-19 infections. First, we design an architecture for dynamic fusion-based federated learning systems to analyze medical diagnostic images. Furthermore, we present a dynamic fusion method to dynamically decide the participating clients according to their local model performance and schedule the model fusion based on participating clients' training time. In addition, we summarize a category of medical diagnostic image data sets for COVID-19 detection, which can be used by the machine learning community for image analysis. The evaluation results show that the proposed approach is feasible and performs better than the default setting of federated learning in terms of model performance, communication efficiency, and fault tolerance.

Keywords: AI; COVID-19; CT; X-Ray; federated learning; image processing; machine learning.

Grants and funding

This work was supported in part by the National Natural Science Foundation of China under Grant 62072469; in part by the National Key Research and Development Program under Grant 2018YFE0116700 and Grant 2020YFB2104301; in part by the Shandong Provincial Natural Science Foundation (Parallel Data-Driven Fault Prediction Under Online–Offline Combined Cloud Computing Environment) under Grant ZR2019MF049; in part by the Fundamental Research Funds for the Central Universities under Grant 2015020031; in part by the Project “PCL Future Greater-Bay Area Network Facilities for Large-Scale Experiments and Applications” under Grant LZC0019; in part by the Special Project of West Coast Artificial Intelligence Technology Innovation Center under Grant 2019-1-5 and Grant 2019-1-6; in part by the Opening Project of Shanghai Trusted Industrial Control Platform under Grant TICPSH202003015-ZC; and in part by the Project “Beihang Beidou Technological Achievements Transformation and Industrialization Funds” under Grant BARI2005.