An Optimization Method for Non-IID Federated Learning Based on Deep Reinforcement Learning

Xutao Meng; Yong Li; Jianchao Lu; Xianglin Ren

doi:10.3390/s23229226

An Optimization Method for Non-IID Federated Learning Based on Deep Reinforcement Learning

Sensors (Basel). 2023 Nov 16;23(22):9226. doi: 10.3390/s23229226.

Authors

Xutao Meng¹, Yong Li^{1

2

3}, Jianchao Lu⁴, Xianglin Ren¹

Affiliations

¹ School of Computer Science and Engineering, Changchun University of Technology, Changchun 130012, China.
² AI Research Institute, Changchun University of Technology, Changchun 130012, China.
³ School of Computer Science and Technology, Jilin University, Changchun 130012, China.
⁴ School of Computing, Macquarie University, Sydney, NSW 2109, Australia.

Abstract

Federated learning (FL) is a distributed machine learning paradigm that enables a large number of clients to collaboratively train models without sharing data. However, when the private dataset between clients is not independent and identically distributed (non-IID), the local training objective is inconsistent with the global training objective, which possibly causes the convergence speed of FL to slow down, or even not converge. In this paper, we design a novel FL framework based on deep reinforcement learning (DRL), named FedRLCS. In FedRLCS, we primarily improved the greedy strategy and action space of the double DQN (DDQN) algorithm, enabling the server to select the optimal subset of clients from a non-IID dataset to participate in training, thereby accelerating model convergence and reaching the target accuracy in fewer communication epochs. In simulation experiments, we partition multiple datasets with different strategies to simulate non-IID on local clients. We adopt four models (LeNet-5, MobileNetV2, ResNet-18, ResNet-34) on the four datasets (CIFAR-10, CIFAR-100, NICO, Tiny ImageNet), respectively, and conduct comparative experiments with five state-of-the-art non-IID FL methods. Experimental results show that FedRLCS reduces the number of communication rounds required by 10-70% with the same target accuracy without increasing the computation and storage costs for all clients.

Keywords: client selection; deep reinforcement learning; federated learning; non-IID.

Grants and funding

JJKH20230766KJ/Jilin Provincial Department of Education in China