Communication-efficient federated learning with stagewise training strategy

Neural Netw. 2023 Oct:167:460-472. doi: 10.1016/j.neunet.2023.08.033. Epub 2023 Sep 1.

Abstract

The efficiency of communication across workers significantly affects the performance of federated learning. Although periodic communication strategies are applied to reduce the number of communication rounds during training, the communication cost remains high when the training data distributions are not independent and identically distributed (non-IID), which is common in federated learning. Recently, some works have introduced variance reduction to eliminate the effect of non-IID data among workers. Nevertheless, the provably optimal communication complexity O(log(ST)) and convergence rate O(1/(ST)) cannot be achieved simultaneously, where S denotes the number of workers sampled in each round and T is the number of iterations. To resolve this dilemma, we propose an optimization algorithm, SQUARFA, that adopts a stagewise training framework coupled with variance reduction and uses a quick-start phase in each stage. Theoretical results show that SQUARFA achieves both the optimal convergence rate and the optimal communication complexity for strongly convex objectives and for non-convex objectives under the Polyak-Lojasiewicz (PL) condition, thus filling the gap mentioned above. A variant of SQUARFA then yields optimal theoretical results for general non-convex objectives. We further extend the technique in SQUARFA to the large-batch setting and achieve optimal communication complexity. Experimental results demonstrate the superiority of the proposed algorithms.
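The abstract does not give SQUARFA's update rules, so the following is only a minimal sketch of the general pattern it describes: an outer stagewise loop, a quick-start phase of plain local SGD at the start of each stage, and variance-reduced local steps afterward. The SVRG-style snapshot correction, the step-size schedule, and all names and hyperparameters (quick_start_rounds, local_steps, etc.) are illustrative assumptions, not the paper's algorithm.

```python
# Hypothetical sketch: stagewise federated training with variance reduction
# and a quick-start phase. Not the paper's SQUARFA specification.
import numpy as np

rng = np.random.default_rng(0)

# Toy non-IID least-squares problem: each worker holds its own (A_i, b_i).
num_workers, dim = 10, 5
A = [rng.normal(size=(20, dim)) + i * 0.1 for i in range(num_workers)]
b = [rng.normal(size=20) for _ in range(num_workers)]

def local_grad(i, w):
    """Gradient of 0.5 * ||A_i w - b_i||^2 / n_i on worker i."""
    return A[i].T @ (A[i] @ w - b[i]) / len(b[i])

def full_grad(w):
    """Average gradient across all workers (the stage snapshot)."""
    return np.mean([local_grad(i, w) for i in range(num_workers)], axis=0)

w = np.zeros(dim)
stages, rounds_per_stage, quick_start_rounds, local_steps = 5, 20, 3, 5
lr = 0.05

for s in range(stages):
    snapshot, mu = w.copy(), full_grad(w)  # stage anchor for variance reduction
    lr_s = lr / (s + 1)                    # assumed stagewise decaying step size
    for r in range(rounds_per_stage):
        sampled = rng.choice(num_workers, size=4, replace=False)
        updates = []
        for i in sampled:
            w_i = w.copy()
            for _ in range(local_steps):
                if r < quick_start_rounds:
                    g = local_grad(i, w_i)  # quick-start: plain local SGD
                else:
                    # variance-reduced step: correct local drift with snapshot
                    g = local_grad(i, w_i) - local_grad(i, snapshot) + mu
                w_i -= lr_s * g
            updates.append(w_i)
        w = np.mean(updates, axis=0)        # server averages sampled workers

print("final objective:",
      np.mean([0.5 * np.mean((A[i] @ w - b[i]) ** 2) for i in range(num_workers)]))
```

In this sketch, the snapshot correction cancels the stationary bias each worker's local gradient carries under non-IID data, which is why variance-reduction methods can tolerate long local phases between communications; the quick-start rounds cheaply move the iterate before the corrected steps take over in each stage.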

Keywords: Communication complexity; Convergence rate; Federated learning; Optimization algorithm.

MeSH terms

  • Algorithms*
  • Communication
  • Humans
  • Learning*