Learning Performance of Weighted Distributed Learning With Support Vector Machines

IEEE Trans Cybern. 2023 Jul;53(7):4630-4641. doi: 10.1109/TCYB.2021.3131424. Epub 2023 Jun 15.

Abstract

The divide-and-conquer strategy is a very effective method of dealing with big data. Noisy samples in big data usually have a great impact on algorithmic performance. In this article, we introduce Markov sampling and different weights for distributed learning with the classical support vector machine (cSVM). We first estimate the generalization error of weighted distributed cSVM algorithm with uniformly ergodic Markov chain (u.e.M.c.) samples and obtain its optimal convergence rate. As applications, we obtain the generalization bounds of weighted distributed cSVM with strong mixing observations and independent and identically distributed (i.i.d.) samples, respectively. We also propose a novel weighted distributed cSVM based on Markov sampling (DM-cSVM). The numerical studies of benchmark datasets show that the DM-cSVM algorithm not only has better performance but also has less total time of sampling and training compared to other distributed algorithms.

MeSH terms

  • Algorithms*
  • Learning
  • Markov Chains
  • Support Vector Machine*