Distributed semi-supervised support vector machines

Simone Scardapane; Roberto Fierimonte; Paolo Di Lorenzo; Massimo Panella; Aurelio Uncini

doi:10.1016/j.neunet.2016.04.007

Neural Netw. 2016 Aug:80:43-52. doi: 10.1016/j.neunet.2016.04.007. Epub 2016 Apr 27.

Authors

Simone Scardapane¹, Roberto Fierimonte², Paolo Di Lorenzo³, Massimo Panella⁴, Aurelio Uncini⁵

Affiliations

¹ Department of Information Engineering, Electronics and Telecommunications (DIET), "Sapienza" University of Rome, Via Eudossiana 18, 00184 Rome, Italy. Electronic address: simone.scardapane@uniroma1.it.
² Department of Information Engineering, Electronics and Telecommunications (DIET), "Sapienza" University of Rome, Via Eudossiana 18, 00184 Rome, Italy. Electronic address: robertofierimonte@gmail.com.
³ Department of Engineering, University of Perugia, Via G. Duranti 93, 06125, Perugia, Italy. Electronic address: paolo.dilorenzo@unipg.it.
⁴ Department of Information Engineering, Electronics and Telecommunications (DIET), "Sapienza" University of Rome, Via Eudossiana 18, 00184 Rome, Italy. Electronic address: massimo.panella@uniroma1.it.
⁵ Department of Information Engineering, Electronics and Telecommunications (DIET), "Sapienza" University of Rome, Via Eudossiana 18, 00184 Rome, Italy. Electronic address: aurelio.uncini@uniroma1.it.

PMID: 27179615
DOI: 10.1016/j.neunet.2016.04.007

Abstract

The semi-supervised support vector machine (S³VM) is a well-known algorithm for performing semi-supervised inference under the large margin principle. In this paper, we are interested in the problem of training an S³VM when the labeled and unlabeled samples are distributed over a network of interconnected agents. In particular, the aim is to design a distributed training protocol over networks, where communication is restricted to neighboring agents and no coordinating authority is present. Using a standard relaxation of the original S³VM, we formulate the training problem as the distributed minimization of a non-convex social cost function. To find a (stationary) solution in a distributed manner, we employ two different strategies: (i) a distributed gradient descent algorithm; (ii) a recently developed framework for In-Network Nonconvex Optimization (NEXT), which is based on successive convexifications of the original problem, interleaved with state diffusion steps. Our experimental results show that the proposed distributed algorithms perform comparably to a centralized implementation, and they highlight the pros and cons of each proposed solution. To date, this is the first work that paves the way toward the broad field of distributed semi-supervised learning over networks.

Keywords: Distributed learning; Networks; Semi-supervised learning; Support vector machine.
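
As a rough illustration of strategy (i), the sketch below runs a diffusion-style distributed gradient descent on a smooth S³VM surrogate. It is not the authors' implementation, and several pieces are assumptions made here because the abstract does not specify them: a linear model without bias, a squared hinge loss on labeled samples, the surrogate exp(-s f(x)²) on unlabeled samples, a ring network with uniform mixing weights, and all function and parameter names (`local_grad`, `ring_mixing_matrix`, `distributed_gd`, `lam_l`, `lam_u`, `s`, `reg`).

```python
import numpy as np

def local_grad(w, Xl, yl, Xu, lam_l=1.0, lam_u=0.5, s=3.0):
    """Gradient of one agent's local cost (assumed smooth S3VM surrogate)."""
    # Labeled term: mean squared hinge, max(0, 1 - y * w.x)^2
    margin = np.maximum(0.0, 1.0 - yl * (Xl @ w))
    g_lab = -2.0 * (Xl.T @ (margin * yl)) / len(yl)
    # Unlabeled term: mean of exp(-s * (w.x)^2), a smooth non-convex
    # penalty that is largest when a point lies on the decision boundary
    f = Xu @ w
    g_unl = Xu.T @ (-2.0 * s * f * np.exp(-s * f**2)) / len(f)
    return lam_l * g_lab + lam_u * g_unl

def ring_mixing_matrix(n):
    """Doubly stochastic weights for a ring of n >= 3 agents (assumed topology)."""
    C = np.zeros((n, n))
    for i in range(n):
        for j in (i - 1, i, i + 1):
            C[i, j % n] = 1.0 / 3.0  # self and two neighbors, equal weight
    return C

def distributed_gd(agents, C, dim, steps=500, lr=0.02, reg=0.01):
    """Each agent takes a local gradient step, then averages with neighbors.

    agents: list of (Xl, yl, Xu) tuples, one per agent; data never leaves
    the agent, only the current weight estimates are exchanged.
    """
    n = len(agents)
    W = np.zeros((n, dim))  # row k holds agent k's current estimate
    for _ in range(steps):
        # Local adaptation: gradient of the local cost plus this agent's
        # share of the global L2 regularizer
        G = np.stack([local_grad(W[k], *agents[k]) for k in range(n)])
        Psi = W - lr * (G + (reg / n) * W)
        # Diffusion: C[k, j] is nonzero only for neighbors j of agent k
        W = C @ Psi
    return W
```

With a constant step size the agents typically agree only approximately; a diminishing step size is the usual way to tighten consensus, at the cost of slower progress. Since the cost is non-convex, the result is at best a stationary point and depends on the initialization.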

MeSH terms

Algorithms
Supervised Machine Learning*
Support Vector Machine*