Finding Roles of Players in Football Using Automatic Particle Swarm Optimization-Clustering Algorithm

Big Data. 2019 Mar;7(1):35-56. doi: 10.1089/big.2018.0069. Epub 2019 Feb 15.

Abstract

Recently, professional team sport organizations have invested their resources to analyze their own and opponents' performance. So, developing methods and algorithms for analyzing team sports has become one of the most popular topics among data scientists. Analyzing football is hard because of its complexity, number of events in each match, and constant flow of circulation of the ball. Finding roles of players with the purpose of analyzing the performance of a team or making a meaningful comparison between players is crucial. In this article, an automatic big data clustering method, based on a swarm intelligence algorithm, is proposed to automatically cluster the data set of players' performance centers in different matches and extract different kinds of roles in football. The proposed method created using particle swarm optimization algorithm has two phases. In the first phase, the algorithm searches the solution space to find the number of clusters and, in the second phase, it finds the positions of the centroids. To show the effectiveness of the algorithm, it is tested on six synthetic data sets and its performance is compared with two other conventional clustering methods. After that, the algorithm is used to find clusters of a data set containing 93,000 objects, which are the centers of players' performance in about 4900 matches in different European leagues.

Keywords: big data clustering; football analysis; particle swarm optimization; swarm intelligence.

MeSH terms

  • Algorithms*
  • Cluster Analysis
  • Humans
  • Professional Role*
  • Soccer*