A model-based approach to Spotify data analysis: a Beta GLMM

J Appl Stat. 2020 Aug 10;49(1):214-229. doi: 10.1080/02664763.2020.1803810. eCollection 2022.

Abstract

Digital music distribution is increasingly powered by automated mechanisms that continuously capture, sort and analyze large amounts of Web-based data. This paper deals with the management of songs audio features from a statistical point of view. In particular, it explores the data catching mechanisms enabled by Spotify Web API and suggests statistical tools for the analysis of these data. Special attention is devoted to songs popularity and a Beta model, including random effects, is proposed in order to give the first answer to questions like: which are the determinants of popularity? The identification of a model able to describe this relationship, the determination within the set of characteristics of those considered most important in making a song popular is a very interesting topic for those who aim to predict the success of new products.

Keywords: 62; 62H; 62P; Beta GLMM; Spotify web API; audio features; popularity index.