Population models and simulation methods: The case of the Spearman rank correlation

Br J Math Stat Psychol. 2017 Nov;70(3):347-367. doi: 10.1111/bmsp.12085. Epub 2017 Jan 31.

Abstract

The purpose of this paper is to highlight the importance of a population model in guiding the design and interpretation of simulation studies used to investigate the Spearman rank correlation. The Spearman rank correlation has been known for over a hundred years to applied researchers and methodologists alike and is one of the most widely used non-parametric statistics. Still, certain misconceptions can be found, either explicitly or implicitly, in the published literature because a population definition for this statistic is rarely discussed within the social and behavioural sciences. By relying on copula distribution theory, a population model is presented for the Spearman rank correlation, and its properties are explored both theoretically and in a simulation study. Through the use of the Iman-Conover algorithm (which allows the user to specify the rank correlation as a population parameter), simulation studies from previously published articles are explored, and it is found that many of the conclusions purported in them regarding the nature of the Spearman correlation would change if the data-generation mechanism better matched the simulation design. More specifically, issues such as small sample bias and lack of power of the t-test and r-to-z Fisher transformation disappear when the rank correlation is calculated from data sampled where the rank correlation is the population parameter. A proof for the consistency of the sample estimate of the rank correlation is shown as well as the flexibility of the copula model to encompass results previously published in the mathematical literature.

Keywords: Monte Carlo simulation; Spearman rank correlation; copula; rank correlation; simulation design.

MeSH terms

  • Algorithms
  • Computer Simulation
  • Humans
  • Models, Psychological
  • Models, Statistical
  • Monte Carlo Method
  • Population Density
  • Psychometrics / statistics & numerical data*
  • Selection Bias
  • Statistics, Nonparametric*