Multivariate tests based on interpoint distances with application to magnetic resonance imaging

Stat Methods Med Res. 2016 Dec;25(6):2593-2610. doi: 10.1177/0962280214529104. Epub 2014 Apr 16.

Abstract

The multivariate location problem is addressed. The most familiar method for this problem is the Hotelling test, which is optimal when the underlying distributions are normal. In practice, however, the distributions underlying the samples are generally unknown, and without assuming normality the finite-sample unbiasedness of the Hotelling test is not guaranteed. Moreover, high-dimensional data are increasingly encountered in medical and biological problems, and in these situations the Hotelling test performs poorly or cannot be computed. A test based on the ranks of the interpoint Euclidean distances between observations has recently been proposed; it is unbiased for non-normal data, for small sample sizes, and for two-sided alternatives, and it can be computed for high-dimensional data. Five modifications of this test are proposed and compared with the original test and the Hotelling test. Unbiasedness and consistency of the tests are proven, and the problem of power computation is addressed. It is shown that two of the modified interpoint distance-based tests are always more powerful than the original test. In particular, the modified test based on the Tippett criterion is suggested when the assumption of normality is not tenable and/or when the data are high-dimensional with a complex dependence structure, as is typical in molecular biology and medical imaging. A practical application to a case-control study that uses functional magnetic resonance imaging is discussed.
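To make the idea of a rank-based interpoint distance test concrete, the following is a minimal sketch and not the procedure of the paper. It assumes one simple construction: take the first observation of the first sample as a pivot, rank the Euclidean distances from that pivot to all remaining observations, and sum the ranks that fall in the second sample. The p-value comes from permuting the group labels of the non-pivot observations. The function names (`euclidean`, `rank_sum_statistic`, `interpoint_rank_test`), the choice of pivot, the one-sided alternative, and the toy data are all assumptions made for illustration; none of the five modifications, including the one based on the Tippett criterion, is implemented here.

```python
import math
import random


def euclidean(a, b):
    """Euclidean distance between two equal-length coordinate sequences."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def rank_sum_statistic(distances, in_second):
    """Sum of the ranks (1 = smallest distance) held by second-sample points.

    Ties are broken by input order; with continuous data ties are unlikely.
    """
    order = sorted(range(len(distances)), key=lambda i: distances[i])
    return sum(rank for rank, i in enumerate(order, start=1) if in_second[i])


def interpoint_rank_test(first, second, n_perm=999, seed=0):
    """Return (statistic, one-sided permutation p-value).

    Illustrative assumption: the pivot is first[0]; a large statistic means
    the second sample tends to lie farther from the pivot than the first.
    """
    pivot, rest = first[0], first[1:] + second
    distances = [euclidean(pivot, p) for p in rest]
    labels = [False] * (len(first) - 1) + [True] * len(second)
    observed = rank_sum_statistic(distances, labels)

    rng = random.Random(seed)
    hits = 0
    for _ in range(n_perm):
        shuffled = labels[:]
        rng.shuffle(shuffled)  # relabel the non-pivot observations
        if rank_sum_statistic(distances, shuffled) >= observed:
            hits += 1
    # Add-one correction keeps the p-value strictly positive.
    return observed, (1 + hits) / (1 + n_perm)


# Toy data (made up): two three-dimensional samples with a clear location shift.
first = [(0.1 * i, 0.2 * i, 0.0) for i in range(6)]
second = [(10 + 0.1 * i, 0.2 * i, 0.0) for i in range(6)]
stat, p_value = interpoint_rank_test(first, second)
print(stat, p_value)
```

Because the statistic depends on the data only through distances, it can be computed whatever the dimension, including when the dimension exceeds the sample size and the Hotelling statistic is not available. In this toy example every second-sample point is farther from the pivot than every first-sample point, so the statistic takes its largest possible value, 6 + 7 + ... + 11 = 51.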

Keywords: case-control study; high-dimensional data; hypothesis testing; interpoint distance; nonparametric tests.

MeSH terms

  • Adult
  • Case-Control Studies
  • Coronary Circulation
  • Humans
  • Magnetic Resonance Imaging / methods*
  • Male
  • Multivariate Analysis*
  • Sample Size
  • Smokers
  • Statistics, Nonparametric