A modified F-test for hypothesis testing in large-scale data

J Biopharm Stat. 2018;28(6):1078-1089. doi: 10.1080/10543406.2018.1436557. Epub 2018 Feb 12.

Abstract

Nowadays, performing simultaneous hypothesis testing under small sample sizes in large-scale features has received unprecedented attention in practice. Traditional approaches are not appropriate for this purpose. Instead, a novel approach which is based on the null statistic is used in the literature. The null statistic approach, using the permutation samples of all features, permits the more accurate estimation of the [Formula: see text]-value for a target feature. However, the known methods, working based on the null statistic, cannot be applied to features with three or more levels. In the present study, an attempt is made to introduce a new permutation test statistic which can be modified to the null statistic approach when features have three or more levels. The robustness of the proposed test statistic and its performance in estimating the [Formula: see text]-value are demonstrated through conducting simulation studies and the analysis of two real datasets.

Keywords: Microarray data; null statistic; one-way ANOVA; permutation test.

MeSH terms

  • Biostatistics / methods*
  • Computer Simulation
  • Data Interpretation, Statistical
  • Female
  • Gene Expression Profiling / methods
  • Gene Expression Profiling / statistics & numerical data*
  • Humans
  • Male
  • Models, Statistical
  • Oligonucleotide Array Sequence Analysis / methods
  • Oligonucleotide Array Sequence Analysis / statistics & numerical data*
  • Research Design / statistics & numerical data*