Empirical investigations into Kruskal-Wallis power studies utilizing Bernstein fits, simulations and medical study datasets

Jeremy S C Clark; Piotr Kulig; Konrad Podsiadło; Kamila Rydzewska; Krzysztof Arabski; Monika Białecka; Krzysztof Safranow; Andrzej Ciechanowicz

doi:10.1038/s41598-023-29308-2

Empirical investigations into Kruskal-Wallis power studies utilizing Bernstein fits, simulations and medical study datasets

Sci Rep. 2023 Feb 9;13(1):2352. doi: 10.1038/s41598-023-29308-2.

Authors

Jeremy S C Clark¹, Piotr Kulig², Konrad Podsiadło², Kamila Rydzewska², Krzysztof Arabski², Monika Białecka³, Krzysztof Safranow⁴, Andrzej Ciechanowicz²

Affiliations

¹ Department of Clinical and Molecular Biochemistry, Pomeranian Medical University, Ul. Powstancow Wlkp. 72, 70-111, Szczecin, Poland. jeremyclarkbio@gmail.com.
² Department of Clinical and Molecular Biochemistry, Pomeranian Medical University, Ul. Powstancow Wlkp. 72, 70-111, Szczecin, Poland.
³ Department of Pharmacokinetics and Therapeutic Drug Monitoring, Pomeranian Medical University, Szczecin, Poland.
⁴ Department of Biochemistry and Medical Chemistry, Pomeranian Medical University, Szczecin, Poland.

Abstract

Bernstein fits implemented into R allow another route for Kruskal-Wallis power-study tool development. Monte-Carlo Kruskal-Wallis power studies were compared with measured power, a Monte-Carlo ANOVA equivalent and with an analytical method, with or without normalization, using four simulated runs, each with 60-100 populations (each population with N = 30,000 from a set of Pearson-type ranges): random selection gave 6300 samples analyzed for predictive power. Three medical-study datasets (Dialysis/systolic blood pressure; Diabetes/sleep-hours; Marital-status/high-density-lipoprotein cholesterol) were also analyzed. In three from four simulated runs (run_one, run_one_relaxed, and run_three) with Pearson types pooled, Monte-Carlo Kruskal-Wallis gave predicted sample sizes significantly slightly lower than measured but more accurate than with ANOVA methods; the latter gave high sample-size predictions. Populations (run_one_relaxed) with ANOVA assumptions invalid gave Kruskal-Wallis predictions similar to those measured. In two from three medical studies, Kruskal-Wallis predictions (Dialysis: similar predictions; Marital: higher than measured) were more accurate than ANOVA (both higher than measured) but in one (Diabetes) the reverse was found (Kruskal-Wallis: lower; Monte-Carlo ANOVA: similar to measured). These preliminary studies appear to show that Monte-Carlo Kruskal-Wallis power studies, based on Bernstein fits, might perform better than ANOVA equivalents in many settings (and provide reasonable results when ANOVA cannot be used); and both Monte-Carlo methods appeared to be considerably more accurate than the analytical version analyzed.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Computer Simulation
Monte Carlo Method
Renal Dialysis*
Sample Size