Consistency checks to improve measurement with the Hamilton Rating Scale for Anxiety (HAM-A)

Jonathan Rabinowitz; Janet B W Williams; Nanco Hefting; Ariana Anderson; Brianne Brown; Dong Jing Fu; Bashkim Kadriu; Alan Kott; Atul Mahableshwarkar; Jan Sedway; David Williamson; Christian Yavorsky; Nina R Schooler

doi:10.1016/j.jad.2023.01.029

Consistency checks to improve measurement with the Hamilton Rating Scale for Anxiety (HAM-A)

J Affect Disord. 2023 Mar 15:325:429-436. doi: 10.1016/j.jad.2023.01.029. Epub 2023 Jan 10.

Authors

Affiliations

¹ Bar Ilan University, Ramat Gan, Israel. Electronic address: jonathan.rabinowitz@biu.ac.il.
² Columbia University, Department of Psychiatry, c/o 2466 Westlake Ave N., #19, Seattle, WA 98109, USA.
³ Lundbeck A/S, Ottiliavej 9, 2500 Valby, Denmark.
⁴ UCLA, Department of Psychiatry and Biobehavioral Sciences, 760 Westwood Plaza, Ste. 28-224, Los Angeles, CA 90095, USA.
⁵ Janssen Scientific Affairs, LLC, 1125 Trenton-Harbourton Rd, Titusville, NJ 08560, USA.
⁶ Janssen Research & Development, 3210 Merryfield Row, San Diego, CA 92121, United States.
⁷ Signant Health, Prague, Czech Republic.
⁸ ARM Pharma Consulting, Deerfield, MI USA.
⁹ VeraSci, Durham, North, Carolina, USA.
¹⁰ Dept of Neurology & Psychiatry, U of South Alabama College of Medicine, Dept of Psychiatry & Health Behavior, Medical College of Georgia, USA.
¹¹ Valis Bioscience, Berkely, CA, USA.
¹² SUNY Downstate Medical Center, 450 Clarkson Avenue, MSC 1203, Brooklyn, NY, 11203, USA.

PMID: 36638966
DOI: 10.1016/j.jad.2023.01.029

Abstract

Background: Mitigating rating inconsistency can improve measurement fidelity and detection of treatment response.

Methods: The International Society for CNS Clinical Trials and Methodology convened an expert Working Group that developed consistency checks for ratings of the Hamilton Anxiety Rating Scale (HAM-A) and Clinical Global Impression of Severity of anxiety (CGIS) that are widely used in studies of mood and anxiety disorders. Flags were applied to 40,349 HAM-A administrations from 15 clinical trials and to Monte Carlo-simulated data as a proxy for applying flags under conditions of inconsistency.

Results: Thirty-three flags were derived these included logical consistency checks and statistical outlier-response pattern checks. Twenty-percent of the HAM-A administrations had at least one logical scoring inconsistency flag, 4 % had two or more. Twenty-six percent of the administrations had at least one statistical outlier flag and 11 % had two or more. Overall, 35 % of administrations had at least one flag of any type, 19 % had one and 16 % had 2 or more. Most of administrations in the Monte Carlo- simulated data raised multiple flags.

Limitations: Flagged ratings may represent less-common presentations of administrations done correctly. Conclusions-Application of flags to clinical ratings may aid in detecting imprecise measurement. Flags can be used for monitoring of raters during an ongoing trial and as part of post-trial evaluation. Appling flags may improve reliability and validity of trial data.

Keywords: Careless ratings; Consistency of measurement; HAM-A; Hamilton Anxiety Rating Scale; Inconsistent ratings.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Anxiety Disorders* / diagnosis
Anxiety Disorders* / drug therapy
Anxiety*
Humans
Psychiatric Status Rating Scales
Psychometrics
Reproducibility of Results