Pioneering a Multi-Phase Framework to Harmonize Self-Reported Sleep Data Across Cohorts

Sleep. 2024 May 16:zsae115. doi: 10.1093/sleep/zsae115. Online ahead of print.

Abstract

Study objectives: Harmonizing and aggregating data across studies enable pooled analyses that support external validation and enhance replicability and generalizability. However, the multidimensional nature of sleep poses challenges for data harmonization and aggregation. Here we describe and implement our process for harmonizing self-reported sleep data.

Methods: We established a multi-phase framework to harmonize self-reported sleep data: (1) compile items; (2) group items into domains; (3) harmonize items; and (4) evaluate harmonizability. We applied this process to produce a pooled multi-cohort sample of five United States cohorts plus a separate yet fully harmonized sample from Rotterdam, Netherlands. Sleep and sociodemographic data are described and compared to demonstrate the utility of harmonization and aggregation.

Results: We collected 190 unique self-reported sleep items and grouped them into 15 conceptual domains. Using these domains as guiderails, we developed 14 harmonized items measuring aspects of Satisfaction, Alertness/Sleepiness, Timing, Efficiency, Duration, Insomnia, and Sleep Apnea. External raters determined that 13 of these 14 items had moderate-to-high harmonizability. Alertness/Sleepiness items had lower harmonizability, while continuous, quantitative items (e.g., timing, total sleep time, efficiency) had higher harmonizability. Descriptive statistics identified features that are more consistent (e.g., wake-up time, duration) and more heterogeneous (e.g., time in bed, bedtime) across samples.

Conclusions: Our process can guide researchers and cohort stewards towards effective sleep harmonization and provides a foundation for further methodological development in this expanding field. Broader national and international initiatives promoting common data elements across cohorts are needed to enhance future harmonization and aggregation efforts.

Keywords: data harmonization; data sharing; data standardization; sleep disorder; sleep health; sleepiness.