Effects of data preprocessing on results of the epidemiological analysis of coronary heart disease and behaviour-related risk factors

Ann Med. 2021 Dec;53(1):890-899. doi: 10.1080/07853890.2021.1921838.

Abstract

Background: We carried out this study to demonstrate the effects of outcome sensitivity, participant exclusions, and covariate manipulations on results of the epidemiological analysis of coronary heart disease (CHD) and its behaviour-related risk factors.

Material and methods: Our study population consisted of 1592 54-year-old men, who participated in the Kuopio Ischaemic Heart Disease Risk Factor (KIHD) Study. We used the Cox proportional-hazards model to predict the hazard of CHD and applied different sets of outcomes concerning outcome sensitivity and data preprocessing procedures regarding participant exclusions and covariate manipulations.

Results: The mean follow-up time was 23 years, and 730 men received the CHD diagnosis. Cox regressions based on data with no participant exclusions most often discovered statistically significant associations. Loose inclusion criteria for study participants with any CVD during the follow-up and strict exclusion criteria for participants with no CVD were best in discovering the associations between risk factors and CHD. Outcome sensitivity affected the associations, whereas the covariate type, continuous or categorical, did not.

Conclusions: This study suggests that excluding study participants who are not disease-free at baseline is probably unnecessary for epidemiological analyses. Epidemiological research reports should present results based on no data exclusions together with results based on reasoned exclusions.

Keywords: Categorical covariate; continuous covariate; coronary heart disease; exclusion criterion; outcome sensitivity.

MeSH terms

  • Coronary Disease* / epidemiology
  • Data Analysis
  • Epidemiologic Measurements
  • Humans
  • Male
  • Middle Aged
  • Proportional Hazards Models
  • Risk Factors