A tree-based modeling approach for matched case-control studies

Stat Med. 2023 Feb 28;42(5):676-692. doi: 10.1002/sim.9637. Epub 2023 Jan 11.

Abstract

Conditional logistic regression (CLR) is the indisputable standard method for the analysis of matched case-control studies. However, CLR is strongly restricted with respect to the inclusion of non-linear effects and interactions of confounding variables. A novel tree-based modeling method is proposed which accounts for this issue and provides a flexible framework allowing for a more complex confounding structure. The proposed machine learning model is fitted within the framework of CLR and, therefore, allows to account for the matched strata in the data. A simulation study demonstrates the efficacy of the method. Furthermore, for illustration the method is applied to a matched case-control study on cervical cancer.

Keywords: CART; conditional inference trees; conditional logistic regression; matched case-control studies; matched pairs.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Case-Control Studies
  • Computer Simulation
  • Humans
  • Logistic Models
  • Machine Learning*