Reward ignorant modeling of dynamic treatment regimes

Michael P Wallace; Erica E M Moodie; David A Stephens

doi:10.1002/bimj.201700322

Reward ignorant modeling of dynamic treatment regimes

Biom J. 2018 Sep;60(5):991-1002. doi: 10.1002/bimj.201700322. Epub 2018 May 30.

Authors

Michael P Wallace¹, Erica E M Moodie², David A Stephens³

Affiliations

¹ Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, ON, N2L 3G1, Canada.
² Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Montreal, QC, H3A 0G4, Canada.
³ Department of Mathematics and Statistics, McGill University, Montreal, QC, H3A 0G4, Canada.

PMID: 29845644
DOI: 10.1002/bimj.201700322

Abstract

Personalized medicine optimizes patient outcome by tailoring treatments to patient-level characteristics. This approach is formalized by dynamic treatment regimes (DTRs): decision rules that take patient information as input and output recommended treatment decisions. The DTR literature has seen the development of increasingly sophisticated causal inference techniques that attempt to address the limitations of our typically observational datasets. Often overlooked, however, is that in practice most patients may be expected to receive optimal or near-optimal treatment, and so the outcome used as part of a typical DTR analysis may provide limited information. In light of this, we propose considering a more standard analysis: ignore the outcome and elicit an optimal DTR by modeling the observed treatment as a function of relevant covariates. This offers a far simpler analysis and, in some settings, improved optimal treatment identification. To distinguish this approach from more traditional DTR analyses, we term it reward ignorant modeling, and also introduce the concept of multimethod analysis, whereby different analysis methods are used in settings with multiple treatment decisions. We demonstrate this concept through a variety of simulation studies, and through analysis of data from the International Warfarin Pharmacogenetics Consortium, which also serve as motivation for this work.

Keywords: adaptive treatment strategies; causal inference; dynamic treatment regimes; personalized medicine.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Biometry / methods*
Humans
Models, Statistical*
Precision Medicine*
Sample Size
Treatment Outcome

Grants and funding

Project Grant/CIHR/Canada