Optimal medication dosing from suboptimal clinical examples: a deep reinforcement learning approach

Shamim Nemati; Mohammad M Ghassemi; Gari D Clifford

doi:10.1109/EMBC.2016.7591355

Optimal medication dosing from suboptimal clinical examples: a deep reinforcement learning approach

Annu Int Conf IEEE Eng Med Biol Soc. 2016 Aug:2016:2978-2981. doi: 10.1109/EMBC.2016.7591355.

Authors

Shamim Nemati, Mohammad M Ghassemi, Gari D Clifford

PMID: 28268938
DOI: 10.1109/EMBC.2016.7591355

Abstract

Misdosing medications with sensitive therapeutic windows, such as heparin, can place patients at unnecessary risk, increase length of hospital stay, and lead to wasted hospital resources. In this work, we present a clinician-in-the-loop sequential decision making framework, which provides an individualized dosing policy adapted to each patient's evolving clinical phenotype. We employed retrospective data from the publicly available MIMIC II intensive care unit database, and developed a deep reinforcement learning algorithm that learns an optimal heparin dosing policy from sample dosing trails and their associated outcomes in large electronic medical records. Using separate training and testing datasets, our model was observed to be effective in proposing heparin doses that resulted in better expected outcomes than the clinical guidelines. Our results demonstrate that a sequential modeling approach, learned from retrospective data, could potentially be used at the bedside to derive individualized patient dosing policies.

MeSH terms

Algorithms
Databases, Factual
Dose-Response Relationship, Drug
Heparin / administration & dosage*
Heparin / pharmacology*
Humans
Learning*
Length of Stay
Markov Chains
Reinforcement, Psychology*
Retrospective Studies

Substances

Heparin

Grants and funding

K01 ES025445/ES/NIEHS NIH HHS/United States