Hierarchical Deep Reinforcement Learning-Based Propofol Infusion Assistant Framework in Anesthesia

IEEE Trans Neural Netw Learn Syst. 2024 Feb;35(2):2510-2521. doi: 10.1109/TNNLS.2022.3190379. Epub 2024 Feb 5.

Abstract

This article aims to provide a hierarchical reinforcement learning (RL)-based solution to the automated drug infusion field. The learning policy is divided into the tasks of: 1) learning trajectory generative model and 2) planning policy model. The proposed deep infusion assistant policy gradient (DIAPG) model draws inspiration from adversarial autoencoders (AAEs) and learns latent representations of hypnotic depth trajectories. Given the trajectories drawn from the generative model, the planning policy infers a dose of propofol for stable sedation of a patient under total intravenous anesthesia (TIVA) using propofol and remifentanil. Through extensive evaluation, the DIAPG model can effectively stabilize bispectral index (BIS) and effect site concentration given a potentially time-varying target sequence. The proposed DIAPG shows an increased performance of 530% and 15% when a human expert and a standard reinforcement algorithm are used to infuse drugs, respectively.

MeSH terms

  • Anesthesia*
  • Anesthetics, Intravenous
  • Electroencephalography
  • Humans
  • Neural Networks, Computer
  • Piperidines
  • Propofol*

Substances

  • Propofol
  • Anesthetics, Intravenous
  • Piperidines