Reinforcement learning versus proportional-integral-derivative control of hypnosis in a simulated intraoperative patient

Anesth Analg. 2011 Feb;112(2):350-9. doi: 10.1213/ANE.0b013e318202cb7c. Epub 2010 Dec 14.

Abstract

Background: Research has demonstrated the efficacy of closed-loop control of anesthesia using bispectral index (BIS) as the controlled variable. Model-based and proportional-integral-derivative (PID) controllers outperform manual control. We investigated the application of reinforcement learning (RL), an intelligent systems control method, to closed-loop BIS-guided, propofol-induced hypnosis in simulated intraoperative patients. We also compared the performance of the RL agent against that of a conventional PID controller.

Methods: The RL and PID controllers were evaluated during propofol induction and maintenance of hypnosis. The patient-hypnotic episodes were designed to challenge both controllers with varying degrees of interindividual variation and noxious surgical stimulation. Each controller was tested in 1000 simulated patients, and control performance was assessed by calculating the median performance error (MDPE), median absolute performance error (MDAPE), Wobble, and Divergence for each controller group. A separate analysis was performed for the induction and maintenance phases of hypnosis.

Results: During maintenance, RL control demonstrated an MDPE of -1% and an MDAPE of 3.75%, with 80% of the time at BIS(target) ± 5. The PID controller yielded a MDPE of -8.5% and an MDAPE of 8.6%, with 57% of the time at BIS(target) ± 5. In comparison, the MDAPE in the worst-controlled patient of the RL group was observed to be almost half that of the worst-controlled patient in the PID group.

Conclusions: When compared with the PID controller, RL control resulted in slower induction but less overshoot and faster attainment of steady state. No difference in interindividual patient variation and noxious destabilizing challenge on control performance was observed between the 2 patient groups.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Anesthesia, Closed-Circuit*
  • Anesthetics, Intravenous / administration & dosage*
  • Anesthetics, Intravenous / pharmacokinetics
  • Artificial Intelligence*
  • Computer Simulation*
  • Consciousness Monitors*
  • Dose-Response Relationship, Drug
  • Humans
  • Hypnosis, Anesthetic*
  • Intraoperative Period
  • Male
  • Models, Theoretical*
  • Monitoring, Intraoperative* / instrumentation
  • Monitoring, Intraoperative* / methods
  • Patient Simulation*
  • Pattern Recognition, Automated
  • Propofol / administration & dosage*
  • Propofol / pharmacokinetics
  • Signal Processing, Computer-Assisted
  • Young Adult

Substances

  • Anesthetics, Intravenous
  • Propofol