Partial Policy-Based Reinforcement Learning for Anatomical Landmark Localization in 3D Medical Images

IEEE Trans Med Imaging. 2020 Apr;39(4):1245-1255. doi: 10.1109/TMI.2019.2946345. Epub 2019 Oct 9.

Abstract

Utilizing the idea of long-term cumulative return, reinforcement learning (RL) has shown remarkable performance in various fields. We follow the formulation of landmark localization in 3D medical images as an RL problem. Whereas value-based methods have been widely used to solve RL-based localization problems, we adopt an actor-critic based direct policy search method framed in a temporal difference learning approach. In RL problems with large state and/or action spaces, learning the optimal behavior is challenging and requires many trials. To improve the learning, we introduce a partial policy-based reinforcement learning to enable solving the large problem of localization by learning the optimal policy on smaller partial domains. Independent actors efficiently learn the corresponding partial policies, each utilizing their own independent critic. The proposed policy reconstruction from the partial policies ensures a robust and efficient localization, where the sub-agents uniformly contribute to the state-transitions based on their simple partial policies mapping to binary actions. Experiments with three different localization problems in 3D CT and MR images showed that the proposed reinforcement learning requires a significantly smaller number of trials to learn the optimal behavior compared to the original behavior learning scheme in RL. It also ensures a satisfactory performance when trained on fewer images.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Anatomic Landmarks / diagnostic imaging*
  • Aorta / diagnostic imaging
  • Humans
  • Imaging, Three-Dimensional / methods*
  • Machine Learning*
  • Magnetic Resonance Imaging
  • Spine / diagnostic imaging
  • Tomography, X-Ray Computed