Reinforcement Learning for Radiotherapy Dose Fractioning Automation

Grégoire Moreau; Vincent François-Lavet; Paul Desbordes; Benoît Macq

doi:10.3390/biomedicines9020214

Reinforcement Learning for Radiotherapy Dose Fractioning Automation

Biomedicines. 2021 Feb 19;9(2):214. doi: 10.3390/biomedicines9020214.

Authors

Grégoire Moreau¹, Vincent François-Lavet¹, Paul Desbordes¹, Benoît Macq¹

Affiliation

¹ Institute of Information and Communication Technologies, Electronics and Applied Mathematics, UCLouvain, 1348 Louvain-la-Neuve, Belgium.

Abstract

External beam radiotherapy cancer treatment aims to deliver dose fractions to slowly destroy a tumor while avoiding severe side effects in surrounding healthy tissues. To automate the dose fraction schedules, this paper investigates how deep reinforcement learning approaches (based on deep Q network and deep deterministic policy gradient) can learn from a model of a mixture of tumor and healthy cells. A 2D tumor growth simulation is used to simulate radiation effects on tissues and thus training an agent to automatically optimize dose fractionation. Results show that initiating treatment with large dose per fraction, and then gradually reducing it, is preferred to the standard approach of using a constant dose per fraction.

Keywords: automatic treatment planning; cellular simulation; reinforcement learning.

Grants and funding

protherwall/Gouvernement Wallon