Adaptable control policies for variable liquid chromatography columns using deep reinforcement learning

Sci Rep. 2023 Jul 12;13(1):11270. doi: 10.1038/s41598-023-38145-2.

Abstract

Controlling chromatography systems for downstream processing of biotherapeutics is challenging because of the highly nonlinear behavior of feed components and complex interactions with binding phases. This challenge is exacerbated by the highly variable binding properties of the chromatography columns. Furthermore, the inability to collect information inside chromatography columns makes real-time control even more problematic. Typical static control policies either perform suboptimally on average owing to column variability or must be adapted to each column through expensive experimentation. Exploiting recent advances in simulation-based data generation and deep reinforcement learning, we present an adaptable control policy that is learned in a data-driven manner. Our controller learns a control policy by directly manipulating the inlet and outlet flow rates to optimize a reward function that specifies the desired outcome. Training our controller on columns with high variability enables us to create a single policy that adapts to multiple variable columns. Moreover, we show that our learned policy achieves higher productivity, albeit with somewhat lower purity, than a human-designed benchmark policy. Our study shows that deep reinforcement learning offers a promising route to developing adaptable control policies for more efficient liquid chromatography processing.
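The control setup described above (flow-rate actions, a reward encoding the desired outcome, training across columns with variable binding properties) can be sketched as a minimal reinforcement-learning loop. The paper trains a deep RL policy on a mechanistic column simulator; the toy below instead uses tabular Q-learning on a drastically simplified column model, and every class name, constant, and dynamic in it is an illustrative assumption, not the authors' implementation.

```python
import random

class ToyColumn:
    """Hypothetical stand-in for a chromatography column simulator."""
    def __init__(self, capacity=10):
        self.capacity = capacity  # binding capacity; varies between columns
        self.reset()

    def reset(self):
        self.bound = 0  # product mass currently bound to the column
        self.t = 0
        return (self.bound, self.t)

    def step(self, flow):
        # Action: inlet flow level in {0, 1, 2} (low / medium / high).
        self.t += 1
        loaded = min(flow, self.capacity - self.bound)
        self.bound += loaded
        breakthrough = max(0, flow - loaded)  # overload lets impurities through
        done = self.t >= 8
        # Reward trades off productivity (mass loaded) against purity loss.
        reward = loaded - 2 * breakthrough
        return (self.bound, self.t), reward, done

def train(episodes=2000, eps=0.1, alpha=0.5, gamma=0.95, seed=0):
    rng = random.Random(seed)
    q = {}  # tabular Q-values: (state, action) -> estimated return
    env = ToyColumn()
    for _ in range(episodes):
        # Train over variable columns so one policy adapts to all of them.
        env.capacity = rng.choice([8, 10, 12])
        s, done = env.reset(), False
        while not done:
            # Epsilon-greedy action selection over the three flow levels.
            if rng.random() < eps:
                a = rng.randrange(3)
            else:
                a = max(range(3), key=lambda x: q.get((s, x), 0.0))
            s2, r, done = env.step(a)
            best_next = max(q.get((s2, x), 0.0) for x in range(3))
            target = r + gamma * (0.0 if done else best_next)
            q[(s, a)] = q.get((s, a), 0.0) + alpha * (target - q.get((s, a), 0.0))
            s = s2
    return q
```

In this sketch the column capacity is hidden from the policy, so the learned behavior must hedge across column variability, mirroring (in miniature) why a single policy trained on high-variability columns can outperform a static policy tuned to one nominal column.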