An autonomous operational trajectory searching system for an economic and environmental membrane bioreactor plant using deep reinforcement learning

Water Sci Technol. 2020 Apr;81(8):1578-1587. doi: 10.2166/wst.2020.053.

Abstract

Optimal operation of membrane bioreactor (MBR) plants is crucial to save operational costs while satisfying legal effluent discharge requirements. The aeration process of MBR plants tends to use excessive energy for supplying air to micro-organisms. In the present study, a novel optimal aeration system is proposed for dynamic and robust optimization. Accordingly, a deep reinforcement learning (DRL)-based optimal operating system is proposed, so as to meet stringent discharge qualities while maximizing the system's energy efficiency. Additionally, it is compared with the manual system and conventional reinforcement learning (RL)-based systems. A deep Q-network (DQN) algorithm automatically learns how to operate the plant efficiently by finding an optimal trajectory to reduce the aeration energy without degrading the treated water quality. A full-scale MBR plant with the DQN-based autonomous aeration system can decrease the MBR's aeration energy consumption by 34% compared to other aeration systems while maintaining the treatment efficiency within effluent discharge limits.

MeSH terms

  • Algorithms
  • Bioreactors*
  • Membranes, Artificial
  • Waste Disposal, Fluid*

Substances

  • Membranes, Artificial