Safe deep reinforcement learning in diesel engine emission control

Armin Norouzi; Saeid Shahpouri; David Gordon; Mahdi Shahbakhti; Charles Robert Koch

doi:10.1177/09596518231153445

Proc Inst Mech Eng Part I J Syst Control Eng. 2023 Sep;237(8):1440-1453. doi: 10.1177/09596518231153445. Epub 2023 Feb 17.

Authors

Armin Norouzi¹, Saeid Shahpouri¹, David Gordon¹, Mahdi Shahbakhti¹, Charles Robert Koch¹

Affiliation

¹ Department of Mechanical Engineering, University of Alberta, Edmonton, AB, Canada.

Abstract

A deep reinforcement learning application is investigated for controlling the emissions of a compression ignition diesel engine. The main purpose of this study is to reduce the engine-out nitrogen oxide ($\mathrm{NO}_x$) emissions and to minimize fuel consumption while tracking a reference engine load. First, a physics-based engine simulation model is developed in GT-Power and calibrated using experimental data. Using this model and a GT-Power/Simulink co-simulation, a deep deterministic policy gradient (DDPG) controller is developed. To reduce the risk of an unwanted output, a safety filter is added to the deep reinforcement learning controller. Based on the simulation results, this filter has no effect on the final trained controller; however, during the training process, it is crucial for enforcing constraints on the controller output. The developed safe reinforcement learning controller is then compared with an iterative learning controller and a deep neural network-based nonlinear model predictive controller. This comparison shows that safe reinforcement learning is capable of accurately tracking an arbitrary reference input, while the iterative learning controller is limited to a repetitive reference. The comparison between nonlinear model predictive control and reinforcement learning indicates that, for this case, reinforcement learning is able to learn the optimal control output directly from the experiment without the need for a model. However, to enforce output constraints for safe reinforcement learning, a simple model of the system is required. In this work, reinforcement learning was able to reduce $\mathrm{NO}_x$ emissions more than nonlinear model predictive control; however, it suffered from slightly higher load-tracking error and higher fuel consumption.
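
The idea of a safety filter that uses a simple system model to constrain the controller output can be illustrated with a minimal sketch. This is not the filter used in the study: the scalar action, the static linear model `LinearOutputModel`, the function `safety_filter`, and all numeric values are assumptions made purely for illustration. The sketch clips a proposed action to its input bounds and, if the model predicts a constraint violation, moves the action to the nearest value the model predicts to be admissible.

```python
from dataclasses import dataclass


@dataclass
class LinearOutputModel:
    """Hypothetical static model y = gain * u + offset (an assumption, not from the paper)."""
    gain: float
    offset: float

    def predict(self, u: float) -> float:
        return self.gain * u + self.offset


def safety_filter(u_rl: float, model: LinearOutputModel,
                  u_min: float, u_max: float, y_max: float) -> float:
    """Adjust the proposed action u_rl so the predicted output stays at or below y_max.

    Input bounds take priority: if no action inside [u_min, u_max] satisfies
    the output bound, the nearest input bound is returned.
    """
    # Enforce the actuator limits first.
    u = min(max(u_rl, u_min), u_max)
    if model.predict(u) > y_max and model.gain != 0.0:
        # Action at which the modelled output reaches y_max exactly.
        u_limit = (y_max - model.offset) / model.gain
        # Positive gain: the output bound is an upper limit on u.
        # Negative gain: it is a lower limit on u.
        u = min(u, u_limit) if model.gain > 0 else max(u, u_limit)
        # Re-apply the actuator limits after the correction.
        u = min(max(u, u_min), u_max)
    return u


# Illustrative values only.
model = LinearOutputModel(gain=2.0, offset=1.0)
safe_u = safety_filter(8.0, model, u_min=0.0, u_max=10.0, y_max=11.0)
```

In a training loop, a filter of this kind would sit between the agent's exploratory action and the plant, which matches the abstract's observation that the filter matters during training even if it leaves the final trained policy unchanged.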

Keywords: Machine learning; deep learning; diesel engine; emission control; iterative learning control; reinforcement learning; safe learning.