Importance of secondary decomposition in the accurate prediction of daily-scale ozone pollution by machine learning

Sci Total Environ. 2023 Dec 15:904:166963. doi: 10.1016/j.scitotenv.2023.166963. Epub 2023 Sep 9.

Abstract

Machine learning (ML) models have been proven as a reliable tool in predicting ambient pollution concentrations at various places in the world. However, their performance in predicting the maximum daily 8-h averaged ozone (MDA8 O3), the metric often used for O3 pollution assessment and management, is relatively poorer. This is largely resulted from more irregular data fluctuations of the MDA8 O3 levels governed collectively by the synoptic condition, local photochemistry, and long-range transport. In order to improve the prediction accuracy of MDA8 O3, this study developed a secondary decomposition ML model framework which coupled the complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) as the primary decomposition, the variational mode decomposition (VMD) as secondary decomposition, and the gate recurrent unit (GRU) ML model. By applying this secondary decomposition model framework on MDA8 O3 prediction for the first time, we showed that the prediction accuracy of MDA8 O3 is largely improved from R2 of 0.46 and RMSE of 30.4 μg/m3 for GRU without decomposition to R2 of 0.91 and RMSE of 12.6 μg/m3 over the Pearl River Delta of China. We also found that the prediction accuracy rate of O3 pollution non-attainments, an essential indicator for initiating contingency O3 pollution control, improved greatly from 14.9 % for GRU without decomposition to 72.5 %. The performance of O3 pollution non-attainment prediction is relatively higher in southwestern PRD, which is mainly due to greater number and severity of O3 non-attainments in southwestern cities located downwind of the emission hotspot area at central PRD. This study underscored the importance of secondary decomposition in accurately predicting daily-scale O3 concentration and non-attainments over the PRD, which can be extended to other photochemically active region worldwide to improve their O3 prediction accuracy and assist in O3 contingency control.

Keywords: MDA8; Machine learning; Ozone; Pearl River Delta; Secondary decomposition.