A novel machine learning method for evaluating the impact of emission sources on ozone formation

Environ Pollut. 2023 Jan 1;316(Pt 2):120685. doi: 10.1016/j.envpol.2022.120685. Epub 2022 Nov 15.

Abstract

Ambient ozone air pollution is one of the most important environmental challenges in China today, and it is particularly significant to identify pollution sources and formulate control strategies. In present study, we proposed a novel method of positive matrix factorization-SHapley Additive explanation (PMF-SHAP) for evaluating the impact of emission sources on ozone formation, which can quantify the main emission sources of ozone pollution. In this method, we first used the PMF model to identify the source of volatile organic compounds (VOCs), and then quantified various emission sources using a combination of machine learning (ML) models and the SHAP algorithm. The R2 of the optimal ML model in this method was as high as 0.96, indicating that the prediction performance was excellent. Furthermore, we explored the impact of different emission sources on ozone formation, and found that ozone formation in Shenzhen was more affected by VOCs, of which vehicle emission sources may have the greatest impact. Our results suggest that the appropriate combination of traditional models with ML models can well address environmental pollution problems. Moreover, the conclusions obtained based on the PMF-SHAP method were different from the traditional ozone formation potential (OFP) results, providing valuable clues for related mechanism studies.

Keywords: Emission sources; Machine learning; Ozone; SHAP; VOCs.

MeSH terms

  • Air Pollution*
  • Environmental Pollution
  • Machine Learning
  • Ozone* / toxicity
  • Volatile Organic Compounds*

Substances

  • Ozone
  • Volatile Organic Compounds