Estimation of surface-level NO2 and O3 concentrations using TROPOMI data and machine learning over East Asia

Environ Pollut. 2021 Nov 1:288:117711. doi: 10.1016/j.envpol.2021.117711. Epub 2021 Jul 16.

Abstract

In East Asia, air quality has been recognized as an important public health problem. In particular, the surface concentrations of air pollutants are closely related to human life. This study aims to develop models for estimating high spatial resolution surface concentrations of NO2 and O3 from TROPOspheric Monitoring Instrument (TROPOMI) data in East Asia. The machine learning was adopted by fusion of various satellite-based variables, numerical model-based meteorological variables, and land-use variables. Four machine learning approaches-Support Vector Regression (SVR), Random Forest (RF), Extreme Gradient Boost (XGB), and Light Gradient Boosting Machine (LGBM)-were evaluated and compared with Multiple Linear Regression (MLR) as a base statistical method. This study also modeled the NO2 and O3 concentrations over the ocean surface (i.e., land model for scheme 1 and ocean model for scheme 2). The estimated surface concentrations were validated through three cross-validation approaches (i.e., random, temporal, and spatial). The results showed that the NO2 model produced R2 of 0.63-0.70 and normalized root-mean-square-error (nRMSE) of 38.3-42.2% and the O3 model resulted in R2 of 0.65-0.78 and nRMSE of 19.6-24.7% for scheme 1. The indirect validation based on the stations near the coastline for scheme 2 showed slight decrease (~0.3-2.4%) in nRMSE when compared to scheme 1. The contributions of input variables to the models were analyzed based on SHapely Additive exPlanations (SHAP) values. The NO2 vertical column density among the TROPOMI-derived variables showed the largest contribution in both the NO2 and O3 models.

Keywords: Air quality; Satellite; Spatial and temporal distribution; Trace gas concentration.

MeSH terms

  • Air Pollutants* / analysis
  • Air Pollution* / analysis
  • Asia, Eastern
  • Environmental Monitoring
  • Humans
  • Machine Learning
  • Nitrogen Dioxide / analysis

Substances

  • Air Pollutants
  • Nitrogen Dioxide