Fusion Method Combining Ground-Level Observations with Chemical Transport Model Predictions Using an Ensemble Deep Learning Framework: Application in China to Estimate Spatiotemporally-Resolved PM2.5 Exposure Fields in 2014-2017

Baolei Lyu; Yongtao Hu; Wenxian Zhang; Yunsong Du; Bin Luo; Xiaoling Sun; Zhe Sun; Zhu Deng; Xiaojiang Wang; Jun Liu; Xuesong Wang; Armistead G Russell

doi:10.1021/acs.est.9b01117

Fusion Method Combining Ground-Level Observations with Chemical Transport Model Predictions Using an Ensemble Deep Learning Framework: Application in China to Estimate Spatiotemporally-Resolved PM_2.5 Exposure Fields in 2014-2017

Environ Sci Technol. 2019 Jul 2;53(13):7306-7315. doi: 10.1021/acs.est.9b01117. Epub 2019 Jun 21.

Authors

Baolei Lyu¹, Yongtao Hu², Wenxian Zhang³, Yunsong Du⁴, Bin Luo⁴, Xiaoling Sun⁵, Zhe Sun⁶, Zhu Deng⁶, Xiaojiang Wang¹, Jun Liu¹, Xuesong Wang⁷, Armistead G Russell²

Affiliations

¹ Huayun Sounding Meteorological Technology Company, Limited , Beijing 100081 , P. R. China.
² School of Civil and Environmental Engineering , Georgia Institute of Technology , Atlanta , Georgia 30332 , United States.
³ Hangzhou AiMa Technologies , Hangzhou , Zhejiang 311121 , P. R. China.
⁴ Sichuan Environmental Monitoring Center , Chengdu , Sichuan 610091 , P. R. China.
⁵ Meteorological Bureau of Shenzhen Municipality , ShenZhen , Guangdong 518040 , P. R. China.
⁶ Department of Earth System Science , Tsinghua University , Beijing 100084 , P. R. China.
⁷ State Key Joint Laboratory of Environmental Simulation and Pollution Control, College of Environmental Sciences and Engineering , Peking University , Beijing 100871 , China.

PMID: 31244060
DOI: 10.1021/acs.est.9b01117

Abstract

Atmospheric chemical transport models (CTMs) have been widely used to simulate spatiotemporally resolved PM_2.5 concentrations. However, CTM results are usually prone to bias and errors. In this study, we improved the accuracy of PM_2.5 predictions by developing an ensemble deep learning framework to fuse model simulations with ground-level observations. The framework encompasses four machine-learning models, i.e., general linear model, fully connected neural network, random forest, and gradient boosting machine, and combines them by stacking approach. This framework is applied to PM_2.5 concentrations simulated by the Community Multiscale Air Quality (CMAQ) model for China from 2014 to 2017, which has complete spatial coverage over the entirety of China at a 12-km resolution, with no sampling biases. The fused PM_2.5 concentration fields were evaluated by comparing with an independent network of observations. The R² values increased from 0.39 to 0.64, and the RMSE values decreased from 33.7 μg/m³ to 24.8 μg/m³. According to the fused data, the percentage of Chinese population residing under the level II National Ambient Air Quality Standards of 35 μg/m³ for PM_2.5 has increased from 46.5% in 2014 to 61.7% in 2017. The method is readily adapted to utilize near-real-time observations for operational analyses and forecasting of pollutant concentrations and can be extended to provide source apportionment forecasts as well.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Air Pollutants*
Air Pollution*
China
Deep Learning
Environmental Monitoring
Particulate Matter

Substances

Air Pollutants
Particulate Matter