Multistation collaborative prediction of air pollutants based on the CNN-BiLSTM model

Environ Sci Pollut Res Int. 2023 Aug;30(40):92417-92435. doi: 10.1007/s11356-023-28877-z. Epub 2023 Jul 25.

Abstract

Industrial development has led to serious air pollution, so it is important to establish accurate, high-performance air quality prediction models and to take corresponding control measures. In this paper, based on 4 years of air quality and meteorological data from Tianjin, China, the relationships between various meteorological factors and air pollutant concentrations are analyzed. A hybrid deep learning model consisting of a convolutional neural network (CNN) and bidirectional long short-term memory (BiLSTM) is proposed to predict pollutant concentrations. In addition, a Bayesian optimization algorithm is applied to obtain the optimal combination of hyperparameters for the proposed deep learning model, which enhances the generalization ability of the model. Furthermore, based on air quality data from multiple stations in the region, a multistation collaborative prediction method is designed, and the concept of a strongly correlated station (SCS) is defined. The predictive model is modified using the idea of SCS and is used to predict pollutant concentrations in Tianjin. The coefficients of determination (R²) for PM2.5, PM10, SO2, NO2, CO, and O3 are 0.89, 0.84, 0.69, 0.83, 0.92, and 0.84, respectively. The results show that the model predicts air pollutant concentrations with satisfactory accuracy.
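The abstract does not spell out how a strongly correlated station is selected or how the reported scores were computed, so the following is only a minimal sketch under stated assumptions. It shows the standard coefficient of determination, 1 - SS_res / SS_tot, and one plausible SCS rule: keep neighbouring stations whose Pearson correlation with the target station meets a threshold. The function names, the `threshold` value of 0.8, and the use of Pearson correlation are all assumptions made for illustration, not the authors' published procedure.

```python
import numpy as np


def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    ss_res = np.sum((y_true - y_pred) ** 2)          # residual sum of squares
    ss_tot = np.sum((y_true - y_true.mean()) ** 2)   # total sum of squares
    return 1.0 - ss_res / ss_tot


def strongly_correlated_stations(series, target, threshold=0.8):
    """Hypothetical SCS rule (an assumption, not taken from the paper):
    return the stations whose Pearson correlation with the target
    station's concentration series is at least `threshold`."""
    ref = np.asarray(series[target], dtype=float)
    selected = []
    for name, values in series.items():
        if name == target:
            continue
        r = np.corrcoef(ref, np.asarray(values, dtype=float))[0, 1]
        if r >= threshold:
            selected.append(name)
    return selected


# Toy concentration series for three made-up stations.
stations = {
    "A": [1.0, 2.0, 3.0, 4.0, 5.0],
    "B": [2.0, 4.0, 6.0, 8.0, 10.5],   # tracks A closely
    "C": [5.0, 1.0, 4.0, 2.0, 3.0],    # weakly, negatively related to A
}
scs = strongly_correlated_stations(stations, target="A")
```

In a multistation setup along these lines, the series from the selected stations would be stacked with the target station's own history as extra input channels for the predictive model, and `r2_score` would be evaluated per pollutant on held-out data.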

Keywords: CNN-BiLSTM; Deep learning; Multistation collaborative prediction; PM2.5; Pollutant concentration prediction; Strongly correlated station.

MeSH terms

  • Air Pollutants* / analysis
  • Air Pollution* / analysis
  • Bayes Theorem
  • China
  • Environmental Monitoring / methods
  • Neural Networks, Computer
  • Particulate Matter / analysis

Substances

  • Air Pollutants
  • Particulate Matter