LaSVM-based big data learning system for dynamic prediction of air pollution in Tehran

Environ Monit Assess. 2018 Apr 20;190(5):300. doi: 10.1007/s10661-018-6659-6.

Abstract

Because of the critical impacts of air pollution, predicting and monitoring air quality in urban areas are important tasks. However, because pollutant concentrations are dynamic and vary strongly in space and time, predicting them is a complex spatio-temporal problem. The distribution of pollutant concentration is influenced by various factors such as historical pollution data and weather conditions. Conventional methods such as the support vector machine (SVM) or artificial neural networks (ANN) show some deficiencies when huge amounts of streaming data have to be analyzed for urban air pollution prediction. To overcome the limitations of the conventional methods and improve the performance of urban air pollution prediction in Tehran, a spatio-temporal system is designed using a LaSVM-based online algorithm. Pollutant concentration and meteorological data, along with geographical parameters, are continually fed to the developed online forecasting system. The performance of the system is evaluated by comparing its predictions of the Air Quality Index (AQI) with those of a traditional SVM algorithm. Results show an outstanding increase in speed for the online algorithm while preserving the accuracy of the SVM classifier. Comparison of the hourly predictions for the next 24 h with the pollution data measured at Tehran pollution monitoring stations shows an overall accuracy of 0.71, a root mean square error of 0.54, and a coefficient of determination of 0.81. These results indicate the practical usefulness of the online algorithm for real-time spatial and temporal prediction of urban air quality.
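The three evaluation figures quoted in the abstract (overall accuracy, root mean square error, and coefficient of determination) can be illustrated with a minimal sketch. The abstract does not state the exact formulas or the AQI encoding used, so the following assumes that AQI is coded as ordinal integer levels and that the coefficient of determination is computed as 1 - SS_res / SS_tot; the function names and the sample values are hypothetical and are not data from the study.

```python
import math

def overall_accuracy(measured, predicted):
    # Fraction of hours whose predicted AQI level equals the measured level.
    hits = sum(1 for m, p in zip(measured, predicted) if m == p)
    return hits / len(measured)

def root_mean_square_error(measured, predicted):
    # Square root of the mean squared difference between levels.
    squared = [(m - p) ** 2 for m, p in zip(measured, predicted)]
    return math.sqrt(sum(squared) / len(squared))

def coefficient_of_determination(measured, predicted):
    # Assumed definition: 1 - SS_res / SS_tot (the abstract does not specify).
    mean_m = sum(measured) / len(measured)
    ss_res = sum((m - p) ** 2 for m, p in zip(measured, predicted))
    ss_tot = sum((m - mean_m) ** 2 for m in measured)
    return 1.0 - ss_res / ss_tot

# Hypothetical AQI levels for eight hours (illustrative only).
measured = [1, 2, 2, 3, 4, 3, 2, 1]
predicted = [1, 2, 3, 3, 4, 2, 2, 1]

acc = overall_accuracy(measured, predicted)
err = root_mean_square_error(measured, predicted)
r2 = coefficient_of_determination(measured, predicted)
print(f"accuracy={acc:.2f} rmse={err:.2f} r2={r2:.2f}")
```

In this toy series, six of eight hours match and the two misses are each off by one level, which shows how accuracy can be moderate while the error magnitude stays small on an ordinal scale.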

Keywords: Big data; LaSVM; Online prediction; Spatio-temporal; Tehran; Urban air quality.

MeSH terms

  • Air Pollutants / analysis
  • Air Pollution / analysis
  • Air Pollution / statistics & numerical data*
  • Environmental Monitoring / methods*
  • Environmental Pollution
  • Forecasting
  • Iran
  • Neural Networks, Computer
  • Particulate Matter / analysis
  • Support Vector Machine
  • Weather

Substances

  • Air Pollutants
  • Particulate Matter