Applicability of machine learning in modeling of atmospheric particle pollution in Bangladesh

Air Qual Atmos Health. 2020;13(10):1247-1256. doi: 10.1007/s11869-020-00878-8. Epub 2020 Jul 20.

Abstract

Atmospheric particle pollution causes acute and chronic health effects. Predicting the concentrations of PM2.5 and PM10 is therefore a prerequisite for avoiding these consequences and mitigating their complications. This research applied several machine learning (ML) models, namely linear support vector machine (L-SVM), medium Gaussian support vector machine (M-SVM), Gaussian process regression (GPR), artificial neural network (ANN), and random forest regression (RFR), together with a time series model, PROPHET. Atmospheric NOx, SO2, CO, and O3, along with meteorological variables from Dhaka, Chattogram, Rajshahi, and Sylhet for the period 2013 to 2019, were used as explanatory variables. Results showed that GPR performed better overall, particularly for Dhaka, in predicting the concentrations of both PM2.5 and PM10, while ANN performed best for Chattogram and Sylhet in predicting PM2.5. For predicting PM10 in those two cities, however, M-SVM and RFR, respectively, were selected. This study therefore recommends using "ensemble learning" models that combine several of the best-performing models to advance the application of ML in predicting pollutant concentrations in Bangladesh.
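
The abstract does not specify how an ensemble would be built. As one possible illustration, the sketch below combines the predictions of several already-fitted models by a weighted average, with each model weighted by the inverse of its root-mean-square error (RMSE) on a validation set. The weighting scheme, the function names, and all numbers are assumptions made for illustration; they are not the study's method or data.

```python
import math


def rmse(y_true, y_pred):
    """Root-mean-square error between two equal-length sequences."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    squared = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    return math.sqrt(squared / len(y_true))


def inverse_rmse_weights(y_val, val_preds):
    """Weight each model by 1/RMSE on validation data, normalised to sum to 1."""
    inverse = {
        name: 1.0 / max(rmse(y_val, preds), 1e-12)  # guard against zero error
        for name, preds in val_preds.items()
    }
    total = sum(inverse.values())
    return {name: value / total for name, value in inverse.items()}


def ensemble_predict(weights, preds):
    """Weighted average of the per-model predictions, sample by sample."""
    n_samples = len(next(iter(preds.values())))
    return [
        sum(weights[name] * preds[name][i] for name in weights)
        for i in range(n_samples)
    ]


# Hypothetical PM2.5 values (ug/m3) and model outputs, invented for this
# sketch only; they are NOT results from the study.
y_val = [80.0, 120.0, 60.0, 150.0]
val_preds = {
    "GPR": [82.0, 118.0, 63.0, 146.0],
    "ANN": [75.0, 128.0, 55.0, 158.0],
    "RFR": [90.0, 110.0, 70.0, 140.0],
}

weights = inverse_rmse_weights(y_val, val_preds)
combined = ensemble_predict(weights, val_preds)

for name, weight in weights.items():
    print(f"{name}: weight={weight:.3f}, RMSE={rmse(y_val, val_preds[name]):.2f}")
print(f"ensemble RMSE={rmse(y_val, combined):.2f}")
```

In practice the weights would be estimated on one held-out set and the ensemble evaluated on another; the same data is reused here only to keep the sketch short. Because the weights are non-negative and sum to one, the ensemble's RMSE cannot exceed that of the worst individual model, although it may or may not beat the best one.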

Keywords: ANN; Bangladesh; GPR; Machine learning; PROPHET; Particulate matter; RFR; SVM.