A novel approach for the prediction and analysis of daily concentrations of particulate matter using machine learning

Balamurugan Panneerselvam; Nagavinothini Ravichandran; Umesh Chandra Dumka; Maciej Thomas; Warit Charoenlerkthawin; Butsawan Bidorn

doi:10.1016/j.scitotenv.2023.166178

Sci Total Environ. 2023 Nov 1:897:166178. doi: 10.1016/j.scitotenv.2023.166178. Epub 2023 Aug 9.

Authors

Balamurugan Panneerselvam¹, Nagavinothini Ravichandran¹, Umesh Chandra Dumka², Maciej Thomas³, Warit Charoenlerkthawin⁴, Butsawan Bidorn⁵

Affiliations

¹ Center of Excellence in Interdisciplinary Research for Sustainable Development, Faculty of Engineering, Chulalongkorn University, Bangkok 10330, Thailand.
² Aryabhatta Research Institute of Observational Sciences, Nainital 263001, India.
³ Faculty of Environmental Engineering and Energy, Cracow University of Technology, Cracow 31155, Poland.
⁴ Center of Excellence in Interdisciplinary Research for Sustainable Development, Faculty of Engineering, Chulalongkorn University, Bangkok 10330, Thailand; Department of Water Resources Engineering, Chulalongkorn University, Bangkok 10330, Thailand.
⁵ Center of Excellence in Interdisciplinary Research for Sustainable Development, Faculty of Engineering, Chulalongkorn University, Bangkok 10330, Thailand; Department of Water Resources Engineering, Chulalongkorn University, Bangkok 10330, Thailand. Electronic address: butsawan.p@chula.ac.th.

PMID: 37562623
DOI: 10.1016/j.scitotenv.2023.166178

Abstract

Traditional air quality analysis and prediction methods depend on statistical and numerical analyses of historical air quality data supplemented with region-specific information; as a result, their predictions are unsatisfactory. In particular, particulate matter (PM₂.₅ and PM₁₀) in the atmosphere is a major concern for human health. The modelling (analysis and prediction) of particulate matter concentrations remains unsatisfactory owing to the rapid increase in urbanization and industrialization. In the present study, we developed prediction models for both PM₂.₅ and PM₁₀ under varying meteorological conditions (wind speed, temperature, precipitation, specific humidity, and air pressure) at two observation stations in the study region. The analysis of particulate matter shows that seasonal variation is a primary factor influencing air pollutant concentrations in urban regions. Based on historical data, the number of days on which the PM₂.₅ concentration exceeded the maximum permissible level (15 μg/m³) was highest during the winter season, reaching 92 days in 2019. Among the models compared, the Gaussian process regression model performed best, with larger R² values and smaller errors than the other models. Based on the analysis and prediction, these novel methods may enhance the accuracy of particulate matter prediction and inform policy- and decision-makers among pollution control authorities in protecting air quality.
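
To illustrate the kind of model the abstract refers to, the following minimal sketch fits a Gaussian process regression to a handful of daily observations and scores it with R². It is only a sketch under stated assumptions: the data values are made up, the two standardized features (wind speed and temperature), the squared-exponential kernel, the noise level, and all function names are choices made here for illustration, and none of them are taken from the study.

```python
import math

def rbf_kernel(a, b, length_scale=1.0):
    """Squared-exponential covariance between two feature vectors."""
    sq_dist = sum((x - y) ** 2 for x, y in zip(a, b))
    return math.exp(-0.5 * sq_dist / length_scale ** 2)

def cholesky(matrix):
    """Lower-triangular Cholesky factor of a symmetric positive-definite matrix."""
    n = len(matrix)
    lower = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(lower[i][k] * lower[j][k] for k in range(j))
            if i == j:
                lower[i][j] = math.sqrt(matrix[i][i] - s)
            else:
                lower[i][j] = (matrix[i][j] - s) / lower[j][j]
    return lower

def cholesky_solve(lower, rhs):
    """Solve (L L^T) x = rhs by forward, then backward, substitution."""
    n = len(rhs)
    y = [0.0] * n
    for i in range(n):
        y[i] = (rhs[i] - sum(lower[i][k] * y[k] for k in range(i))) / lower[i][i]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (y[i] - sum(lower[k][i] * x[k] for k in range(i + 1, n))) / lower[i][i]
    return x

def gpr_fit(features, targets, noise=1e-2, length_scale=1.0):
    """Precompute the weights (K + noise*I)^-1 (y - mean) used by the posterior mean."""
    n = len(features)
    mean = sum(targets) / n
    gram = [[rbf_kernel(features[i], features[j], length_scale)
             + (noise if i == j else 0.0) for j in range(n)] for i in range(n)]
    alpha = cholesky_solve(cholesky(gram), [t - mean for t in targets])
    return {"features": features, "alpha": alpha, "mean": mean,
            "length_scale": length_scale}

def gpr_predict(model, x):
    """Posterior mean of the Gaussian process at a new feature vector x."""
    k = [rbf_kernel(x, f, model["length_scale"]) for f in model["features"]]
    return model["mean"] + sum(ki * ai for ki, ai in zip(k, model["alpha"]))

def r_squared(observed, predicted):
    """Coefficient of determination, R^2 = 1 - SS_res / SS_tot."""
    mean = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot

# Made-up daily records: (standardized wind speed, standardized temperature).
features = [(-2.0, -1.0), (0.0, -1.0), (2.0, -1.0),
            (-2.0, 1.0), (0.0, 1.0), (2.0, 1.0)]
# Made-up PM2.5 concentrations in ug/m^3 for the same days.
pm25 = [38.0, 27.0, 18.0, 30.0, 21.0, 12.0]

model = gpr_fit(features, pm25)
fitted = [gpr_predict(model, f) for f in features]
train_r2 = r_squared(pm25, fitted)
print("training R^2:", train_r2)
print("predicted PM2.5 at (1.0, 0.0):", gpr_predict(model, (1.0, 0.0)))
```

In practice the model would be scored on held-out days rather than on the training days used here, and the kernel hyperparameters would be tuned rather than fixed by hand.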

Keywords: Gaussian process regression; Machine learning; Meteorological condition; Particulate matters; Prediction.