Application of Machine Learning for the in-Field Correction of a PM2.5 Low-Cost Sensor Network

Wen-Cheng Vincent Wang; Shih-Chun Candice Lung; Chun-Hu Liu

doi:10.3390/s20175002

Application of Machine Learning for the in-Field Correction of a PM_2.5 Low-Cost Sensor Network

Sensors (Basel). 2020 Sep 3;20(17):5002. doi: 10.3390/s20175002.

Authors

Wen-Cheng Vincent Wang¹, Shih-Chun Candice Lung^{1

2

3}, Chun-Hu Liu¹

Affiliations

¹ Research Center for Environmental Changes, Academia Sinica, Nangang, Taipei 115, Taiwan.
² Department of Atmospheric Sciences, National Taiwan University, Taipei 106, Taiwan.
³ Institute of Environmental Health, National Taiwan University, Taipei 106, Taiwan.

Abstract

Many low-cost sensors (LCSs) are distributed for air monitoring without any rigorous calibrations. This work applies machine learning with PM_2.5 from Taiwan monitoring stations to conduct in-field corrections on a network of 39 PM_2.5 LCSs from July 2017 to December 2018. Three candidate models were evaluated: Multiple linear regression (MLR), support vector regression (SVR), and random forest regression (RFR). The model-corrected PM_2.5 levels were compared with those of GRIMM-calibrated PM_2.5. RFR was superior to MLR and SVR in its correction accuracy and computing efficiency. Compared to SVR, the root mean square errors (RMSEs) of RFR were 35% and 85% lower for the training and validation sets, respectively, and the computational speed was 35 times faster. An RFR with 300 decision trees was chosen as the optimal setting considering both the correction performance and the modeling time. An RFR with a nighttime pattern was established as the optimal correction model, and the RMSEs were 5.9 ± 2.0 μg/m³, reduced from 18.4 ± 6.5 μg/m³ before correction. This is the first work to correct LCSs at locations without monitoring stations, validated using laboratory-calibrated data. Similar models could be established in other countries to greatly enhance the usefulness of their PM_2.5 sensor networks.

Keywords: PM sensing device; efficient in-field PM2.5 correction; in-field calibration; particle sensing correction; random forest model.

Abstract

Grants and funding