Constructing a pollen proxy from low-cost Optical Particle Counter (OPC) data processed with Neural Networks and Random Forests

Sci Total Environ. 2023 May 1:871:161969. doi: 10.1016/j.scitotenv.2023.161969. Epub 2023 Feb 6.

Abstract

Pollen allergies affect a significant proportion of the global population, and this is expected to worsen in years to come. There is demand for the development of automated pollen monitoring systems. Low-cost Optical Particle Counters (OPCs) measure particulate matter and have attractive advantages of real-time high time resolution data and affordable costs. This study asks whether low-cost OPC sensors can be used for meaningful monitoring of airborne pollen. We employ a variety of methods, including supervised machine learning techniques, to construct pollen proxies from hourly-average OPC data and evaluate their performance, holding out 40 % of observations to test the proxies. The most successful methods are supervised machine learning Neural Network (NN) and Random Forest (RF) methods, trained from pollen concentrations collected from a Hirst-type sampler. These perform significantly better than using a simple particle size-filtered proxy or a Positive Matrix Factorisation (PMF) source apportionment pollen proxy. Twelve NN and RF models were developed to construct a pollen proxy, each varying by model type, input features and target variable. The results show that such models can construct useful information on pollen from OPC data. The best metrics achieved (Spearman correlation coefficient = 0.85, coefficient of determination = 0.67) were for the NN model constructing a Poaceae (grass) pollen proxy, based on particle size information, temperature, and relative humidity. Ability to distinguish high pollen events was evaluated using F1 Scores, a score reflecting the fraction of true positives with respect to false positives and false negatives, with promising results (F1 ≤ 0.83). Model-constructed proxies demonstrated the ability to follow monthly and diurnal trends in pollen. We discuss the suitability of OPCs for monitoring pollen and offer advice for future progress. We demonstrate an attractive alternative for automated pollen monitoring that could provide valuable and timely information to the benefit of pollen allergy sufferers.

Keywords: Aerobiology; Automatic monitoring; Low-cost sensors; Machine learning; Optical Particle Counter (OPC); Pollen.

MeSH terms

  • Allergens
  • Environmental Monitoring / methods
  • Neural Networks, Computer
  • Particle Size
  • Particulate Matter / analysis
  • Poaceae
  • Pollen* / chemistry
  • Random Forest*

Substances

  • Particulate Matter
  • Allergens