Constructing a pollen proxy from low-cost Optical Particle Counter (OPC) data processed with Neural Networks and Random Forests

Sophie Mills, Dimitris Bousiotis, Jose Maya-Manzano, Fiona Tummon, A. Robert MacKenzie, Francis Pope*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

11 Downloads (Pure)

Abstract

Pollen allergies affect a significant proportion of the global population, and this is expected to worsen in years to come. There is demand for the development of automated pollen monitoring systems. Low-cost Optical Particle Counters (OPCs) measure particulate matter and have attractive advantages of real-time high time resolution data and affordable costs. This study asks whether low-cost OPC sensors can be used for meaningful monitoring of airborne pollen.

We employ a variety of methods, including supervised machine learning techniques, to construct pollen proxies from hourly-average OPC data and evaluate their performance, holding out 40 % of observations to test the proxies. The most successful methods are supervised machine learning Neural Network (NN) and Random Forest (RF) methods, trained from pollen concentrations collected from a Hirst-type sampler. These perform significantly better than using a simple particle size-filtered proxy or a Positive Matrix Factorisation (PMF) source apportionment pollen proxy. Twelve NN and RF models were developed to construct a pollen proxy, each varying by model type, input features and target variable. The results show that such models can construct useful information on pollen from OPC data. The best metrics achieved (Spearman correlation coefficient = 0.85, coefficient of determination = 0.67) were for the NN model constructing a Poaceae (grass) pollen proxy, based on particle size information, temperature, and relative humidity. Ability to distinguish high pollen events was evaluated using F1 Scores, a score reflecting the fraction of true positives with respect to false positives and false negatives, with promising results (F1 ≤ 0.83). Model-constructed proxies demonstrated the ability to follow monthly and diurnal trends in pollen. We discuss the suitability of OPCs for monitoring pollen and offer advice for future progress. We demonstrate an attractive alternative for automated pollen monitoring that could provide valuable and timely information to the benefit of pollen allergy sufferers.
Original languageEnglish
Article number161969
Number of pages13
JournalScience of the Total Environment
Volume871
Early online date6 Feb 2023
DOIs
Publication statusPublished - 1 May 2023

Bibliographical note

Funding
The work here involving the OPC sensors and related analysis was funded by the Natural Environment Research Council (NERC) through its Central England NERC Training Alliance (CENTA) doctoral research training consortium, at the University of Birmingham, and the grant “Quantification of Utility of Atmospheric Network Technologies (QUANT)” (NE/T001968/1). The intercomparison campaign where the OPC sensors were deployed and accompanying Hirst reference data was obtained for context of this study was funded by the Bayerisches Landesamt für Gesundheit und Lebensmittelsicherheit (LGL) and EUMETNET AutoPollen Programme. Financial support was also received for this from the COST Action CA18226 ADOPT – New approaches in detection of pathogens and aeroallergens.

Fingerprint

Dive into the research topics of 'Constructing a pollen proxy from low-cost Optical Particle Counter (OPC) data processed with Neural Networks and Random Forests'. Together they form a unique fingerprint.

Cite this