Machine learning methods for low-cost pollen monitoring - Model optimisation and interpretability

Sophie A Mills, José M Maya-Manzano, Fiona Tummon, A Rob MacKenzie, Francis Pope*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

2 Downloads (Pure)

Abstract

Pollen is a major issue globally, causing as much as 40 % of the population to suffer from hay fever and other allergic conditions. Current techniques for monitoring pollen are either laborious and slow, or expensive, thus alternative methods are needed to provide timely and more localised information on airborne pollen concentrations. We have demonstrated previously that low-cost Optical Particle Counter (OPC) sensors can be used to estimate pollen concentrations when machine learning methods are used to process the data and learn the relationships between OPC output data and conventionally measured pollen concentrations.

This study demonstrates how methodical hyperparameter tuning can be employed to significantly improve model performance. We present the results of a range of models based on tuned hyperparameter configurations trained to predict Poaceae (Barnhart), Quercus (L.), Betula (L.), Pinus (L.) and total pollen concentrations. The results achieved here are a significant improvement on results we previously reported: the average R2 scores for the total pollen models have at least doubled compared to using previous parameter settings.

Furthermore, we employ the explainable Artificial Intelligence (XAI) technique, SHAP, to interpret the models and understand how each of the input features (i.e. particle sizes) affect the estimated output concentration for each pollen type. In particular, we found that Quercus pollen has a strong positive correlation with particles of optical diameter 1.7–2.3 μm, which distinguishes it from other pollen types such as Poaceae and may suggest that type-specific subpollen particles are present in this size range.

There is much further work to be done, especially in training and testing models on data obtained across different environments to evaluate the extent of generalisability. Nevertheless, this work demonstrates the potential this method can offer for low-cost monitoring of pollen and the valuable insight we can gain from what the model has learned.
Original languageEnglish
Article number165853
Number of pages15
JournalScience of the Total Environment
Volume903
Early online date5 Aug 2023
DOIs
Publication statusPublished - 10 Dec 2023

Bibliographical note

Copyright © 2023 The Authors

Keywords

  • Pollen
  • Bioaerosols
  • Automatic monitoring
  • Low-cost sensors
  • Machine learningExplainable artificial intelligence (XAI)

Fingerprint

Dive into the research topics of 'Machine learning methods for low-cost pollen monitoring - Model optimisation and interpretability'. Together they form a unique fingerprint.

Cite this