Abstract
Traditional statistical methods (TSM) and machine learning (ML) methods have been widely used to separate the effects of emissions and meteorology on air pollutant concentrations, while their performance compared to the chemistry transport model has been less fully investigated. Using the Community Multiscale Air Quality Model (CMAQ) as a reference, a series of experiments was conducted to comprehensively investigate the performance of TSM (e.g., multiple linear regression and Kolmogorov–Zurbenko filter) and ML (e.g., random forest and extreme gradient boosting) approaches in quantifying the effects of emissions and meteorology on the trends of fine particulate matter (PM2.5) during 2013−2017. Model performance evaluation metrics suggested that the TSM and ML methods can explain the variations of PM2.5 with the highest performance from ML. The trends of PM2.5 showed insignificant differences (p > 0.05) for both the emission-related (PMEMI2:5) and meteorology-related components between TSM, ML, and CMAQ modeling results. PMEMI2:5 estimated from ML showed the least difference to that from CMAQ. Considering the medium computing resources and low model biases, the ML method is recommended for weather normalization of PM2.5. Sensitivity analysis further suggested that the ML model with optimized hyperparameters and the exclusion of temporal variables in weather normalization can further produce reasonable results in emission-related trends of PM2.5.
Original language | English |
---|---|
Article number | 214 |
Number of pages | 10 |
Journal | npj Climate and Atmospheric Science |
Volume | 6 |
Issue number | 214 |
DOIs | |
Publication status | Published - 20 Dec 2023 |
Bibliographical note
AcknowledgementsThis study was financially supported by the Key Program for Technical Innovation of Hubei Province (2017ACA089) and the National Natural Science Foundation of China (41830965).