Abstract
Particulate nitrate, a key component of fine particles, forms through the intricate gas-to-particle conversion process. This process is regulated by the gas-to-particle conversion coefficient of nitrate (ε(NO3−)). The mechanism between ε(NO3−) and its drivers is highly complex and nonlinear, and can be characterized by machine learning methods. However, conventional machine learning often yields results that lack clear physical meaning and may even contradict established physical/chemical mechanisms due to the influence of ambient factors. It urgently needs an alternative approach that possesses transparent physical interpretations and provides deeper insights into the impact of ε(NO3−). Here we introduce a supervised machine learning approach—the multilevel nested random forest guided by theory approaches. Our approach robustly identifies NH4+, SO42−, and temperature as pivotal drivers for ε(NO3−). Notably, substantial disparities exist between the outcomes of traditional random forest analysis and the anticipated actual results. Furthermore, our approach underscores the significance of NH4+ during both daytime (30%) and nighttime (40%) periods, while appropriately downplaying the influence of some less relevant drivers in comparison to conventional random forest analysis. This research underscores the transformative potential of integrating domain knowledge with machine learning in atmospheric studies.
Original language | English |
---|---|
Article number | 100333 |
Number of pages | 9 |
Journal | Environmental Science and Ecotechnology |
Volume | 19 |
Early online date | 19 Oct 2023 |
DOIs | |
Publication status | Published - May 2024 |
Bibliographical note
Publisher Copyright:© 2023 The Authors
Keywords
- Data driven
- Domain knowledge
- Guide
- Machine learning
- Theoretical approach
ASJC Scopus subject areas
- Environmental Engineering
- Ecology
- Environmental Science (miscellaneous)