Projects per year
Abstract
The imbalanced classification problem is very relevant in both academic and industrial applications. The task of finding the best machine learning model to use for a specific imbalanced dataset is complicated due to a large number of existing algorithms, each with its own hyperparameters. The Combined Algorithm Selection and Hyperparameter optimization (CASH) has been introduced to tackle both aspects at the same time. However, CASH has not been studied in detail in the class imbalance domain, where the best combination of resampling technique and classification algorithm is searched for, together with their optimized hyperparameters. Thus, we target the CASH problem for imbalanced classification. We experiment with a search space of 5 classification algorithms, 21 resampling approaches and 64 relevant hyperparameters in total. Moreover, we investigate performance of 2 well-known optimization approaches: Random search and Tree Parzen Estimators approach which is a kind of Bayesian optimization. For comparison, we also perform grid search on all combinations of resampling techniques and classification algorithms with their default hyperparameters. Our experimental results show that a Bayesian optimization approach outperforms the other approaches for CASH in this application domain.
Original language | English |
---|---|
Title of host publication | 2021 IEEE 8th International Conference on Data Science and Advanced Analytics (DSAA) |
Publisher | IEEE |
Pages | 1-9 |
Number of pages | 9 |
ISBN (Electronic) | 9781665420990 |
ISBN (Print) | 9781665421003 |
DOIs | |
Publication status | Published - 20 Oct 2021 |
Externally published | Yes |
Event | 2021 IEEE 8th International Conference on Data Science and Advanced Analytics (DSAA) - Porto, Portugal Duration: 6 Oct 2021 → 9 Oct 2021 |
Publication series
Name | International Conference on Data Science and Advanced Analytics (DSAA) |
---|---|
Publisher | IEEE |
ISSN (Print) | 2472-1573 |
ISSN (Electronic) | 2766-4112 |
Conference
Conference | 2021 IEEE 8th International Conference on Data Science and Advanced Analytics (DSAA) |
---|---|
Period | 6/10/21 → 9/10/21 |
Bibliographical note
This project has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement number 766186 (ECOLE).Keywords
- Machine learning algorithms
- Conferences
- Machine learning
- Data science
- Classification algorithms
- Bayes methods
- Task analysis
Fingerprint
Dive into the research topics of 'Improved automated cash optimization with Tree Parzen Estimators for class imbalance problems'. Together they form a unique fingerprint.Projects
- 1 Finished
-
H2020_ITN_ECOLE_Coordinator
Yao, X. (Principal Investigator)
European Commission, European Commission - Management Costs
1/04/18 → 31/03/22
Project: Research