Hierarchical reduced-space drift detection framework for multivariate supervised data streams

Research output: Contribution to journalArticlepeer-review

Authors

Colleges, School and Institutes

Abstract

In a streaming environment, the characteristics of the data themselves and their relationship with the labels are likely to experience changes as time goes on. Most drift detection methods for supervised data streams are performance-based, that is, they detect changes only after the classication accuracy deteriorates. This may not be sufcient in many application areas where the reason behind a drift is also important. Another category of drift detectors are data distribution-based detectors. Although they can detect some drifts within the input space, changes affecting only the labelling mechanism cannot be identied. Furthermore, little work is available on drift detection for high-dimensional supervised data streams. In this paper we propose an advanced Hierarchical Reduced-space Drift Detection Framework for Supervised Data Streams (HRDS) which captures drifts regardless of their effects on classication performance. This framework suggests monitoring both marginal and class-conditional distributions within a lower-dimensional space specically relevant to the assigned classication task. Experimental comparisons have demonstrated that the proposed HRDS not only achieves high-quality performance on high-dimensional data streams, but also outperforms its competitors in terms of detection recall, precision and F-measure across a wide range of different concept drift types including subtle drifts.

Bibliographic note

Final Version of Record not yet available as of 13/10/2021.

Details

Original languageEnglish
JournalIEEE Transactions on Knowledge and Data Engineering
Publication statusAccepted/In press - 24 Aug 2021

Keywords

  • Concept drift, Delays, Detectors, Feature extraction, Licenses, Monitoring, Task analysis, Training, data stream mining, drift detection, online learning