Abstract
We study how the error of an ensemble regression estimator can be decomposed into two components: one accounting for the individual errors of the members, and the other for the correlations within the ensemble. This is the well-known Ambiguity decomposition; we present an alternative way to decompose the error and show how both decompositions have been exploited in a learning scheme. By introducing a scaling parameter into the decomposition, we can blend the gradient (and therefore the learning process) smoothly between two extremes: from concentrating on individual accuracies while ignoring diversity, up to a full non-linear optimisation of all parameters, treating the ensemble as a single learning unit. We demonstrate that this also applies to ensembles using a soft combination of posterior probability estimates, and so can be utilised for classifier ensembles.
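The decomposition and the scaling parameter described above can be made concrete with a minimal sketch. Assuming the Krogh–Vedelsby form of the Ambiguity decomposition and an NC-learning-style penalty with coefficient `lam` (both are assumptions for illustration, not the paper's own code), the following NumPy snippet verifies the identity numerically and shows how `lam` blends the per-member gradient between the two extremes:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical setup: M ensemble members predicting a scalar target d,
# combined by a convex weighting w (uniform here).
M = 5
d = 2.0                                       # target value
f = rng.normal(loc=2.0, scale=0.5, size=M)    # member predictions f_i
w = np.full(M, 1.0 / M)                       # combination weights, sum to 1

f_bar = np.dot(w, f)                          # ensemble prediction

ensemble_err   = (f_bar - d) ** 2             # (f_bar - d)^2
avg_member_err = np.dot(w, (f - d) ** 2)      # sum_i w_i (f_i - d)^2
ambiguity      = np.dot(w, (f - f_bar) ** 2)  # sum_i w_i (f_i - f_bar)^2

# Ambiguity decomposition: ensemble error = average member error - ambiguity.
assert np.isclose(ensemble_err, avg_member_err - ambiguity)

def blended_gradient(f, d, lam):
    """Gradient of the lambda-scaled per-member objective
    e_i = 0.5*(f_i - d)^2 - lam * 0.5*(f_i - f_bar)^2,
    treating f_bar as a constant (an NC-learning-style simplification;
    the exact penalty form here is an assumption for illustration).
    """
    f_bar = f.mean()
    return (f - d) - lam * (f - f_bar)

g_indep = blended_gradient(f, d, lam=0.0)  # each member fits d alone
g_joint = blended_gradient(f, d, lam=1.0)  # every member receives f_bar - d
assert np.allclose(g_joint, f.mean() - d)
```

At `lam = 0` the gradient reduces to the individual error term `f_i - d` (accuracy only, no diversity); at `lam = 1` it equals `f_bar - d` for every member, which is proportional (by a factor of M) to the gradient of the joint ensemble error `0.5*(f_bar - d)**2`, i.e. the single-learning-unit extreme.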
Original language | English |
---|---|
Pages (from-to) | 296-305 |
Number of pages | 10 |
Journal | Lecture Notes in Computer Science |
Volume | 3541 |
DOIs | |
Publication status | Published - 1 Jan 2005 |
Event | 6th International Workshop on Multiple Classifier Systems - Duration: 1 Jan 2005 → … |