Evolving Edited K-Nearest Neighbor Classifiers

R Gil-Pita; Xin Yao

doi:10.1142/S0129065708001725

Evolving Edited K-Nearest Neighbor Classifiers

R Gil-Pita, Xin Yao

Computer Science

Research output: Contribution to journal › Article

28 Citations (Scopus)

Abstract

The k-nearest neighbor method is a classifier based on the evaluation of the distances to each pattern in the training set. The edited version of this method consists of the application of this classifier with a subset of the complete training set in which some of the training patterns are excluded, in order to reduce the classification error rate. In recent works, genetic algorithms have been successfully applied to determine which patterns must be included in the edited subset. In this paper we propose a novel implementation of a genetic algorithm for designing edited k-nearest neighbor classifiers. It includes the definition of a novel mean square error based fitness function, a novel clustered crossover technique, and the proposal of a fast smart mutation scheme. In order to evaluate the performance of the proposed method, results using the breast cancer database, the diabetes database and the letter recognition database from the UCI machine learning benchmark repository have been included. Both error rate and computational cost have been considered in the analysis. Obtained results show the improvement achieved by the proposed editing method.

Original language	English
Pages (from-to)	459-467
Number of pages	9
Journal	International Journal of Neural Systems
Volume	18
Issue number	6
DOIs	https://doi.org/10.1142/S0129065708001725
Publication status	Published - 1 Dec 2008

Keywords

evolutionary algorithms
genetic algorithms
machine learning
classification
Nearest neighbour classifiers

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

Access to Document

10.1142/S0129065708001725

Cite this

@article{8fc8a93c92934938abfdfa9827f8c905,

title = "Evolving Edited K-Nearest Neighbor Classifiers",

abstract = "The k-nearest neighbor method is a classifier based on the evaluation of the distances to each pattern in the training set. The edited version of this method consists of the application of this classifier with a subset of the complete training set in which some of the training patterns are excluded, in order to reduce the classification error rate. In recent works, genetic algorithms have been successfully applied to determine which patterns must be included in the edited subset. In this paper we propose a novel implementation of a genetic algorithm for designing edited k-nearest neighbor classifiers. It includes the definition of a novel mean square error based fitness function, a novel clustered crossover technique, and the proposal of a fast smart mutation scheme. In order to evaluate the performance of the proposed method, results using the breast cancer database, the diabetes database and the letter recognition database from the UCI machine learning benchmark repository have been included. Both error rate and computational cost have been considered in the analysis. Obtained results show the improvement achieved by the proposed editing method.",

keywords = "evolutionary algorithms, genetic algorithms, machine learning, classification, Nearest neighbour classifiers",

author = "R Gil-Pita and Xin Yao",

year = "2008",

month = dec,

day = "1",

doi = "10.1142/S0129065708001725",

language = "English",

volume = "18",

pages = "459--467",

journal = "International Journal of Neural Systems",

publisher = "World Scientific",

number = "6",

}

TY - JOUR

T1 - Evolving Edited K-Nearest Neighbor Classifiers

AU - Gil-Pita, R

AU - Yao, Xin

PY - 2008/12/1

Y1 - 2008/12/1

N2 - The k-nearest neighbor method is a classifier based on the evaluation of the distances to each pattern in the training set. The edited version of this method consists of the application of this classifier with a subset of the complete training set in which some of the training patterns are excluded, in order to reduce the classification error rate. In recent works, genetic algorithms have been successfully applied to determine which patterns must be included in the edited subset. In this paper we propose a novel implementation of a genetic algorithm for designing edited k-nearest neighbor classifiers. It includes the definition of a novel mean square error based fitness function, a novel clustered crossover technique, and the proposal of a fast smart mutation scheme. In order to evaluate the performance of the proposed method, results using the breast cancer database, the diabetes database and the letter recognition database from the UCI machine learning benchmark repository have been included. Both error rate and computational cost have been considered in the analysis. Obtained results show the improvement achieved by the proposed editing method.

AB - The k-nearest neighbor method is a classifier based on the evaluation of the distances to each pattern in the training set. The edited version of this method consists of the application of this classifier with a subset of the complete training set in which some of the training patterns are excluded, in order to reduce the classification error rate. In recent works, genetic algorithms have been successfully applied to determine which patterns must be included in the edited subset. In this paper we propose a novel implementation of a genetic algorithm for designing edited k-nearest neighbor classifiers. It includes the definition of a novel mean square error based fitness function, a novel clustered crossover technique, and the proposal of a fast smart mutation scheme. In order to evaluate the performance of the proposed method, results using the breast cancer database, the diabetes database and the letter recognition database from the UCI machine learning benchmark repository have been included. Both error rate and computational cost have been considered in the analysis. Obtained results show the improvement achieved by the proposed editing method.

KW - evolutionary algorithms

KW - genetic algorithms

KW - machine learning

KW - classification

KW - Nearest neighbour classifiers

U2 - 10.1142/S0129065708001725

DO - 10.1142/S0129065708001725

M3 - Article

C2 - 19145662

VL - 18

SP - 459

EP - 467

JO - International Journal of Neural Systems

JF - International Journal of Neural Systems

IS - 6

ER -

Evolving Edited K-Nearest Neighbor Classifiers

Abstract

Keywords

UN SDGs

Access to Document

Fingerprint

Cite this