Clustering large probabilistic graphs using multi-population evolutionary algorithm

Zahid Halim; Muhammad Waqas; Syed Fawad Hussain

doi:10.1016/j.ins.2015.04.043

Zahid Halim*, Muhammad Waqas, Syed Fawad Hussain

*Corresponding author for this work

Computer Science

Research output: Contribution to journal › Article › peer-review

34 Citations (Scopus)

Abstract

Determining a valid clustering is an important research problem, and it becomes more complex when the underlying data has inherent uncertainties. The work presented in this paper deals with clustering large probabilistic graphs using a multi-population evolutionary algorithm (EA). The EA initializes multiple populations, each representing a deterministic version of the same probabilistic input graph. These deterministic versions are generated by applying different thresholds to the edges. Each chromosome in each population represents one complete clustering solution. The EA is guided by the pKwikCluster algorithm. The proposed approach is tested on two natively probabilistic graphs and nine synthetically converted probabilistic graphs using three cluster validity indices: the Davies-Bouldin index, the Dunn index, and the Silhouette coefficient. It is also compared with two baseline clustering algorithms for uncertain data, Fuzzy-DBSCAN and uncertain K-means, and with two state-of-the-art approaches for clustering probabilistic graphs. The results suggest that the proposed solution performs better than both the baseline methods and the state-of-the-art algorithms.
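The thresholding step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names (`threshold_graph`, `pivot_cluster`), the toy graph, and the threshold values are all hypothetical, and the pivot-based clustering shown here is only a simplified stand-in in the spirit of pKwikCluster, used to seed one candidate solution per deterministic graph. The paper's actual chromosome encoding, fitness function, and EA operators are not given in this record and are not reproduced.

```python
import random
from typing import Dict, List, Set, Tuple

Edge = Tuple[str, str]


def threshold_graph(prob_edges: Dict[Edge, float], threshold: float) -> Set[Edge]:
    """Derive a deterministic graph: keep edges with probability >= threshold."""
    return {edge for edge, p in prob_edges.items() if p >= threshold}


def pivot_cluster(nodes: List[str], edges: Set[Edge], rng: random.Random) -> List[Set[str]]:
    """Simplified pivot clustering (assumed stand-in, not the paper's procedure).

    Visit nodes in random order; each still-unclustered node becomes a pivot
    and forms a cluster with its still-unclustered neighbours.
    """
    adjacency: Dict[str, Set[str]] = {n: set() for n in nodes}
    for u, v in edges:
        adjacency[u].add(v)
        adjacency[v].add(u)

    order = list(nodes)
    rng.shuffle(order)
    unclustered = set(nodes)
    clusters: List[Set[str]] = []
    for pivot in order:
        if pivot not in unclustered:
            continue
        cluster = {pivot} | (adjacency[pivot] & unclustered)
        unclustered -= cluster
        clusters.append(cluster)
    return clusters


# Toy probabilistic graph: edge -> existence probability (made-up values).
prob_edges = {("a", "b"): 0.9, ("b", "c"): 0.8, ("c", "d"): 0.3, ("d", "e"): 0.7}
nodes = ["a", "b", "c", "d", "e"]

# One deterministic graph per threshold; each would back one EA population.
thresholds = [0.25, 0.5, 0.75]  # hypothetical values
deterministic = {t: threshold_graph(prob_edges, t) for t in thresholds}

# One seed clustering per deterministic graph (a candidate "chromosome").
seeds = {t: pivot_cluster(nodes, deterministic[t], random.Random(0)) for t in thresholds}
```

A higher threshold yields a sparser deterministic graph, so the populations start from differently connected views of the same uncertain input.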

Original language	English
Pages (from-to)	78-95
Number of pages	18
Journal	Information Sciences
Volume	317
DOIs	https://doi.org/10.1016/j.ins.2015.04.043
Publication status	Published - 1 Oct 2015

Bibliographical note

Publisher Copyright:
© 2015 Elsevier Inc. All rights reserved.

Keywords

Clustering
Graph mining
Multi-population evolutionary algorithm
Probabilistic graphs

ASJC Scopus subject areas

Software
Control and Systems Engineering
Theoretical Computer Science
Computer Science Applications
Information Systems and Management
Artificial Intelligence

Access to Document

10.1016/j.ins.2015.04.043

Cite this

@article{c43d435d9b8b4468aca6ca8844bb5761,
  title = "Clustering large probabilistic graphs using multi-population evolutionary algorithm",
  abstract = "Determining valid clustering is an important research problem. This problem becomes complex if the underlying data has inherent uncertainties. The work presented in this paper deals with clustering large probabilistic graphs using multi-population evolutionary algorithm. The evolutionary algorithm (EA) initializes its multiple populations, each representing a deterministic version of the same probabilistic graph given to it as an input. Multiple deterministic versions of the same input graph are generated by applying different thresholds to the edges. Each chromosome of the multiple populations represents one complete clustering solution. For the purpose of clustering, EA is employed which is guided by pKwikCluster algorithm. The proposed approach is tested on two natively probabilistic graphs and nine synthetically converted probabilistic graphs using cluster validity indices of Davies-Bouldin index, Dunn index, and Silhouette coefficient. The proposed approach is also compared with two baseline clustering algorithms for uncertain data, Fuzzy-DBSCAN and uncertain K-mean and two state-of-the-art approaches for clustering probabilistic graphs. The results obtained suggest that the proposed solution gives better performance than the baseline methods and the state-of-the-art algorithms.",
  keywords = "Clustering, Graph mining, Multi-population evolutionary algorithm, Probabilistic graphs",
  author = "Zahid Halim and Muhammad Waqas and Hussain, {Syed Fawad}",
  year = "2015",
  month = oct,
  day = "1",
  doi = "10.1016/j.ins.2015.04.043",
  language = "English",
  volume = "317",
  pages = "78--95",
  journal = "Information Sciences",
  issn = "0020-0255",
  publisher = "Elsevier",
}

TY  - JOUR
T1  - Clustering large probabilistic graphs using multi-population evolutionary algorithm
AU  - Halim, Zahid
AU  - Waqas, Muhammad
AU  - Hussain, Syed Fawad
PY  - 2015/10/1
Y1  - 2015/10/1
N2  - Determining valid clustering is an important research problem. This problem becomes complex if the underlying data has inherent uncertainties. The work presented in this paper deals with clustering large probabilistic graphs using multi-population evolutionary algorithm. The evolutionary algorithm (EA) initializes its multiple populations, each representing a deterministic version of the same probabilistic graph given to it as an input. Multiple deterministic versions of the same input graph are generated by applying different thresholds to the edges. Each chromosome of the multiple populations represents one complete clustering solution. For the purpose of clustering, EA is employed which is guided by pKwikCluster algorithm. The proposed approach is tested on two natively probabilistic graphs and nine synthetically converted probabilistic graphs using cluster validity indices of Davies-Bouldin index, Dunn index, and Silhouette coefficient. The proposed approach is also compared with two baseline clustering algorithms for uncertain data, Fuzzy-DBSCAN and uncertain K-mean and two state-of-the-art approaches for clustering probabilistic graphs. The results obtained suggest that the proposed solution gives better performance than the baseline methods and the state-of-the-art algorithms.
AB  - Determining valid clustering is an important research problem. This problem becomes complex if the underlying data has inherent uncertainties. The work presented in this paper deals with clustering large probabilistic graphs using multi-population evolutionary algorithm. The evolutionary algorithm (EA) initializes its multiple populations, each representing a deterministic version of the same probabilistic graph given to it as an input. Multiple deterministic versions of the same input graph are generated by applying different thresholds to the edges. Each chromosome of the multiple populations represents one complete clustering solution. For the purpose of clustering, EA is employed which is guided by pKwikCluster algorithm. The proposed approach is tested on two natively probabilistic graphs and nine synthetically converted probabilistic graphs using cluster validity indices of Davies-Bouldin index, Dunn index, and Silhouette coefficient. The proposed approach is also compared with two baseline clustering algorithms for uncertain data, Fuzzy-DBSCAN and uncertain K-mean and two state-of-the-art approaches for clustering probabilistic graphs. The results obtained suggest that the proposed solution gives better performance than the baseline methods and the state-of-the-art algorithms.
KW  - Clustering
KW  - Graph mining
KW  - Multi-population evolutionary algorithm
KW  - Probabilistic graphs
UR  - http://www.scopus.com/inward/record.url?scp=84930148042&partnerID=8YFLogxK
U2  - 10.1016/j.ins.2015.04.043
DO  - 10.1016/j.ins.2015.04.043
M3  - Article
AN  - SCOPUS:84930148042
SN  - 0020-0255
VL  - 317
SP  - 78
EP  - 95
JO  - Information Sciences
JF  - Information Sciences
ER  -
