A structural cluster kernel for learning on graphs

Madeleine Seeland; Andreas Karwath; Stefan Kramer

doi:10.1145/2339530.2339614

Research output: Contribution to conference (unpublished) › Paper › peer-review

9 Citations (Scopus)

Abstract

In recent years, graph kernels have received considerable interest within the machine learning and data mining community. Here, we introduce a novel approach enabling kernel methods to utilize additional information hidden in the structural neighborhood of the graphs under consideration. Our novel structural cluster kernel (SCK) incorporates similarities induced by a structural clustering algorithm to improve state-of-the-art graph kernels. The approach taken is based on the idea that graph similarity can not only be described by the similarity between the graphs themselves, but also by the similarity they possess with respect to their structural neighborhood. We applied our novel kernel in a supervised and a semi-supervised setting to regression and classification problems on a number of real-world datasets of molecular graphs. Our results show that the structural cluster similarity information can indeed leverage the prediction performance of the base kernel, particularly when the dataset is structurally sparse and consequently structurally diverse. By additionally taking into account a large number of unlabeled instances the performance of the structural cluster kernel can further be improved.
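
As a rough illustration of the idea in the abstract, the Python sketch below adds a cluster-based similarity term to a precomputed base kernel matrix. It is not the authors' definition of the SCK. The function names, the shared-cluster count used as the cluster similarity, the additive combination, and the `weight` parameter are all assumptions made for illustration. The cluster memberships are assumed to come from a structural clustering step that is not shown here.

```python
from typing import List, Sequence, Set

Matrix = List[List[float]]


def cluster_similarity(memberships: Sequence[Set[int]]) -> Matrix:
    """Count the clusters shared by every pair of graphs.

    memberships[i] is the set of cluster ids that graph i belongs to
    (clusters may overlap). The result equals M @ M.T for the binary
    membership matrix M, so it is positive semidefinite.
    """
    n = len(memberships)
    return [
        [float(len(memberships[i] & memberships[j])) for j in range(n)]
        for i in range(n)
    ]


def combined_kernel(
    base: Matrix, memberships: Sequence[Set[int]], weight: float = 1.0
) -> Matrix:
    """Add a weighted shared-cluster term to a base kernel matrix.

    Hypothetical combination rule (an assumption, not taken from the paper):
    K[i][j] = base[i][j] + weight * shared_clusters(i, j).
    """
    if len(base) != len(memberships):
        raise ValueError("base kernel and memberships must cover the same graphs")
    if weight < 0:
        raise ValueError("weight must be non-negative")
    shared = cluster_similarity(memberships)
    n = len(base)
    return [
        [base[i][j] + weight * shared[i][j] for j in range(n)]
        for i in range(n)
    ]


# Toy example: three graphs, two overlapping clusters (made-up values).
base = [
    [1.0, 0.2, 0.0],
    [0.2, 1.0, 0.1],
    [0.0, 0.1, 1.0],
]
memberships = [{0}, {0, 1}, {1}]
K = combined_kernel(base, memberships, weight=0.5)
```

In this toy setup, graphs 0 and 2 share no cluster, so their entry stays at the base value, while graph 1 gains similarity to both neighbors. Provided the base matrix is positive semidefinite and the weight is non-negative, the sum is again a valid kernel matrix and could be passed to any learner that accepts a precomputed kernel.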

Original language	English
Pages	516-524
DOIs	https://doi.org/10.1145/2339530.2339614
Publication status	Published - 2012

Keywords

cheminformatics, clustering, data mining, kernels, QSAR, support vector machines

Access to Document

10.1145/2339530.2339614

http://doi.acm.org/10.1145/2339530.2339614

Cite this

@conference{61c18d22fbb9462d8f516c00580028f9,

title = "A structural cluster kernel for learning on graphs",

abstract = "In recent years, graph kernels have received considerable interest within the machine learning and data mining community. Here, we introduce a novel approach enabling kernel methods to utilize additional information hidden in the structural neighborhood of the graphs under consideration. Our novel structural cluster kernel (SCK) incorporates similarities induced by a structural clustering algorithm to improve state-of-the-art graph kernels. The approach taken is based on the idea that graph similarity can not only be described by the similarity between the graphs themselves, but also by the similarity they possess with respect to their structural neighborhood. We applied our novel kernel in a supervised and a semi-supervised setting to regression and classification problems on a number of real-world datasets of molecular graphs. Our results show that the structural cluster similarity information can indeed leverage the prediction performance of the base kernel, particularly when the dataset is structurally sparse and consequently structurally diverse. By additionally taking into account a large number of unlabeled instances the performance of the structural cluster kernel can further be improved.",

keywords = "cheminformatics, clustering, data mining, kernels, QSAR, support vector machines",

author = "Madeleine Seeland and Andreas Karwath and Stefan Kramer",

year = "2012",

doi = "10.1145/2339530.2339614",

language = "English",

pages = "516--524",

}

TY - CONF

T1 - A structural cluster kernel for learning on graphs

AU - Seeland, Madeleine

AU - Karwath, Andreas

AU - Kramer, Stefan

PY - 2012

Y1 - 2012

N2 - In recent years, graph kernels have received considerable interest within the machine learning and data mining community. Here, we introduce a novel approach enabling kernel methods to utilize additional information hidden in the structural neighborhood of the graphs under consideration. Our novel structural cluster kernel (SCK) incorporates similarities induced by a structural clustering algorithm to improve state-of-the-art graph kernels. The approach taken is based on the idea that graph similarity can not only be described by the similarity between the graphs themselves, but also by the similarity they possess with respect to their structural neighborhood. We applied our novel kernel in a supervised and a semi-supervised setting to regression and classification problems on a number of real-world datasets of molecular graphs. Our results show that the structural cluster similarity information can indeed leverage the prediction performance of the base kernel, particularly when the dataset is structurally sparse and consequently structurally diverse. By additionally taking into account a large number of unlabeled instances the performance of the structural cluster kernel can further be improved.

AB - In recent years, graph kernels have received considerable interest within the machine learning and data mining community. Here, we introduce a novel approach enabling kernel methods to utilize additional information hidden in the structural neighborhood of the graphs under consideration. Our novel structural cluster kernel (SCK) incorporates similarities induced by a structural clustering algorithm to improve state-of-the-art graph kernels. The approach taken is based on the idea that graph similarity can not only be described by the similarity between the graphs themselves, but also by the similarity they possess with respect to their structural neighborhood. We applied our novel kernel in a supervised and a semi-supervised setting to regression and classification problems on a number of real-world datasets of molecular graphs. Our results show that the structural cluster similarity information can indeed leverage the prediction performance of the base kernel, particularly when the dataset is structurally sparse and consequently structurally diverse. By additionally taking into account a large number of unlabeled instances the performance of the structural cluster kernel can further be improved.

KW - cheminformatics, clustering, data mining, kernels, QSAR, support vector machines

U2 - 10.1145/2339530.2339614

DO - 10.1145/2339530.2339614

M3 - Paper

SP - 516

EP - 524

ER -