Visualization of Tree-Structured Data Through Generative Topographic Mapping

N Gianniotis; Peter Tino

doi:10.1109/TNN.2008.2001000

Visualization of Tree-Structured Data Through Generative Topographic Mapping

N Gianniotis, Peter Tino

Computer Science

Research output: Contribution to journal › Article

22 Citations (Scopus)

Abstract

In this paper, we present a probabilistic generative approach for constructing topographic maps of tree-structured data. Our model defines a low-dimensional manifold of local noise models, namely, (hidden) Markov tree models, induced by a smooth mapping from low-dimensional latent space. We contrast our approach with that of topographic map formation using recursive neural-based techniques, namely, the self-organizing map for structured data (SOMSD) (Hagenbuchner , 2003). The probabilistic nature of our model brings a number of benefits: 1) naturally defined cost function that drives the model optimization; 2) principled model comparison and testing for overfitting; 3) a potential for transparent interpretation of the map by inspecting the underlying local noise models; 4) natural accommodation of alternative local noise models implicitly expressing different notions of structured data similarity. Furthermore, in contrast with the recursive neural-based approaches, the smooth nature of the mapping from the latent space to the local model space allows for calculation of magnification factors--a useful tool for the detection of data clusters. We demonstrate our approach on three data sets: a toy data set, an artificially generated data set, and on a data set of images represented as quadtrees.

Original language	English
Pages (from-to)	1468-1493
Number of pages	26
Journal	IEEE Transactions on Neural Networks
Volume	19
Issue number	8
DOIs	https://doi.org/10.1109/TNN.2008.2001000
Publication status	Published - 1 Aug 2008

Keywords

topographic mapping
structured data
hidden Markov tree model (HMTM)

Access to Document

10.1109/TNN.2008.2001000

Cite this

@article{4db21dd7d24e41558fa27ed615ace1e5,

title = "Visualization of Tree-Structured Data Through Generative Topographic Mapping",

abstract = "In this paper, we present a probabilistic generative approach for constructing topographic maps of tree-structured data. Our model defines a low-dimensional manifold of local noise models, namely, (hidden) Markov tree models, induced by a smooth mapping from low-dimensional latent space. We contrast our approach with that of topographic map formation using recursive neural-based techniques, namely, the self-organizing map for structured data (SOMSD) (Hagenbuchner , 2003). The probabilistic nature of our model brings a number of benefits: 1) naturally defined cost function that drives the model optimization; 2) principled model comparison and testing for overfitting; 3) a potential for transparent interpretation of the map by inspecting the underlying local noise models; 4) natural accommodation of alternative local noise models implicitly expressing different notions of structured data similarity. Furthermore, in contrast with the recursive neural-based approaches, the smooth nature of the mapping from the latent space to the local model space allows for calculation of magnification factors--a useful tool for the detection of data clusters. We demonstrate our approach on three data sets: a toy data set, an artificially generated data set, and on a data set of images represented as quadtrees.",

keywords = "topographic mapping, structured data, hidden Markov tree model (HMTM)",

author = "N Gianniotis and Peter Tino",

year = "2008",

month = aug,

day = "1",

doi = "10.1109/TNN.2008.2001000",

language = "English",

volume = "19",

pages = "1468--1493",

journal = "IEEE Transactions on Neural Networks",

publisher = "Institute of Electrical and Electronics Engineers (IEEE)",

number = "8",

}

TY - JOUR

T1 - Visualization of Tree-Structured Data Through Generative Topographic Mapping

AU - Gianniotis, N

AU - Tino, Peter

PY - 2008/8/1

Y1 - 2008/8/1

N2 - In this paper, we present a probabilistic generative approach for constructing topographic maps of tree-structured data. Our model defines a low-dimensional manifold of local noise models, namely, (hidden) Markov tree models, induced by a smooth mapping from low-dimensional latent space. We contrast our approach with that of topographic map formation using recursive neural-based techniques, namely, the self-organizing map for structured data (SOMSD) (Hagenbuchner , 2003). The probabilistic nature of our model brings a number of benefits: 1) naturally defined cost function that drives the model optimization; 2) principled model comparison and testing for overfitting; 3) a potential for transparent interpretation of the map by inspecting the underlying local noise models; 4) natural accommodation of alternative local noise models implicitly expressing different notions of structured data similarity. Furthermore, in contrast with the recursive neural-based approaches, the smooth nature of the mapping from the latent space to the local model space allows for calculation of magnification factors--a useful tool for the detection of data clusters. We demonstrate our approach on three data sets: a toy data set, an artificially generated data set, and on a data set of images represented as quadtrees.

AB - In this paper, we present a probabilistic generative approach for constructing topographic maps of tree-structured data. Our model defines a low-dimensional manifold of local noise models, namely, (hidden) Markov tree models, induced by a smooth mapping from low-dimensional latent space. We contrast our approach with that of topographic map formation using recursive neural-based techniques, namely, the self-organizing map for structured data (SOMSD) (Hagenbuchner , 2003). The probabilistic nature of our model brings a number of benefits: 1) naturally defined cost function that drives the model optimization; 2) principled model comparison and testing for overfitting; 3) a potential for transparent interpretation of the map by inspecting the underlying local noise models; 4) natural accommodation of alternative local noise models implicitly expressing different notions of structured data similarity. Furthermore, in contrast with the recursive neural-based approaches, the smooth nature of the mapping from the latent space to the local model space allows for calculation of magnification factors--a useful tool for the detection of data clusters. We demonstrate our approach on three data sets: a toy data set, an artificially generated data set, and on a data set of images represented as quadtrees.

KW - topographic mapping

KW - structured data

KW - hidden Markov tree model (HMTM)

U2 - 10.1109/TNN.2008.2001000

DO - 10.1109/TNN.2008.2001000

M3 - Article

C2 - 18701375

VL - 19

SP - 1468

EP - 1493

JO - IEEE Transactions on Neural Networks

JF - IEEE Transactions on Neural Networks

IS - 8

ER -

Visualization of Tree-Structured Data Through Generative Topographic Mapping

Abstract

Keywords

Access to Document

Fingerprint

Cite this