Clustering provenance: Facilitating provenance exploration through data abstraction

Linus Karsai, Alan Fekete, Judy Kay, Paolo Missier

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

As digital objects become increasingly important in people's lives, people may need to understand the provenance, or lineage and history, of an important digital object, to understand how it was produced. This is particularly important for objects created from large, multi-source collections of personal data. As the metadata describing provenance, Provenance Data, is commonly represented as a labelled directed acyclic graph, the challenge is to create effective interfaces onto such graphs so that people can understand the provenance of key digital objects. This unsolved problem is especially challenging for the case of novice and intermittent users and complex provenance graphs. We tackle this by creating an interface based on a clustering approach. This was designed to enable users to view provenance graphs, and to simplify complex graphs by combining several nodes. Our core contribution is the design of a prototype interface that supports clustering and its analytic evaluation in terms of desirable properties of visualisation interfaces.

Original languageEnglish
Title of host publicationHILDA 2016 - Proceedings of the Workshop on Human-In-the-Loop Data Analytics
PublisherAssociation for Computing Machinery
ISBN (Electronic)9781450342070
DOIs
Publication statusPublished - 26 Jun 2016
Event1st Workshop on Human-in-the-Loop Data Analytics, HILDA 2016 - San Francisco, United States
Duration: 26 Jun 2016 → …

Publication series

NameHILDA 2016 - Proceedings of the Workshop on Human-In-the-Loop Data Analytics

Conference

Conference1st Workshop on Human-in-the-Loop Data Analytics, HILDA 2016
Country/TerritoryUnited States
CitySan Francisco
Period26/06/16 → …

Bibliographical note

Publisher Copyright:
© 2016 Copyright held by the owner/author(s).

Keywords

  • Large-scale graphs
  • Provenance
  • Visualisation

ASJC Scopus subject areas

  • Computer Science Applications
  • Information Systems
  • Computational Theory and Mathematics

Fingerprint

Dive into the research topics of 'Clustering provenance: Facilitating provenance exploration through data abstraction'. Together they form a unique fingerprint.

Cite this