Analyzing provenance across heterogeneous provenance graphs

Wellington Oliveira*, Paolo Missier, Kary Ocaña, Daniel de Oliveira, Vanessa Braganholo

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Abstract

Provenance generated by different workflow systems is generally expressed using different formats. This is not an issue when scientists analyze provenance graphs in isolation, or when they use the same workflow system. However, analyzing heterogeneous provenance graphs from multiple systems poses a challenge. To address this problem, we adopt ProvONE as an integration model and show how different provenance databases can be converted to a global ProvONE schema. Scientists can then query this integrated database, exploring and linking provenance across several different workflows that may represent different implementations of the same experiment. To illustrate the feasibility of our approach, we developed conceptual mappings between the provenance databases of two workflow systems (e-Science Central and SciCumulus). We provide cartridges that implement these mappings and generate an integrated provenance database expressed as Prolog facts. To demonstrate its usage, we have developed Prolog rules that enable scientists to query the integrated database.

Original language: English
Title of host publication: Provenance and Annotation of Data and Processes - 6th International Provenance and Annotation Workshop, IPAW 2016, Proceedings
Editors: Boris Glavic, Marta Mattoso
Publisher: Springer Verlag
Pages: 57-70
Number of pages: 14
ISBN (Print): 9783319405926
Publication status: Published - 2016
Event: 6th International Provenance and Annotation Workshop, IPAW 2016 - McLean, United States
Duration: 7 Jun 2016 to 8 Jun 2016

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 9672
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 6th International Provenance and Annotation Workshop, IPAW 2016
Country/Territory: United States
City: McLean
Period: 7/06/16 to 8/06/16

Bibliographical note

Publisher Copyright:
© Springer International Publishing Switzerland 2016.

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science
