Data lineage model for Taverna workflows with lightweight annotation requirements

Paolo Missier, Khalid Belhajjame, Jun Zhao, Marco Roos, Carole Goble

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution

Abstract

The provenance, or lineage, of a workflow data product can be reconstructed by keeping a complete trace of workflow execution. This lineage information, however, is likely to be both imprecise, because of the black-box nature of the services that compose the workflow, and noisy, because of the many trivial data transformations that obscure the intended purpose of the workflow. In this paper we argue that these shortcomings can be alleviated by introducing a small set of optional lightweight annotations to the workflow, in a principled way. We begin by presenting a baseline, annotation-free lineage model for the Taverna workflow system, and then show how the proposed annotations improve the results of fundamental lineage queries.

Original language: English
Title of host publication: Provenance and Annotation of Data and Processes - 2nd International Provenance and Annotation Workshop, IPAW 2008, Revised Selected Papers
Editors: Juliana Freire, David Koop, Luc Moreau
Publisher: Springer Verlag
Pages: 17-30
Number of pages: 14
ISBN (Print): 9783540899648
Publication status: Published - 2008
Event: 2nd International Provenance and Annotation Workshop, IPAW 2008 - Salt Lake City, United States
Duration: 17 Jun 2008 to 18 Jun 2008

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 5272
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 2nd International Provenance and Annotation Workshop, IPAW 2008
Country/Territory: United States
City: Salt Lake City
Period: 17/06/08 to 18/06/08

Bibliographical note

Publisher Copyright:
© Springer-Verlag Berlin Heidelberg 2008.

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science
