Incremental workflow improvement through analysis of its data provenance

Paolo Missier*

*Corresponding author for this work

Research output: Contribution to conference (unpublished)Paperpeer-review

Abstract

Repeated executions of resource-intensive workflows over a large number of runs are commonly observed in e-science practice. We explore the hypothesis that, in some cases, provenance traces recorded for past runs of a workflow can be used to make future runs more efficient. This investigation is an initial step into the systematic study of the role that provenance analysis can play in the broader context of self-managing software systems. We have tested our hypothesis on a concrete case study involving a Chemical Engineering workflow deployed on a cloud infrastructure, where we can measure the cost of its repeated execution. Our approach involves augmenting the workflow with a feedback loop in which incremental analysis of the provenance of past runs is used to control some of the workflow steps in subsequent executions. We present initial experimental results and hint at future improvements as part of ongoing work.

Original languageEnglish
Publication statusPublished - 2011
Event3rd Workshop on the Theory and Practice of Provenance, TaPP 2011 - Heraklion, Crete, Greece
Duration: 20 Jun 201121 Jun 2011

Conference

Conference3rd Workshop on the Theory and Practice of Provenance, TaPP 2011
Country/TerritoryGreece
CityHeraklion, Crete
Period20/06/1121/06/11

Bibliographical note

Publisher Copyright:
© TaPP 2011.All right reserved.

ASJC Scopus subject areas

  • General Computer Science

Fingerprint

Dive into the research topics of 'Incremental workflow improvement through analysis of its data provenance'. Together they form a unique fingerprint.

Cite this