Abstract
Repeated executions of resource-intensive workflows over a large number of runs are commonly observed in e-science practice. We explore the hypothesis that, in some cases, provenance traces recorded for past runs of a workflow can be used to make future runs more efficient. This investigation is an initial step into the systematic study of the role that provenance analysis can play in the broader context of self-managing software systems. We have tested our hypothesis on a concrete case study involving a Chemical Engineering workflow deployed on a cloud infrastructure, where we can measure the cost of its repeated execution. Our approach involves augmenting the workflow with a feedback loop in which incremental analysis of the provenance of past runs is used to control some of the workflow steps in subsequent executions. We present initial experimental results and hint at future improvements as part of ongoing work.
Original language | English |
---|---|
Publication status | Published - 2011 |
Event | 3rd Workshop on the Theory and Practice of Provenance, TaPP 2011 - Heraklion, Crete, Greece Duration: 20 Jun 2011 → 21 Jun 2011 |
Conference
Conference | 3rd Workshop on the Theory and Practice of Provenance, TaPP 2011 |
---|---|
Country/Territory | Greece |
City | Heraklion, Crete |
Period | 20/06/11 → 21/06/11 |
Bibliographical note
Publisher Copyright:© TaPP 2011.All right reserved.
ASJC Scopus subject areas
- General Computer Science