Abstract
This paper presents an extension to the W3C PROV provenance model, aimed at representing process structure. Although the modelling of process structure is outside the scope of the PROV specification, it is beneficial when capturing and analyzing the provenance of data that is produced by programs or other formally encoded processes. In the paper, we motivate the need for such an extended model in the context of an ongoing large data federation and preservation project, DataONE, where provenance traces of scientific workflow runs are captured and stored alongside the data products. We introduce new provenance relations for modelling process structure along with their usage patterns, and present sample queries that demonstrate their benefit.
Original language | English |
---|---|
Publication status | Published - 2013 |
Event | 5th Workshop on the Theory and Practice of Provenance, TaPP 2013 - Lombard, United States; Duration: 2 Apr 2013 → 3 Apr 2013 |
Conference
Conference | 5th Workshop on the Theory and Practice of Provenance, TaPP 2013 |
---|---|
Country/Territory | United States |
City | Lombard |
Period | 2/04/13 → 3/04/13 |
Bibliographical note
Funding Information: Acknowledgments. Work supported in part by NSF-OCI DataONE #0830944 (for Víctor Cuevas-Vicenttín) and made possible by the voluntary work of members of the DataONE Provenance Working Group. Special thanks to Yaxing Wei from ORNL for the design and implementation of the VisTrails climate workflows. Khalid Belhajjame was supported by the myGrid platform grant.
Publisher Copyright:
© TaPP 2013. All rights reserved.
ASJC Scopus subject areas
- General Computer Science