Abstract
Computational workflows consist of a series of steps in which data is generated, manipulated, analysed and transformed. Researchers use tools and techniques to capture the provenance associated with the data to aid reproducibility. The metadata collected not only helps in reproducing the computation but also aids in comparing the original and reproduced computations. In this paper, we present an approach, 'Why-Diff', to analyse the difference between two related computations by changing the artifacts and how the existing tools 'YesWorkflow' and 'NoWorkflow' record the changed artifacts.
Original language | English |
---|---|
Title of host publication | Proceedings - 2016 IEEE International Conference on Big Data, Big Data 2016 |
Editors | Ronay Ak, George Karypis, Yinglong Xia, Xiaohua Tony Hu, Philip S. Yu, James Joshi, Lyle Ungar, Ling Liu, Aki-Hiro Sato, Toyotaro Suzumura, Sudarsan Rachuri, Rama Govindaraju, Weijia Xu |
Publisher | Institute of Electrical and Electronics Engineers (IEEE) |
Pages | 3045-3051 |
Number of pages | 7 |
ISBN (Electronic) | 9781467390040 |
DOIs | |
Publication status | Published - 2016 |
Event | 4th IEEE International Conference on Big Data, Big Data 2016 - Washington, United States Duration: 5 Dec 2016 → 8 Dec 2016 |
Publication series
Name | Proceedings - 2016 IEEE International Conference on Big Data, Big Data 2016 |
---|
Conference
Conference | 4th IEEE International Conference on Big Data, Big Data 2016 |
---|---|
Country/Territory | United States |
City | Washington |
Period | 5/12/16 → 8/12/16 |
Bibliographical note
Publisher Copyright:© 2016 IEEE.
Keywords
- Metadata
- NoWorkflow
- Reproducibility
- Why-Diff
- YesWorkflow
ASJC Scopus subject areas
- Computer Networks and Communications
- Information Systems
- Hardware and Architecture