Facilitating reproducible research by investigating computational metadata

Priyaa Thavasimani, Paolo Missier

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Abstract

Computational workflows consist of a series of steps in which data is generated, manipulated, analysed and transformed. Researchers use tools and techniques to capture the provenance associated with the data to aid reproducibility. The metadata collected not only helps in reproducing the computation but also aids in comparing the original and reproduced computations. In this paper, we present an approach, 'Why-Diff', to analyse the difference between two related computations when artifacts are changed, and we examine how the existing tools 'YesWorkflow' and 'NoWorkflow' record the changed artifacts.

Original language: English
Title of host publication: Proceedings - 2016 IEEE International Conference on Big Data, Big Data 2016
Editors: Ronay Ak, George Karypis, Yinglong Xia, Xiaohua Tony Hu, Philip S. Yu, James Joshi, Lyle Ungar, Ling Liu, Aki-Hiro Sato, Toyotaro Suzumura, Sudarsan Rachuri, Rama Govindaraju, Weijia Xu
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Pages: 3045-3051
Number of pages: 7
ISBN (Electronic): 9781467390040
Publication status: Published - 2016
Event: 4th IEEE International Conference on Big Data, Big Data 2016 - Washington, United States
Duration: 5 Dec 2016 to 8 Dec 2016

Publication series

Name: Proceedings - 2016 IEEE International Conference on Big Data, Big Data 2016

Conference

Conference: 4th IEEE International Conference on Big Data, Big Data 2016
Country/Territory: United States
City: Washington
Period: 5/12/16 to 8/12/16

Bibliographical note

Publisher Copyright:
© 2016 IEEE.

Keywords

  • Metadata
  • NoWorkflow
  • Reproducibility
  • Why-Diff
  • YesWorkflow

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Information Systems
  • Hardware and Architecture
