TY - GEN
T1 - A linked data approach to sharing workflows and workflow results
AU - Roos, Marco
AU - Bechhofer, Sean
AU - Zhao, Jun
AU - Missier, Paolo
AU - Newman, David R.
AU - De Roure, David
AU - Marshall, M. Scott
PY - 2010
Y1 - 2010
AB - A bioinformatics analysis pipeline is often highly elaborate, due to the inherent complexity of biological systems and the variety and size of datasets. A digital equivalent of the 'Materials and Methods' section in wet laboratory publications would be highly beneficial to bioinformatics, for evaluating evidence and examining data across related experiments, while introducing the potential to find associated resources and integrate them as data and services. We present initial steps towards preserving bioinformatics 'materials and methods' by exploiting the workflow paradigm for capturing the design of a data analysis pipeline, and RDF to link the workflow, its component services, run-time provenance, and a personalized biological interpretation of the results. An example shows the reproduction of the unique graph of an analysis procedure, its results, provenance, and personal interpretation of a text mining experiment. It links data from Taverna, myExperiment.org, BioCatalogue.org, and ConceptWiki.org. The approach is relatively 'light-weight' and unobtrusive to bioinformatics users.
KW - Concept Web
KW - Digital preservation
KW - Linked Data
KW - Provenance
KW - Semantic Web
KW - Workflow
UR - http://www.scopus.com/inward/record.url?scp=78650292847&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-16558-0_29
DO - 10.1007/978-3-642-16558-0_29
M3 - Conference contribution
AN - SCOPUS:78650292847
SN - 3642165575
SN - 9783642165573
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 340
EP - 354
BT - Leveraging Applications of Formal Methods, Verification, and Validation - 4th International Symposium on Leveraging Applications, ISoLA 2010, Proceedings
T2 - 4th International Symposium on Leveraging Applications, ISoLA 2010
Y2 - 18 October 2010 through 21 October 2010
ER -