Scaling whole exome data processing using workflows on the cloud

Jacek Cała, Paolo Missier

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Whole exome / genome sequencing (WES/WGS) is poised to become a cornerstone of genetic testing for diagnosis in clinical practice, at population scale. The Cloud-e-Genome project, started in late 2013, addresses three architectural requirements in support of WES-based diagnosis, namely (i) scalability of the storage and computing resources required to extract variants from sequences, (ii) flexibility in the design and evolution of WES processing pipelines, and (iii) reproducibility of the results. Our approach involves using a scientific workflow model to program the pipelines for flexibility, deploying the workflows on the Azure cloud for scalability, and recording the provenance of workflow execution, for reproduciblity. In this discussion paper we elaborate on our design choices, the associated challenges, and the expected benefits.

Original languageEnglish
Title of host publication22nd Italian Symposium on Advanced Database Systems, SEBD 2014
PublisherUniversita Reggio Calabria and Centro di Competenza (ICT-SUD)
Pages201-208
Number of pages8
ISBN (Electronic)9781634391450
Publication statusPublished - 2014
Event22nd Italian Symposium on Advanced Database Systems, SEBD 2014 - Castellammare di Stabia, Italy
Duration: 16 Jun 201418 Jun 2014

Publication series

Name22nd Italian Symposium on Advanced Database Systems, SEBD 2014

Conference

Conference22nd Italian Symposium on Advanced Database Systems, SEBD 2014
Country/TerritoryItaly
CityCastellammare di Stabia
Period16/06/1418/06/14

Bibliographical note

Publisher Copyright:
Copyright © (2014) by Universita Reggio Calabria & Centro di Competenza (ICT-SUD) All rights reserved.

ASJC Scopus subject areas

  • Software

Fingerprint

Dive into the research topics of 'Scaling whole exome data processing using workflows on the cloud'. Together they form a unique fingerprint.

Cite this