Skip to main navigation Skip to search Skip to main content

DiVers: An erasure code based storage architecture for versioning exploiting sparsity

Research output: Contribution to journalArticlepeer-review

Abstract

We propose a differential versioning based data storage (DiVers) architecture for distributed storage systems, which relies on a novel erasure coding technique that exploits sparsity across versions. The emphasis of this work is to demonstrate how sparsity exploiting codes (SEC), originally designed for I/O optimization, can be extended to significantly reduce storage overhead in a repository of versioned data. In addition to facilitating reduced storage, we address some key reliability aspects for DiVers such as (i) mechanisms to deploy the coding technique with arbitrarily varying size of data across versions, and (ii) investigating the right allocation strategy for the encoded blocks over a network of distributed nodes across different versions so as to achieve the best fault tolerance. We also discuss system issues related to the management of data structures for accessing and manipulating the files over the differential versions.

Original languageEnglish
Pages (from-to)47-62
Number of pages16
JournalFuture Generation Computer Systems
Volume59
DOIs
Publication statusPublished - Jun 2016

Bibliographical note

Publisher Copyright:
© 2016 Elsevier B.V. All rights reserved.

Keywords

  • Datacenter networking
  • Erasure coding
  • Fault tolerance
  • Version management

ASJC Scopus subject areas

  • Software
  • Hardware and Architecture
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'DiVers: An erasure code based storage architecture for versioning exploiting sparsity'. Together they form a unique fingerprint.

Cite this