Abstract
We propose a differential versioning based data storage (DiVers) architecture for distributed storage systems, which relies on a novel erasure coding technique that exploits sparsity across versions. The emphasis of this work is to demonstrate how sparsity exploiting codes (SEC), originally designed for I/O optimization, can be extended to significantly reduce storage overhead in a repository of versioned data. In addition to facilitating reduced storage, we address some key reliability aspects for DiVers such as (i) mechanisms to deploy the coding technique with arbitrarily varying size of data across versions, and (ii) investigating the right allocation strategy for the encoded blocks over a network of distributed nodes across different versions so as to achieve the best fault tolerance. We also discuss system issues related to the management of data structures for accessing and manipulating the files over the differential versions.
| Original language | English |
|---|---|
| Pages (from-to) | 47-62 |
| Number of pages | 16 |
| Journal | Future Generation Computer Systems |
| Volume | 59 |
| DOIs | |
| Publication status | Published - Jun 2016 |
Bibliographical note
Publisher Copyright:© 2016 Elsevier B.V. All rights reserved.
Keywords
- Datacenter networking
- Erasure coding
- Fault tolerance
- Version management
ASJC Scopus subject areas
- Software
- Hardware and Architecture
- Computer Networks and Communications
Fingerprint
Dive into the research topics of 'DiVers: An erasure code based storage architecture for versioning exploiting sparsity'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver