TAPER: query-aware, partition-enhancement for large, heterogenous graphs

Hugo Firth*, Paolo Missier

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Graph partitioning has long been seen as a viable approach to addressing Graph DBMS scalability. A partitioning, however, may introduce extra query processing latency unless it is sensitive to a specific query workload, and optimised to minimise inter-partition traversals for that workload. Additionally, it should also be possible to incrementally adjust the partitioning in reaction to changes in the graph topology, the query workload, or both. Because of their complexity, current partitioning algorithms fall short of one or both of these requirements, as they are designed for offline use and as one-off operations. The TAPER system aims to address both requirements, whilst leveraging existing partitioning algorithms. TAPER takes any given initial partitioning as a starting point, and iteratively adjusts it by swapping chosen vertices across partitions, heuristically reducing the probability of inter-partition traversals for a given path queries workload. Iterations are inexpensive thanks to time and space optimisations in the underlying support data structures. We evaluate TAPER on two different large test graphs and over realistic query workloads. Our results indicate that, given a hash-based partitioning, TAPER reduces the number of inter-partition traversals by ∼ 80%; given an unweighted Metis partitioning, by ∼ 30%. These reductions are achieved within eight iterations and with the additional advantage of being workload-aware and usable online.

Original languageEnglish
Pages (from-to)85-115
Number of pages31
JournalDistributed and Parallel Databases
Volume35
Issue number2
DOIs
Publication statusPublished - 1 Jun 2017

Bibliographical note

Publisher Copyright:
© 2017, The Author(s).

Keywords

  • Graph databases
  • Graph repartitioning
  • Workload mining

ASJC Scopus subject areas

  • Software
  • Information Systems
  • Hardware and Architecture
  • Information Systems and Management

Fingerprint

Dive into the research topics of 'TAPER: query-aware, partition-enhancement for large, heterogenous graphs'. Together they form a unique fingerprint.

Cite this