Performance Introspection of Graph Databases

Macko, P., Margo, D., Seltzer, M.,

Abstract

The explosion of graph data in social and biological net- works, recommendation systems, provenance databases, etc. makes graph storage and processing of paramount impor- tance. We present a performance introspection framework for graph databases, PIG, which provides both a toolset and methodology for understanding graph database perfor- mance. PIG consists of a hierarchical collection of bench- marks that compose to produce performance models; the models provide a way to illuminate the strengths and weak- nesses of a particular implementation. The suite has three layers of benchmarks: primitive operations, composite access patterns, and graph algorithms. While the framework could be used to compare different graph database systems, its primary goal is to help explain the observed performance of a particular system. Such introspection allows one to evalu- ate the degree to which systems exploit their knowledge of graph access patterns. We present both the PIG methodol- ogy and infrastructure and then demonstrate its efficacy by analyzing the popular Neo4j and DEX graph databases.
Postscript Slides