go back

Volume 18, No. 2

COLOR: A Framework for Applying Graph Coloring to Subgraph Cardinality Estimation

Authors:
Kyle B Deeds, Diandre Miguel B Sabale, Moe Kayali, Dan Suciu

Abstract

Graph workloads pose a particularly challenging problem for query optimizers. They typically feature large queries made up of entirely many-to-manyjoinswithcomplexcorrelations.Thisputssignificant stress on traditional cardinality estimation methods which generally see catastrophic errors when estimating the size of queries with only a handful of joins. To overcome this, we propose COLOR, a frame-a handful of joins. To overcome this, we propose COLOR, a framework for subgraph cardinality estimation which applies insights from graph compression theory to produce a compact summary that captures the global topology of the data graph. Further, we identify several key optimizations that enable tractable estimation over this summary even for large query graphs. We then evaluate several de-summary even for large query graphs. We then evaluate several designs within this framework and find that they improve accuracy by up to 10³× over all competing methods while maintaining fast infer-up to 10³× over all competing methods while maintaining fast inference, a small memory footprint, efficient construction, and graceful degradation under updates. KEYWORDS Cardinality Estimation, Graph Databases, Graph Summarization, Query Optimization

PVLDB is part of the VLDB Endowment Inc.

Privacy Policy