go back
go back
Volume 15, No. 7
Stingy Sketch: A Sketch Framework for Accurate and Fast Frequency Estimation
Abstract
Recording the frequency of items in highly skewed data streams is a fundamental and hot problem in recent years. The literature demonstrates that sketch is the most promising solution. The typical metrics to measure a sketch are accuracy and speed, and existing sketches make only trade-offs between the two dimensions. Differently, we aim to optimize both the accuracy and speed at the same time. In this paper, we propose a new sketch framework called Stingy sketch with two key techniques: Bit-pinching Counter Tree(BCTree) and Prophet Queue (PQueue). The key idea of BCTree is to split a large fixed-size counter into many small nodes of a tree structure, and to use a precise encoding to perform carry-in operations with low processing overhead. The key idea of PQueueis to use pipelined prefetch technique to make most memory accesses happen in L2 cache without losing precision. Importantly,the two techniques are cooperative so that Stingy sketch can improve accuracy and speed simultaneously. Extensive experimental results show that Stingy sketch is up to 50% more accurate than the state of the art (SOTA) of accuracy-oriented sketches and is up to 33% faster than the SOTA of speed-oriented sketches.
PVLDB is part of the VLDB Endowment Inc.
Privacy Policy