@inproceedings{DBLP:conf/vldb/GibbonsMP97, author = {Phillip B. Gibbons and Yossi Matias and Viswanath Poosala}, editor = {Matthias Jarke and Michael J. Carey and Klaus R. Dittrich and Frederick H. Lochovsky and Pericles Loucopoulos and Manfred A. Jeusfeld}, title = {Fast Incremental Maintenance of Approximate Histograms}, booktitle = {VLDB'97, Proceedings of 23rd International Conference on Very Large Data Bases, August 25-29, 1997, Athens, Greece}, publisher = {Morgan Kaufmann}, year = {1997}, isbn = {1-55860-470-7}, pages = {466-475}, ee = {db/conf/vldb/GibbonsMP97.html}, crossref = {DBLP:conf/vldb/97}, bibsource = {DBLP, http://dblp.uni-trier.de} }
Many commercial database systems maintain histograms to summarize the contents of large relations and permit efficient estimation of query result sizes for use in query optimizers. Delaying the propagation of database updates to the histogram often introduces errors in the estimation. This paper presents new sampling-based approaches for incremental maintenance of approximate histograms. By scheduling updates to the histogram based on the updates to the database, our techniques are the first to maintain histograms effectively up-to-date at all times and avoid computing overheads when unnecessary. Our techniques provide highly-accurate approximate histograms belonging to the equi- depth and Compressed classes. Experimental results show that our new approaches provide orders of magnitude more accurate estimation than previous approaches.
An important aspect employed by these new approaches is a backing sample, an up-to-date random sample of the tuples currently in a relation. We provide efficient solutions for maintaining a uniformly random sample of a relation in the presence of updates to the relation.
Copyright © 1997 by the VLDB Endowment. Permission to copy without fee all or part of this material is granted provided that the copies are not made or distributed for direct commercial advantage, the VLDB copyright notice and the title of the publication and its date appear, and notice is given that copying is by the permission of the Very Large Data Base Endowment. To copy otherwise, or to republish, requires a fee and/or special permission from the Endowment.