@inproceedings{DBLP:conf/vldb/WienerN94, author = {Janet L. Wiener and Jeffrey F. Naughton}, editor = {Jorge B. Bocca and Matthias Jarke and Carlo Zaniolo}, title = {Bulk Loading into an OODB: A Performance Study}, booktitle = {VLDB'94, Proceedings of 20th International Conference on Very Large Data Bases, September 12-15, 1994, Santiago de Chile, Chile}, publisher = {Morgan Kaufmann}, year = {1994}, isbn = {1-55860-153-8}, pages = {120-131}, ee = {db/conf/vldb/vldb94-120.html}, crossref = {DBLP:conf/vldb/94}, bibsource = {DBLP, http://dblp.uni-trier.de} }
Object-oriented database (OODB) users bring with them large quantities of legacy data (megabytes and even gigabytes). In addition, scientific OODB users continually generate new data. All this data must be loaded into the OODB. Every relational database system has a load utility, but most OODBs do not. The process of loading data into an OODB is complicated by inter-object references, or relationships, in the data. These relationships are expressed in the OODB as object identifiers, which are not known at the time the load data is generated; they may contain cycles; and there may be implicit system-maintained inverse relationships that must also be stored.
We introduce seven algorithms for loading data into an OODB that examine different techniques for dealing with circular and inverse relationships. We present a performance study based on both an analytic model and an implementation of all seven algorithms on top of the Shore object repository. Our study demonstrates that it is important to choose a load algorithm carefully; in some cases the best algorithm achieved an improvement of one to two orders of magnitude over the naive algorithm.
Copyright © 1994 by the VLDB Endowment. Permission to copy without fee all or part of this material is granted provided that the copies are not made or distributed for direct commercial advantage, the VLDB copyright notice and the title of the publication and its date appear, and notice is given that copying is by the permission of the Very Large Data Base Endowment. To copy otherwise, or to republish, requires a fee and/or special permission from the Endowment.