M-tree: An Efficient Access Method for Similarity Search in Metric Spaces.
Paolo Ciaccia, Marco Patella, Pavel Zezula:
M-tree: An Efficient Access Method for Similarity Search in Metric Spaces.
VLDB 1997: 426-435@inproceedings{DBLP:conf/vldb/CiacciaPZ97,
author = {Paolo Ciaccia and
Marco Patella and
Pavel Zezula},
editor = {Matthias Jarke and
Michael J. Carey and
Klaus R. Dittrich and
Frederick H. Lochovsky and
Pericles Loucopoulos and
Manfred A. Jeusfeld},
title = {M-tree: An Efficient Access Method for Similarity Search in Metric
Spaces},
booktitle = {VLDB'97, Proceedings of 23rd International Conference on Very
Large Data Bases, August 25-29, 1997, Athens, Greece},
publisher = {Morgan Kaufmann},
year = {1997},
isbn = {1-55860-470-7},
pages = {426-435},
ee = {db/conf/vldb/CiacciaPZ97.html},
crossref = {DBLP:conf/vldb/97},
bibsource = {DBLP, http://dblp.uni-trier.de}
}
Abstract
A new access method, called M-tree, is proposed to organize and search
large data sets from a generic ``metric space", i.e. where object proximity
is only defined by a distance function satisfying the positivity, symmetry,
and triangle inequality postulates.
We detail algorithms for insertion of objects and split management, which
keep the M-tree always balanced - several heuristic split alternatives
are considered and experimentally evaluated.
Algorithms for similarity (range and k-nearest neighbors) queries
are also described.
Results from extensive experimentation with a prototype system are
reported, considering as the performance criteria the
number of page I/O's and the number of distance computations.
The results demonstrate that the M-tree indeed
extends the domain of applicability beyond the traditional vector spaces,
performs reasonably well in high-dimensional data spaces, and scales well
in case of growing files.
Copyright © 1997 by the VLDB Endowment.
Permission to copy without fee all or part of this material is granted provided that the copies are not made or
distributed for direct commercial advantage, the VLDB
copyright notice and the title of the publication and
its date appear, and notice is given that copying
is by the permission of the Very Large Data Base
Endowment. To copy otherwise, or to republish, requires
a fee and/or special permission from the Endowment.
Online Paper
CDROM Version: Load the CDROM "Volume 1 Issue 5, VLDB '89-'97" and ...
DVD Version: Load ACM SIGMOD Anthology DVD 1" and ...
Printed Edition
Matthias Jarke, Michael J. Carey, Klaus R. Dittrich, Frederick H. Lochovsky, Pericles Loucopoulos, Manfred A. Jeusfeld (Eds.):
VLDB'97, Proceedings of 23rd International Conference on Very Large Data Bases, August 25-29, 1997, Athens, Greece.
Morgan Kaufmann 1997, ISBN 1-55860-470-7
Contents
Electronic Edition
From CS Dept.,
University Trier (Germany)
Software
The M-tree Software
References
- [AFS93]
- Rakesh Agrawal, Christos Faloutsos, Arun N. Swami:
Efficient Similarity Search In Sequence Databases.
FODO 1993: 69-84
- [BKK96]
- Stefan Berchtold, Daniel A. Keim, Hans-Peter Kriegel:
The X-tree : An Index Structure for High-Dimensional Data.
VLDB 1996: 28-39
- [BKSS90]
- Norbert Beckmann, Hans-Peter Kriegel, Ralf Schneider, Bernhard Seeger:
The R*-Tree: An Efficient and Robust Access Method for Points and Rectangles.
SIGMOD Conference 1990: 322-331
- [BO97]
- Tolga Bozkaya, Z. Meral Özsoyoglu:
Distance-Based Indexing for High-Dimensional Metric Spaces.
SIGMOD Conference 1997: 357-368
- [Bri95]
- Sergey Brin:
Near Neighbor Search in Large Metric Spaces.
VLDB 1995: 574-584
- [Chi94]
- Tzi-cker Chiueh:
Content-Based Image Indexing.
VLDB 1994: 582-593
- [FEF+94]
- Christos Faloutsos, Ron Barber, Myron Flickner, Jim Hafner, Wayne Niblack, Dragutin Petkovic, William Equitz:
Efficient and Effective Querying by Image Content.
J. Intell. Inf. Syst. 3(3/4): 231-262(1994)
- [FL95]
- Christos Faloutsos, King-Ip Lin:
FastMap: A Fast Algorithm for Indexing, Data-Mining and Visualization of Traditional and Multimedia Datasets.
SIGMOD Conference 1995: 163-174
- [FRM94]
- Christos Faloutsos, M. Ranganathan, Yannis Manolopoulos:
Fast Subsequence Matching in Time-Series Databases.
SIGMOD Conference 1994: 419-429
- [Gut84]
- Antonin Guttman:
R-Trees: A Dynamic Index Structure for Spatial Searching.
SIGMOD Conference 1984: 47-57
- [HNP95]
- Joseph M. Hellerstein, Jeffrey F. Naughton, Avi Pfeffer:
Generalized Search Trees for Database Systems.
VLDB 1995: 562-573
- [JD88]
- Anil K. Jain, Richard C. Dubes:
Algorithms for Clustering Data.
Prentice-Hall 1988
- [RKV95]
- Nick Roussopoulos, Stephen Kelley, Frédéic Vincent:
Nearest Neighbor Queries.
SIGMOD Conference 1995: 71-79
- [SRF87]
- Timos K. Sellis, Nick Roussopoulos, Christos Faloutsos:
The R+-Tree: A Dynamic Index for Multi-Dimensional Objects.
VLDB 1987: 507-518
- [Uhl91]
- Jeffrey K. Uhlmann:
Satisfying General Proximity/Similarity Queries with Metric Trees.
Inf. Process. Lett. 40(4): 175-179(1991)
- [VM95]
- Michael Vassilakopoulos, Yannis Manolopoulos:
Dynamic Inverted Quadtree: A Structure for Pictorial Databases.
Inf. Syst. 20(6): 483-500(1995)
- [WBKW96]
- Erling Wold, Thom Blum, Douglas Keislar, James Wheaton:
Content-Based Classification, Search, and Retrieval of Audio.
IEEE MultiMedia 3: 27-36(1996)
- [ZCR96]
- ...
Copyright © Tue Mar 16 02:22:06 2010
by Michael Ley (ley@uni-trier.de)