go back

Volume 14, No. 6

Fast Algorithm for Anchor Graph Hashing

Authors:
Yasuhiro Fujiwara (NTT Communication Science Laboratories), Sekitoshi Kanai (NTT Software Innovation Center), Yasutoshi Ida (NTT Software Innovation Center), Atsutoshi Kumagai (NTT Software Innovation Center), Naonori Ueda (NTT Communication Science Labs.)

Abstract

Anchor graph hashing is used in many applications such as cancer detection, web page classification, and drug discovery. It computes the hash codes from the eigenvectors of the matrix representing the similarities between data points and anchor points; anchors refer to the points representing the data distribution. In performing an approximate nearest neighbor search, the hash codes of a query data point are determined by identifying its closest anchor points. Anchor graph hashing, however, incurs high computation cost since (1) the computation cost of obtaining the eigenvectors is quadratic to the number of anchor points, and (2) the similarities of the query data point to all the anchor points must be computed. Our proposal, Tridiagonal hashing, increases the efficiency of anchor graph hashing because of its two advances: (1) we apply a graph clustering algorithm to compute the eigenvectors from the tridiagonal matrix obtained from the similarities between data points and anchor points, and (2) we detect anchor points closest to the query data point by using a dimensionality reduction approach. Experiments show that our approach is several orders of magnitude faster than the previous approaches. Besides, it yields high search accuracy than the original anchor graph hashing approach.

PVLDB is part of the VLDB Endowment Inc.

Privacy Policy