go back
go back
Volume 18, No. 2
Accuracy-enhanced Sparse Vector Technique with Exponential Noise and Optimal Threshold Correction
Abstract
The Sparse Vector Technique (SVT) is one of the most fundamental tools in differential privacy (DP). It works as a backbone for adap- tive data analysis by answering a sequence of queries on a given dataset, and gleaning useful information in a privacy-preserving manner. Unlike the typical private query releases that directly pub- licize the noisy query results, SVT is less informative—it keeps the noisy query results to itself and only reveals a binary bit for each query, indicating whether the query result surpasses a predened threshold. To provide a rigorous DP guarantee for SVT, prior works in the literature adopt a conservative privacy analysis by assuming the direct disclosure of noisy query results as in typical private query releases. This approach, however, hinders SVT from achiev- ing higher query accuracy due to an overestimation of the privacy risks, which further leads to an excessive noise injection using the Laplacian or Gaussian noise for perturbation. Motivated by this, we provide a new privacy analysis for SVT by considering its less informative nature. Our analysis results not only broaden the range of applicable noise types for perturbation in SVT, but also identify the exponential noise as optimal among all evaluated noises (which, however, is usually deemed non-applicable in prior works). The main challenge in applying exponential noise to SVT is mitigating the sub-optimal performance due to the bias introduced by noise distributions. To address this, we develop a utility-oriented optimal threshold correction method and an appending strategy, which enhances the performance of SVT by increasing the precision and recall, respectively. The eectiveness of our proposed methods is substantiated both theoretically and empirically, demonstrating signicant improvements up to 50% across evaluated metrics.
PVLDB is part of the VLDB Endowment Inc.
Privacy Policy