go back

Volume 14, No. 6

Dealer: An End-to-End Model Marketplace with Differential Privacy

Authors:
Jinfei Liu (Emory University/Georgia Institute of Technology), Jian Lou (Emory University), Junxu Liu (Emory University), Li Xiong (Emory University), Jian Pei (Simon Fraser University), Jimeng Sun (CS)

Abstract

Data-driven machine learning has become ubiquitous. A marketplace for machine learning models connects data owners and model buyers, and can dramatically facilitate data-driven machine learning applications. In this paper, we take a formal data marketplace perspective and propose the first en\textbf{\underline{D}}-to-end mod\textbf{\underline{e}}l m\textbf{\underline{a}}rketp\textbf{\underline{l}}ace with diff\textbf{\underline{e}}rential p\textbf{\underline{r}}ivacy (\emph{Dealer}) towards answering the following questions: \emph{How to formulate data owners' compensation functions and model buyers' price functions? How can the broker determine prices for a set of models to maximize the revenue with arbitrage-free guarantee, and train a set of models with maximum data coverage given a manufacturing budget to remain competitive}? For the former, we propose compensation function for each data owner based on Shapley value and privacy sensitivity, and price function for each model buyer based on data coverage sensitivity and noise sensitivity. Both privacy sensitivity and noise sensitivity are measured by the level of differential privacy. For the latter, we formulate two optimization problems for model pricing and model training, and propose efficient dynamic programming algorithms. Experiment results on the real breast cancer dataset and synthetic datasets justify the design of \emph{Dealer} and verify the efficiency and effectiveness of the proposed algorithms.

PVLDB is part of the VLDB Endowment Inc.

Privacy Policy