
Volume 15, No. 1

On Detecting Cherry-picked Generalizations

Authors:
Yin Lin (University of Michigan)*, Brit Youngman (Tel Aviv University), Yuval Moskovitch (University of Michigan), H. V. Jagadish (University of Michigan), Tova Milo (Tel Aviv University)

Abstract

Generalizing from detailed data to statements in a broader context is often critical for users to make sense of large data sets. Conversely, poorly constructed generalizations can convey misleading information even when the statements are technically supported by the data. For example, a cherry-picked level of aggregation could obscure substantial sub-groups that oppose the generalization. We present a framework for detecting and explaining cherry-picked generalizations by refining aggregate queries. We propose a scoring method that indicates the appropriateness of a generalization, and we design efficient algorithms for computing the score. To provide a better understanding of the resulting score, we also formulate practical explanation tasks that disclose significant counterexamples and provide better alternatives to the statement. We conduct experiments using real-world datasets and examples to show the effectiveness of our proposed evaluation metric and the efficiency of our algorithmic framework.
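As a rough illustration of how a level of aggregation can hide opposing sub-groups, the sketch below checks a statement on a whole toy table and then again on each sub-group obtained by refining the aggregate by one attribute. Everything in it is an assumption made for illustration: the data, the statement ("the average rating of X exceeds that of Y"), the function names, and the simple fraction of supporting sub-groups, which is a stand-in and not the scoring method proposed in the paper.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical toy records: (region, product, rating).
ROWS = [
    ("north", "X", 9), ("north", "X", 9), ("north", "X", 9), ("north", "Y", 5),
    ("south", "X", 4), ("south", "Y", 5),
    ("east", "X", 4), ("east", "Y", 5),
]


def statement_holds(rows):
    """Check the statement 'average rating of X exceeds average rating of Y'."""
    x = [rating for _, product, rating in rows if product == "X"]
    y = [rating for _, product, rating in rows if product == "Y"]
    # The statement is only meaningful when both products appear in the rows.
    return bool(x) and bool(y) and mean(x) > mean(y)


def refine_by_region(rows):
    """Refine the aggregate by splitting the rows into one group per region."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[0]].append(row)
    return dict(groups)


def subgroup_support(rows):
    """Return (fraction of sub-groups supporting the statement, counterexamples).

    This fraction is an illustrative stand-in, not the paper's score.
    """
    groups = refine_by_region(rows)
    counterexamples = sorted(
        region for region, sub in groups.items() if not statement_holds(sub)
    )
    supported = len(groups) - len(counterexamples)
    return supported / len(groups), counterexamples


holds_overall = statement_holds(ROWS)
fraction, counterexamples = subgroup_support(ROWS)
print(holds_overall, round(fraction, 2), counterexamples)
```

On this toy table the statement holds at the aggregate level, yet two of the three regions contradict it, which is the kind of counterexample an explanation task would be expected to surface.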

PVLDB is part of the VLDB Endowment Inc.
