go back

Volume 17, No. 11

Enriching Relations with Additional Attributes for ER

Authors:
Mengyi Yan, Wenfei Fan, Yaoshu Wang, Min Xie

Abstract

This paper studies a new problem of relation enrichment. Given a relation ๐ท of schema ๐‘… and a knowledge graph ๐บ with overlapping information, it is to identify a small number of relevant features from ๐บ, and extend schema ๐‘… with the additional attributes, to maximally improve the accuracy of resolving entities represented by the tuples of ๐ท. We formulate the enrichment problem and show its intractability. Nonetheless, we propose a method to extract features from ๐บ that are diverse from the existing attributes of ๐‘…, minimize null values, and moreover, reduce false positives and false negatives of entity resolution (ER) models. The method links tuples and vertices that refer to the same entity, learns a robust policy to extract attributes via reinforcement learning, and jointly trains the policy and ER models. Moreover, we develop algorithms for (incrementally)enriching๐ท. Using real-life data, we experimentally verify that relation enrichment improves the accuracy of ER above 15.4% (percentage points) by adding 5 attributes, up to 33%.

PVLDB is part of the VLDB Endowment Inc.

Privacy Policy