go back
go back
Volume 17, No. 11
Enriching Relations with Additional Attributes for ER
Abstract
This paper studies a new problem of relation enrichment. Given a relation ๐ท of schema ๐ and a knowledge graph ๐บ with overlapping information, it is to identify a small number of relevant features from ๐บ, and extend schema ๐ with the additional attributes, to maximally improve the accuracy of resolving entities represented by the tuples of ๐ท. We formulate the enrichment problem and show its intractability. Nonetheless, we propose a method to extract features from ๐บ that are diverse from the existing attributes of ๐ , minimize null values, and moreover, reduce false positives and false negatives of entity resolution (ER) models. The method links tuples and vertices that refer to the same entity, learns a robust policy to extract attributes via reinforcement learning, and jointly trains the policy and ER models. Moreover, we develop algorithms for (incrementally)enriching๐ท. Using real-life data, we experimentally verify that relation enrichment improves the accuracy of ER above 15.4% (percentage points) by adding 5 attributes, up to 33%.
PVLDB is part of the VLDB Endowment Inc.
Privacy Policy