go back

Volume 14, No. 11

TQEL: Framework for Query-Driven Linking of Top-K Entities in Social Media Blogs

Authors:
Abdulrahman Alsaudi (University of California Irvine), Yasser Altowim (King Abdulaziz City for Science and Technology), Sharad Mehrotra (U.C. Irvine), Yaming Yu (University of California Irvine)

Abstract

Social media analysis over blogs (such as tweets) often requires determining top-k mentions of a certain category (e.g., movies) in a collection (e.g., tweets collected over a given day). Such queries require entity linking (EL) function to be executed that is often expensive. We propose TQEL, a framework that minimizes the joint cost of EL calls and top-k query processing. The paper presents two variants - TQEL-exact and TQEL-approximate that retrieve the exact / approximate top-k results. TQEL-approximate, using a weaker stopping condition, achieves significantly improved performance (with the fraction of the cost of TQEL-exact) while providing strong probabilistic guarantees (over 2 orders of magnitude lower EL calls with 95% confidence threshold compared to TQEL-exact). TQEL-exact itself is orders of magnitude better compared to a naive approach that calls EL functions on the entire dataset.

PVLDB is part of the VLDB Endowment Inc.

Privacy Policy