go back

Volume 15, No. 2

Butterfly Counting on Uncertain Bipartite Networks

Authors:
Alexander Zhou (Hong Kong University of Science and Technology)* Yue Wang (Shenzhen Institute of Computing Sciences, Shenzhen University.) Lei Chen (Hong Kong University of Science and Technology)

Abstract

When considering uncertain bipartite networks, the number of instances of the popular graphlet structure the butterfly may be used as an important metric to quickly gauge information about the network. This Uncertain Butterfly Count has practical usages in a variety of areas such as biomedical/biological fields, E-Commerce and road networks. In this paper we formally define the uncertain butterfly structure (in which the existential probability of the butterfly is greater than or equal to some user-defined threshold t)as well as the Uncertain Butterfly Count Problem (to determine the number of unique instances of this structure on any uncertain bipartite network). We then examine exact solutions by proposing a non-trivial baseline solution (π‘ˆπ΅πΉπΆ) as well as an improved solution (πΌπ‘ˆπ΅πΉπΆ) which reduces the time complexity and employs heuristics to further reduce the runtime in practice. In addition to exact solutions, we propose two approximate solutions via sampling, π‘ˆπ΅π‘† and 𝑃𝐸𝑆, which can be used to quickly estimate the Uncertain Butterfly Count, a powerful tool when the exact count is unnecessary. Using a range of networks with different edge existential probability distributions, we validate the efficiency and effectiveness of our solutions.

PVLDB is part of the VLDB Endowment Inc.

Privacy Policy