go back

Volume 14, No. 6

Efficient Bi-triangle Counting for Large Bipartite Networks

Authors:
Yixing Yang (University of New South Wales), Yixiang Fang (School of Data Science, The Chinese University of Hong Kong, Shenzhen), Maria Orlowska (Polish-Japanese Institute of Information Technology), Wenjie Zhang (University of New South Wales), Xuemin Lin (University of New South Wales)

Abstract

A bipartite network is a network with two disjoint vertex sets and its edges only exist between vertices from different sets. It has received much interest since it can be used to model the relationship between two different sets of objects in many applications (e.g., the relationship between users and items in E-commerce). In this paper, we study the problem of efficient bi-triangle counting for a large bipartite network, where a bi-triangle is a cycle with three vertices from one vertex set and three vertices from another vertex set. Counting bi-triangles has found many real applications such as computing the transitivity coefficient and clustering coefficient for bipartite networks. To enable efficient bi-triangle counting, we first develop a baseline algorithm relying on the observation that each bi-triangle can be considered as the join of three wedges. Then, we propose a more sophisticated algorithm which regards a bi-triangle as the join of two super-wedges, where a wedge is a path with two edges while a super-wedge is a path with three edges. We further optimize the algorithm by ranking vertices according to their degrees. We have performed extensive experiments on both real and synthetic bipartite networks, where the largest one contains more than one billion edges, and the results show that the proposed solutions are up to five orders of magnitude faster than the baseline method.

PVLDB is part of the VLDB Endowment Inc.

Privacy Policy