go back

Volume 15, No. 11

Efficient Load-Balanced Butterfly Counting on GPU

Authors:
Qingyu Xu (RenMing University of China) Feng Zhang (Renmin University of China)* Zhiming Yao (Renmin University of China) Lv Lu (Renmin University of China) Xiaoyong Du (Renmin University of China) Dong Deng (Rutgers Universituy - New Brunswick) Bingsheng He (National University of Singapore)

Abstract

Butterfly counting is an important and costly operation in bipartite graphs. GPUs are popular parallel heterogeneous devices and can bring significant performance improvement for data science applications. Unfortunately, no work enables efficient butterfly counting on GPU currently. To fill this gap, we propose a GPU-based butterfly counting, called G-BFC. G-BFC addresses three main challenges. First, butterfly counting involves massive serial operations, which leads to severe synchronization overheads and performance degradation. We unlock the serial region and utilize the shared memory on GPU to handle it. Second, butterfly counting on GPU faces the workload imbalance problem. We develop a novel adaptive strategy to balance the workload among threads, improving efficiency. Third, butterfly counting in parallel suffers from the traversal of the huge amount of two-hop paths, also called wedges, in bipartite graphs. We develop a novel preprocessing strategy, which can effectively reduce the number of wedges to be traversed and memory cost. Experiments show that G-BFC brings significant performance benefits. On ten real datasets, G-BFC can achieve 19.8 times performance speedup over the state-of-the-art solution.

PVLDB is part of the VLDB Endowment Inc.

Privacy Policy