Akademska digitalna zbirka SLovenije - logo
E-resources
Full text
Peer reviewed
  • A memory efficient maximal ...
    Yu, Ting; Liu, Mengchi

    Parallel computing, September 2019, 2019-09-00, Volume: 87
    Journal Article

    •We propose a memory efficient method, CMC-bit, for maximal clique enumeration on real-world sparse graphs.•We use an efficient order strategy to reduce both the CPU cost in the candidate map construction and to decrease the candidate clique number.•The method has a stable total execution time and suits well in memory-limited scenarios.•We devise a parallel implementation of CMC-bit based on MapReduce.•The method is faster than baselines on real world datasets and its parallel implementation is scalable. Maximal clique enumeration (MCE) is a widely studied problem that plays a crucial role in structure mining of undirected graphs. The increasing scale of real-world graphs has brought the challenges of high memory cost and high CPU workload to the problem. In this paper, we propose a memory efficient method named CMC-bit for MCE on sparse graphs. It reduces the memory cost via minimizing the candidate cliques and representing them by the data structure bitset. It generates an appropriate order for the vertex set according to two optimized principles to reduce the CPU cost. We further design a partition-based CMC-bit algorithm with a one-side extending strategy to solve the memory-limited problem. We parallelize the CMC-bit algorithm based on MapReduce with a range-based partition strategy to make an optimal trade-off between the shuffling workload of graph decomposition and load balance in the Reduce phase. We conduct extensive experiments on 30 real-world datasets. The results demonstrate that both the CMC-bit algorithm and its parallel implementation significantly outperform the respective state-of-the-art algorithms in speed. We also show that the parallel CMC-bit algorithm achieves good performance on the scalability with respect to both the reducer number and the CPU number.