(Translated by https://www.hiragana.jp/)
Fast Maximal Clique Enumeration on Uncertain Graphs: A Pivot-based Approach | Proceedings of the 2022 International Conference on Management of Data skip to main content
10.1145/3514221.3526143acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

Fast Maximal Clique Enumeration on Uncertain Graphs: A Pivot-based Approach

Published: 11 June 2022 Publication History
  • Get Citation Alerts
  • Abstract

    Maximal clique enumeration on uncertain graphs is a fundamental problem in uncertain graph analysis. In this paper, we study a problem of enumerating all maximal (k,n)-cliques on an uncertain graph G, where a vertex set H of G is a maximal (k,n)-clique if (1) H (|H| ≥ k) is a clique with probability no less than n, and (2) H is a maximal vertex set satisfying (1). The state-of-the-art algorithms for enumerating all maximal (k,n)-cliques are based on a set enumeration technique which are often very costly. This is because the set enumeration based techniques may explore all subsets of a maximal (k,n)-clique, thus resulting in many unnecessary computations. To overcome this issue, we propose several novel and efficient pivot-based algorithms to enumerate all maximal (k,n)-cliques based on a newly-developed pivot-based pruning principle. Our pivot-based pruning principle is very general which can be applied to speed up the enumeration of any maximal subgraph that satisfies a hereditary property. Here the hereditary property means that if a maximal subgraph H satisfies a property P, any subgraph of H also meets P. To the best of our knowledge, our work is the first to systematically explore the idea of pivot for maximal clique enumeration on uncertain graphs. In addition, we also develop a nontrivial size-constraint based pruning technique and a new graph reduction technique to further improve the efficiency. Extensive experiments on nine real-world graphs demonstrate the efficiency, effectiveness, and scalability of the proposed algorithms.

    Supplementary Material

    MP4 File (SIGMOD22-moddm293.mp4)
    Here we investigate the problem of enumerating all maximal cliques on uncertain graphs and develop a pivot-based approach to solve this issue. The main idea is based on the newly proposed general pivot principle for enumerating all hereditary subgraphs.

    References

    [1]
    Eytan Adar and Christopher Re. 2007. Managing uncertainty in social networks. IEEE Data Eng. Bull., Vol. 30, 2 (2007), 15--22.
    [2]
    Balabhaskar Balasundaram and Sergiy Butenko. 2006. Graph Domination, Coloring and Cliques in Telecommunications. In Handbook of Optimization in Telecommunications. Springer, 865--890.
    [3]
    Vladimir Batagelj and Matjaz Zaversnik. 2003. An O(m) Algorithm for Cores Decomposition of Networks. CoRR, Vol. cs.DS/0310049 (2003).
    [4]
    David M. Blei, Andrew Y. Ng, and Michael I. Jordan. 2003. Latent Dirichlet Allocation. J. Mach. Learn. Res., Vol. 3 (2003), 993--1022.
    [5]
    Vladimir Boginski, Sergiy Butenko, and Panos M. Pardalos. 2005. Statistical analysis of financial networks. Comput. Stat. Data Anal., Vol. 48, 2 (2005), 431--443.
    [6]
    Francesco Bonchi, Francesco Gullo, Andreas Kaltenbrunner, and Yana Volkovich. 2014. Core decomposition of uncertain graphs. In KDD. 1316--1325.
    [7]
    Coenraad Bron and Joep Kerbosch. 1973. Finding All Cliques of an Undirected Graph (Algorithm 457). Commun. ACM, Vol. 16, 9 (1973), 575--576.
    [8]
    Frédéric Cazals and Chinmay Karande. 2006. Reporting maximal cliques: new insights into an old problem. Ph.,D. Dissertation. INRIA.
    [9]
    Lijun Chang, Jeffrey Xu Yu, and Lu Qin. 2013. Fast Maximal Cliques Enumeration in Sparse Graphs. Algorithmica, Vol. 66, 1 (2013), 173--186.
    [10]
    Xuelu Chen, Muhao Chen, Weijia Shi, Yizhou Sun, and Carlo Zaniolo. 2019. Embedding Uncertain Knowledge Graphs. In AAAI. 3363--3370.
    [11]
    Zhu-Mu Chen, Mi-Yen Yeh, and Tei-Wei Kuo. 2021. PASSLEAF: A Pool-bAsed Semi-Supervised LEArning Framework for Uncertain Knowledge Graph Embedding. In AAAI. 4019--4026.
    [12]
    Zi Chen, Long Yuan, Xuemin Lin, Lu Qin, and Jianye Yang. 2020. Efficient Maximal Balanced Clique Enumeration in Signed Networks. In WWW. 339--349.
    [13]
    Alessio Conte, Roberto Grossi, Andrea Marino, and Luca Versari. 2016. Sublinear-Space Bounded-Delay Enumeration for Massive Network Analytics: Maximal Cliques. In ICALP (LIPIcs, Vol. 55). 148:1--148:15.
    [14]
    Marcus T Dittrich, Gunnar W Klau, Andreas Rosenwald, Thomas Dandekar, and Tobias Müller. 2008. Identifying functional modules in protein--protein interaction networks: an integrated exact approach. Bioinformatics, Vol. 24, 13 (2008), i223--i231.
    [15]
    Jun Dong and Steve Horvath. 2007. Understanding network concepts in modules. BMC systems biology, Vol. 1, 1 (2007), 1--20.
    [16]
    David Eppstein, Maarten Lö ffler, and Darren Strash. 2010. Listing All Maximal Cliques in Sparse Graphs in Near-Optimal Time. In ISAAC, Vol. 6506. 403--414.
    [17]
    David Eppstein, Maarten Lö ffler, and Darren Strash. 2013. Listing All Maximal Cliques in Large Sparse Real-World Graphs. ACM J. Exp. Algorithmics, Vol. 18 (2013).
    [18]
    Yunjun Gao, Xiaoye Miao, Gang Chen, Baihua Zheng, Deng Cai, and Huiyong Cui. 2017. On efficiently finding reverse k-nearest neighbors over uncertain graphs. VLDB J., Vol. 26, 4 (2017), 467--492.
    [19]
    Anne-Claude Gavin, Markus Bösche, Roland Krause, Paola Grandi, Martina Marzioch, Andreas Bauer, Jörg Schultz, Jens M Rick, Anne-Marie Michon, Cristina-Maria Cruciat, et al. 2002. Functional organization of the yeast proteome by systematic analysis of protein complexes. Nature, Vol. 415, 6868 (2002), 141--147.
    [20]
    Eric Harley, Anthony Bonner, and Nathan Goodman. 2001. Uniform integration of genome mapping data using intersection graphs. Bioinformatics, Vol. 17, 6 (2001), 487--494.
    [21]
    William Hasenplaugh, Tim Kaler, Tao B. Schardl, and Charles E. Leiserson. 2014. Ordering heuristics for parallel graph coloring. In SPAA. 166--177.
    [22]
    Xin Huang, Wei Lu, and Laks V. S. Lakshmanan. 2016. Truss Decomposition of Probabilistic Graphs: Semantics and Algorithms. In SIGMOD. 77--90.
    [23]
    Rui Jiang, Zhidong Tu, Ting Chen, and Fengzhu Sun. 2006. Network motif identification in stochastic networks. Pro. Nat. Acad. Sci., Vol. 103, 25 (2006), 9404--9409.
    [24]
    Ruoming Jin, Lin Liu, and Charu C. Aggarwal. 2011a. Discovering highly reliable subgraphs in uncertain graphs. In SIGKDD. 992--1000.
    [25]
    Ruoming Jin, Lin Liu, Bolin Ding, and Haixun Wang. 2011b. Distance-Constraint Reachability Computation in Uncertain Graphs. Proc. VLDB Endow., Vol. 4, 9 (2011), 551--562.
    [26]
    Xiangyu Ke, Arijit Khan, and Leroy Lim Hong Quan. 2019. An In-Depth Comparison of s-t Reliability Algorithms over Uncertain Graphs. Proc. VLDB Endow., Vol. 12, 8 (2019), 864--876.
    [27]
    Arijit Khan, Francesco Bonchi, Aristides Gionis, and Francesco Gullo. 2014. Fast Reliability Search in Uncertain Graphs. In EDBT.
    [28]
    Ina Koch. 2001. Enumerating all connected maximal common subgraphs in two graphs. Theor. Comput. Sci., Vol. 250, 1--2 (2001), 1--30.
    [29]
    George Kollios, Michalis Potamias, and Evimaria Terzi. 2011. Clustering large probabilistic graphs. IEEE Trans. Knowl. Data Eng., Vol. 25, 2 (2011), 325--336.
    [30]
    George Kollios, Michalis Potamias, and Evimaria Terzi. 2013. Clustering Large Probabilistic Graphs. IEEE Trans. Knowl. Data Eng., Vol. 25, 2 (2013), 325--336.
    [31]
    Nevan J. Krogan, Gerard Cagney, Haiyuan Yu, Gouqing Zhong, Xinghua Guo, Alexandr Ignatchenko, Joyce Li, Shuye Pu, Nira Datta, Aaron P. Tikuisis, et al. 2006. Global landscape of protein complexes in the yeast Saccharomyces cerevisiae. Nature, Vol. 440, 7084 (2006), 637--643.
    [32]
    Rong-Hua Li, Qiangqiang Dai, Lu Qin, Guoren Wang, Xiaokui Xiao, Jeffrey Xu Yu, and Shaojie Qiao. 2018. Efficient Signed Clique Search in Signed Networks. In ICDE. 245--256.
    [33]
    Rong-Hua Li, Qiangqiang Dai, Guoren Wang, Zhong Ming, Lu Qin, and Jeffrey Xu Yu. 2019. Improved Algorithms for Maximal Clique Search in Uncertain Networks. In ICDE. 1178--1189.
    [34]
    Rong-Hua Li, Jeffrey Xu Yu, Rui Mao, and Tan Jin. 2016. Recursive Stratified Sampling: A New Framework for Query Evaluation on Uncertain Graphs. IEEE Trans. Knowl. Data Eng., Vol. 28, 2 (2016), 468--482.
    [35]
    Don R. Lick and Arthur T. White. 1970. k-Degenerate graphs. Canadian Journal of Mathematics, Vol. 22, 5 (1970), 1082--1096.
    [36]
    Michael Luby. 1986. A simple parallel algorithm for the maximal independent set problem. SIAM journal on computing, Vol. 15, 4 (1986), 1036--1053.
    [37]
    Kazuhisa Makino and Takeaki Uno. 2004. New Algorithms for Enumerating All Maximal Cliques. In SWAT, Vol. 3111. 260--272.
    [38]
    Julian Mcauley and Jure Leskovec. 2014. Discovering social circles in ego networks. ACM Trans. on Knowl. Discovery from Data, Vol. 8, 1 (2014), 1--28.
    [39]
    Yasir Mehmood, Francesco Bonchi, and David Garc'i a-Soriano. 2016. Spheres of Influence for More Effective Viral Marketing. In SIGMOD. 711--726.
    [40]
    Zhuqi Miao, Balabhaskar Balasundaram, and Eduardo L. Pasiliao. 2014. An exact algorithm for the maximum probabilistic clique problem. J. Comb. Optim., Vol. 28, 1 (2014), 105--120.
    [41]
    Tom M. Mitchell, William W. Cohen, Estevam R. Hruschka Jr., Partha P. Talukdar, Bo Yang, Justin Betteridge, Andrew Carlson, Bhavana Dalvi Mishra, Matt Gardner, Bryan Kisiel, Jayant Krishnamurthy, Ni Lao, Kathryn Mazaitis, Thahir Mohamed, Ndapandula Nakashole, Emmanouil A. Platanios, Alan Ritter, Mehdi Samadi, Burr Settles, Richard C. Wang, Derry Wijaya, Abhinav Gupta, Xinlei Chen, Abulhair Saparov, Malcolm Greaves, and Joel Welling. 2018. Never-ending learning. Commun. ACM, Vol. 61, 5 (2018), 103--115.
    [42]
    Arko Mukherjee, Pan Xu, and Srikanta Tirthapura. 2017. Enumeration of Maximal Cliques from an Uncertain Graph. IEEE Trans. Knowl. Data Eng., Vol. 29, 3 (2017), 543--555.
    [43]
    Arko Provo Mukherjee, Pan Xu, and Srikanta Tirthapura. 2015. Mining maximal cliques from an uncertain graph. In ICDE. 243--254.
    [44]
    Kevin A. Naudé. 2016. Refined pivot selection for maximal clique enumeration in graphs. Theor. Comput. Sci., Vol. 613 (2016), 28--37.
    [45]
    Nishith Pathak, Sandeep Mane, and Jaideep Srivastava. 2006. Who Thinks Who Knows Who? Socio-cognitive Analysis of Email Networks. In ICDM. 466--477.
    [46]
    Michalis Potamias, Francesco Bonchi, Aristides Gionis, and George Kollios. 2010. k-Nearest Neighbors in Uncertain Graphs. PVLDB, Vol. 3, 1 (2010), 997--1008.
    [47]
    Yu-Xuan Qiu, Rong-Hua Li, Jianxin Li, Shaojie Qiao, Guoren Wang, Jeffrey Xu Yu, and Rui Mao. 2019. Efficient Structural Clustering on Probabilistic Graphs. IEEE Trans. Knowl. Data Eng., Vol. 31, 10 (2019), 1954--1968.
    [48]
    Ron Rymon. 1992. Search through Systematic Set Enumeration. In Proceedings of the 3rd International Conference on Principles of Knowledge Representation and Reasoning. 539--550.
    [49]
    Arkaprava Saha, Ruben Brokkelkamp, Yllka Velaj, Arijit Khan, and Francesco Bonchi. 2021. Shortest Paths and Centrality in Uncertain Networks. Proc. VLDB Endow., Vol. 14, 7 (2021), 1188--1201.
    [50]
    Roded Sharan, Igor Ulitsky, and Ron Shamir. 2007. Network-based prediction of protein function. Molecular systems biology, Vol. 3, 1 (2007), 88.
    [51]
    Robyn Speer, Joshua Chin, and Catherine Havasi. 2017. ConceptNet 5.5: An Open Multilingual Graph of General Knowledge. In AAAI. 4444--4451.
    [52]
    Etsuji Tomita, Akira Tanaka, and Haruhisa Takahashi. 2006. The worst-case time complexity for generating all maximal cliques and computational experiments. Theor. Comput. Sci., Vol. 363, 1 (2006), 28--42.
    [53]
    Shuji Tsukiyama, Mikio Ide, Hiromu Ariyoshi, and Isao Shirakawa. 1977. A New Algorithm for Generating All the Maximal Independent Sets. SIAM J. Comput., Vol. 6, 3 (1977), 505--517.
    [54]
    Tiphaine Viard, Matthieu Latapy, and Clé mence Magnien. 2016. Computing maximal cliques in link streams. Theor. Comput. Sci., Vol. 609 (2016), 245--252.
    [55]
    Bohua Yang, Dong Wen, Lu Qin, Ying Zhang, Lijun Chang, and Rong-Hua Li. 2019. Index-Based Optimal Algorithm for Computing K-Cores in Large Uncertain Graphs. In ICDE. 64--75.
    [56]
    Long Yuan, Lu Qin, Xuemin Lin, Lijun Chang, and Wenjie Zhang. 2017. Effective and Efficient Dynamic Graph Coloring. Proc. VLDB Endow., Vol. 11, 3 (2017), 338--351.
    [57]
    Long Yuan, Lu Qin, Wenjie Zhang, Lijun Chang, and Jianye Yang. 2018. Index-Based Densest Clique Percolation Community Search in Networks. IEEE Trans. Knowl. Data Eng., Vol. 30, 5 (2018), 922--935.

    Cited By

    View all
    • (2024)Mining Quasi-Periodic Communities in Temporal Network2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00195(2476-2488)Online publication date: 13-May-2024
    • (2024)A two-phase approach for enumeration of maximal $$(\Delta , \gamma )$$-cliques of a temporal networkSocial Network Analysis and Mining10.1007/s13278-024-01207-y14:1Online publication date: 8-Mar-2024
    • (2023)Efficient Maximal Biclique Enumeration on Large Uncertain Bipartite GraphsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2023.327211035:12(12634-12648)Online publication date: 8-May-2023
    • Show More Cited By

    Index Terms

    1. Fast Maximal Clique Enumeration on Uncertain Graphs: A Pivot-based Approach

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      SIGMOD '22: Proceedings of the 2022 International Conference on Management of Data
      June 2022
      2597 pages
      ISBN:9781450392495
      DOI:10.1145/3514221
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 11 June 2022

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. graph mining
      2. maximal clique
      3. pivot enumeration
      4. uncertain graph

      Qualifiers

      • Research-article

      Funding Sources

      • National Key Research and Development Program of China
      • NSFC

      Conference

      SIGMOD/PODS '22
      Sponsor:

      Acceptance Rates

      Overall Acceptance Rate 785 of 4,003 submissions, 20%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)133
      • Downloads (Last 6 weeks)6
      Reflects downloads up to 09 Aug 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Mining Quasi-Periodic Communities in Temporal Network2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00195(2476-2488)Online publication date: 13-May-2024
      • (2024)A two-phase approach for enumeration of maximal $$(\Delta , \gamma )$$-cliques of a temporal networkSocial Network Analysis and Mining10.1007/s13278-024-01207-y14:1Online publication date: 8-Mar-2024
      • (2023)Efficient Maximal Biclique Enumeration on Large Uncertain Bipartite GraphsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2023.327211035:12(12634-12648)Online publication date: 8-May-2023
      • (2023)Most Probable Densest Subgraphs2023 IEEE 39th International Conference on Data Engineering (ICDE)10.1109/ICDE55515.2023.00115(1447-1460)Online publication date: Apr-2023
      • (2023)(p,q)-biclique counting and enumeration for large sparse bipartite graphsThe VLDB Journal — The International Journal on Very Large Data Bases10.1007/s00778-023-00786-032:5(1137-1161)Online publication date: 13-Mar-2023

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media