DOI: 10.1145/3350755.3400282
research-article

Graph Sparsification for Derandomizing Massively Parallel Computation with Low Space

Published: 09 July 2020

ABSTRACT

Massively Parallel Computation (MPC) is an emerging model which distills core aspects of distributed and parallel computation. It was developed as a tool to solve (typically graph) problems in systems where the input is distributed over many machines with limited space. Recent work has focused on the regime in which machines have sublinear space (in n, the number of nodes in the input graph), with randomized algorithms presented for the fundamental problems of Maximal Matching and Maximal Independent Set. There are, however, no prior corresponding deterministic algorithms.

A major challenge in the sublinear space setting is that the local space of each machine may be too small to store all the edges incident to a single node. To overcome this barrier we introduce a new graph sparsification technique that deterministically computes a low-degree subgraph with additional desired properties: degrees in the subgraph are sufficiently small that nodes' neighborhoods can be stored on single machines, and solving the problem on the subgraph provides significant global progress towards solving the problem for the original input graph.
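The space barrier described above can be illustrated with a small sketch. It is not taken from the paper: the function name `oversized_nodes`, the adjacency-dictionary input, and the simplification that local space is exactly n^ε words (ignoring the constant hidden in the O-notation) are all assumptions made for illustration.

```python
def oversized_nodes(adj, eps):
    """Return nodes whose neighborhood does not fit in one machine's space.

    adj: dict mapping each node to the set of its neighbors.
    eps: the exponent in the assumed local space bound n**eps
         (hypothetical simplification: one word per incident edge,
         constant factor taken to be 1).
    """
    n = len(adj)
    space = n ** eps  # assumed per-machine space, in words
    return [v for v, nbrs in adj.items() if len(nbrs) > space]


# A star graph: node 0 is adjacent to every other node.
star = {0: set(range(1, 101))}
for leaf in range(1, 101):
    star[leaf] = {0}

print(oversized_nodes(star, eps=0.5))
```

In the star graph the center has degree 100 while the assumed space is about 10 words, so the center is the only node reported. A sparsified subgraph in the sense of the abstract would be one for which this list is empty.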

Using this framework to derandomize the well-known randomized algorithm of Luby [SICOMP'86], we obtain O(log Δ + log log n)-round deterministic MPC algorithms for solving the fundamental problems of Maximal Matching and Maximal Independent Set with O(n^ε) space on each machine for any constant ε > 0. Based on the recent work of Ghaffari et al. [FOCS'18], this additive O(log log n) factor is conditionally essential. These algorithms can also be shown to run in O(log Δ) rounds in the closely related model of CONGESTED CLIQUE, improving upon the state-of-the-art bound of O(log^2 Δ) rounds by Censor-Hillel et al. [DISC'17].
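For orientation, here is a minimal sequential sketch of one common formulation of Luby's randomized Maximal Independent Set algorithm, the algorithm the paper derandomizes. It is not the deterministic MPC algorithm from the paper. The function name `luby_mis`, the adjacency-dictionary input, the integer node identifiers used for tie-breaking, and the random-priority variant are assumptions chosen for illustration.

```python
import random


def luby_mis(adj, seed=0):
    """Simulate a random-priority variant of Luby's MIS algorithm.

    adj: dict mapping each node (an int) to the set of its neighbors
         in an undirected graph without self-loops.
    Returns (mis, rounds): the independent set and the number of rounds.
    """
    rng = random.Random(seed)
    active = set(adj)
    mis = set()
    rounds = 0
    while active:
        rounds += 1
        # Every active node draws a random priority; the node id breaks ties.
        prio = {v: (rng.random(), v) for v in active}
        # A node joins the MIS if it beats all of its active neighbors.
        winners = {
            v for v in active
            if all(prio[v] < prio[u] for u in adj[v] if u in active)
        }
        mis |= winners
        # Winners and their neighbors leave the graph.
        removed = set(winners)
        for v in winners:
            removed |= adj[v] & active
        active -= removed
    return mis, rounds


graph = {0: {1, 2}, 1: {0, 2}, 2: {0, 1, 3}, 3: {2, 4}, 4: {3}}
mis, rounds = luby_mis(graph, seed=1)
print(sorted(mis), rounds)
```

The node with the smallest priority among the active nodes always joins the set, so every round makes progress and the loop terminates. In a distributed setting each iteration of the `while` loop corresponds to a constant number of communication rounds.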

References

  1. Noga Alon, László Babai, and Alon Itai. A fast and simple randomized parallel algorithm for the maximal independent set problem. Journal of Algorithms, 7(4):567-583, 1986.
  2. Alexandr Andoni, Zhao Song, Clifford Stein, Zhengyu Wang, and Peilin Zhong. Parallel graph connectivity in log diameter rounds. In Proc. 59th FOCS, pages 674-685, 2018.
  3. Alexandr Andoni, Clifford Stein, and Peilin Zhong. Parallel approximate undirected shortest paths via low hop emulators. In Proc. 52nd STOC, 2020.
  4. Sepehr Assadi, MohammadHossein Bateni, Aaron Bernstein, Vahab Mirrokni, and Cliff Stein. Coresets meet EDCS: Algorithms for matching and vertex cover on massive graphs. In Proc. 30th SODA, pages 1616-1635, 2019.
  5. Sepehr Assadi, Xiaorui Sun, and Omri Weinstein. Massively parallel algorithms for finding well-connected components in sparse graphs. In Proc. 37th PODC, pages 461-470, 2019.
  6. Soheil Behnezhad, Sebastian Brandt, Mahsa Derakhshan, Manuela Fischer, MohammadTaghi Hajiaghayi, Richard M. Karp, and Jara Uitto. Massively parallel computation of matching and MIS in sparse graphs. In Proc. 37th PODC, pages 481-490, 2019. A preliminary version of a merge of CoRR abs/1807.06701 and CoRR abs/1807.05374.
  7. Soheil Behnezhad, Mahsa Derakhshan, and MohammadTaghi Hajiaghayi. Semi-MapReduce meets Congested Clique. CoRR abs/1802.10297, 2018.
  8. Soheil Behnezhad, Laxman Dhulipala, Hossein Esfandiari, Jakub Łącki, and Vahab S. Mirrokni. Near-optimal massively parallel graph connectivity. In Proc. 60th FOCS, pages 1615-1636, 2019.
  9. Soheil Behnezhad, Laxman Dhulipala, Hossein Esfandiari, Jakub Łącki, Vahab S. Mirrokni, and Warren Schudy. Massively parallel computation via remote memory access. In Proc. 31st SPAA, pages 59-68, 2019.
  10. Soheil Behnezhad, MohammadTaghi Hajiaghayi, and David G. Harris. Exponentially faster massively parallel maximal matching. In Proc. 60th FOCS, pages 1637-1649, 2019.
  11. Mihir Bellare and John Rompel. Randomness-efficient oblivious sampling. In Proc. 35th FOCS, pages 276-287, 1994.
  12. Keren Censor-Hillel, Merav Parter, and Gregory Schwartzman. Derandomizing local distributed algorithms under bandwidth restrictions. In Proc. 31st DISC, pages 11:1-11:16, 2017.
  13. Yi-Jun Chang, Manuela Fischer, Mohsen Ghaffari, Jara Uitto, and Yufan Zheng. The complexity of (Δ+1) coloring in congested clique, massively parallel computation, and centralized local computation. In Proc. 38th PODC, pages 471-480, 2019.
  14. Artur Czumaj, Peter Davies, and Merav Parter. Simple, deterministic, constant-round coloring in the congested clique. In Proc. 39th PODC, 2020.
  15. Artur Czumaj, Jakub Łącki, Aleksander Mądry, Slobodan Mitrović, Krzysztof Onak, and Piotr Sankowski. Round compression for parallel matching algorithms. In Proc. 50th STOC, pages 471-484, 2018.
  16. Jeffrey Dean and Sanjay Ghemawat. MapReduce: Simplified data processing on large clusters. In Proc. 6th OSDI, pages 10-10, 2004.
  17. Jeffrey Dean and Sanjay Ghemawat. MapReduce: Simplified data processing on large clusters. Communications of the ACM, 51(1):107-113, January 2008.
  18. Mohsen Ghaffari. An improved distributed algorithm for maximal independent set. In Proc. 27th SODA, pages 270-277, 2016.
  19. Mohsen Ghaffari, Themis Gouleakis, Christian Konrad, Slobodan Mitrović, and Ronitt Rubinfeld. Improved massively parallel computation algorithms for MIS, matching, and vertex cover. In Proc. 36th PODC, pages 129-138, 2018.
  20. Mohsen Ghaffari, Christoph Grunau, and Ce Jin. Improved MPC algorithms for MIS, matching, and coloring on trees and beyond. CoRR abs/2002.09610, February 2020.
  21. Mohsen Ghaffari, Fabian Kuhn, and Jara Uitto. Conditional hardness results for massively parallel computation from distributed lower bounds. In Proc. 60th FOCS, pages 1650-1663, 2019.
  22. Mohsen Ghaffari and Jara Uitto. Sparsifying distributed algorithms with ramifications in massively parallel computation and centralized local computation. In Proc. 30th SODA, pages 1636-1653, 2019.
  23. Mark K. Goldberg and Thomas H. Spencer. A new parallel algorithm for the maximal independent set problem. SICOMP, 18(2):419-427, 1989.
  24. M. Goodrich. Communication-efficient parallel sorting. SIAM Journal on Computing, 29(2):416-432, 1999.
  25. Michael T. Goodrich, Nodari Sitchinava, and Qin Zhang. Sorting, searching, and simulation in the MapReduce framework. In Proc. 22nd ISAAC, pages 374-383, 2011.
  26. Yijie Han. A fast derandomization scheme and its applications. SICOMP, 25(1):52-82, 1996.
  27. David G. Harris. Deterministic parallel algorithms for bilinear objective functions. Algorithmica, 81(3):1288-1318, 2019.
  28. Michael Isard, Mihai Budiu, Yuan Yu, Andrew Birrell, and Dennis Fetterly. Dryad: Distributed data-parallel programs from sequential building blocks. SIGOPS Operating Systems Review, 41(3):59-72, March 2007.
  29. Amos Israeli and Alon Itai. A fast and simple randomized parallel algorithm for maximal matching. Information Processing Letters, 22(2):77-80, 1986.
  30. Howard J. Karloff, Siddharth Suri, and Sergei Vassilvitskii. A model of computation for MapReduce. In Proc. 21st SODA, pages 938-948, 2010.
  31. Richard M. Karp and Avi Wigderson. A fast parallel algorithm for the maximal independent set problem. JACM, 32(4):762-773, 1985.
  32. Fabian Kuhn. Weak graph colorings: Distributed algorithms and applications. In Proc. 21st SPAA, pages 138-144, 2009.
  33. Silvio Lattanzi, Benjamin Moseley, Siddharth Suri, and Sergei Vassilvitskii. Filtering: A method for solving graph problems in MapReduce. In Proc. 23rd SPAA, pages 85-94, 2011.
  34. Jakub Łącki, Slobodan Mitrović, Krzysztof Onak, and Piotr Sankowski. Walking randomly, massively, and efficiently. In Proc. 52nd STOC, 2020.
  35. Christoph Lenzen. Optimal deterministic routing and sorting on the congested clique. In Proc. 32nd PODC, pages 42-50, 2013.
  36. Nathan Linial. Locality in distributed graph algorithms. SICOMP, 21(1):193-201, February 1992.
  37. Zvi Lotker, Elan Pavlov, Boaz Patt-Shamir, and David Peleg. MST construction in O(log log n) communication rounds. In Proc. 15th SPAA, pages 94-100, 2003.
  38. Michael Luby. A simple parallel algorithm for the maximal independent set problem. SIAM Journal on Computing, 15(4):1036-1053, 1986.
  39. Tim Roughgarden, Sergei Vassilvitskii, and Joshua R. Wang. Shuffles and circuits (on lower bounds for modern parallel computation). JACM, 65(6):41:1-41:24, November 2018.
  40. Salil P. Vadhan. Pseudorandomness. Foundations and Trends in Theoretical Computer Science, 7(1-3):1-336, 2012.
  41. Eric Vigoda. Lecture notes for randomized algorithms: Luby's alg. for maximal independent sets using pairwise independence. https://www.cc.gatech.edu/~vigoda/RandAlgs/MIS.pdf, 2006.
  42. Tom White. Hadoop: The Definitive Guide. O'Reilly Media, Inc., 2012.
  43. Matei Zaharia, Mosharaf Chowdhury, Michael J. Franklin, Scott Shenker, and Ion Stoica. Spark: Cluster computing with working sets. In Proc. 2nd USENIX Workshop on Hot Topics in Cloud Computing (HotCloud), 2010.

          Published in
          SPAA '20: Proceedings of the 32nd ACM Symposium on Parallelism in Algorithms and Architectures
          July 2020
          601 pages
          ISBN: 9781450369350
          DOI: 10.1145/3350755

          Copyright © 2020 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Acceptance Rates

          Overall Acceptance Rate: 447 of 1,461 submissions, 31%
