skip to main content
10.1145/3609956.3609980acmotherconferencesArticle/Chapter ViewAbstractPublication PagessstdConference Proceedingsconference-collections
research-article
Open Access

A Scalable Unified System for Seeding Regionalization Queries

Published:24 August 2023Publication History

ABSTRACT

Spatial regionalization is the process of combining a collection of spatial polygons into contiguous regions that satisfy user-defined criteria and objectives. Numerous techniques for spatial regionalization have been proposed in the literature, which employ varying methods for region growing, seeding, optimization and enforce different user-defined constraints and objectives. This paper introduces a scalable unified system for addressing seeding spatial regionalization queries efficiently. The proposed system provides a usable and scalable framework that employs a wide-range of existing spatial regionalization techniques and allows users to submit novel combinations of queries that have not been previously explored. This represents a significant step forward in the field of spatial regionalization as it provides a robust platform for addressing different regionalization queries. The system is mainly composed of three components: query parser, query planner, and query executor. Preliminary evaluations of the system demonstrate its efficacy in efficiently addressing various regionalization queries.

References

  1. Open AI. 2023. Introducing ChatGPT. https://openai.com/blog/chatgpt.Google ScholarGoogle Scholar
  2. Hussah Alrashid, Yongyi Liu, and Amr Magdy. 2022. SMP: Scalable Max-P Regionalization. In SIGSPATIAL. Association for Computing Machinery, New York, NY, USA, 1–4.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Hussah Alrashid, Yongyi Liu, and Amr Magdy. 2023. PAGE: Parallel Scalable Regionalization Framework. Under minor revision in TSAS (2023), 1–27.Google ScholarGoogle Scholar
  4. Hussah Alrashid, Amr Magdy, and Sergio Rey. 2023. Statistical Inference for Spatial Regionalization. In Under submission to SIGSPATIAL. Association for Computing Machinery, New York, NY, USA, 1–10. https://drive.google.com/file/d/1m1C7IYhK6155U0Idsqa4YCBWqPHB22Lf/view.Google ScholarGoogle Scholar
  5. Konstantin Andreev and Harald Racke. 2006. Balanced Graph Partitioning. Theoretical Computer Science 39, 6 (2006), 929–939.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Apache. 2023. Apache Sedona. https://sedona.apache.org/latest-snapshot/.Google ScholarGoogle Scholar
  7. Daniel Arribas-Bel and Charles R Schmidt. 2013. Self-Organizing Maps and the US Urban Spatial Structure. EPB 40 (2013), 362–371.Google ScholarGoogle Scholar
  8. Renato M Assunção, Marcos Corrêa Neves, Gilberto Câmara, and Corina da Costa Freitas. 2006. Efficient Regionalization Techniques for Socio-economic Geographical Units Using Minimum Spanning Trees. IJGIS 20 (2006), 797–811.Google ScholarGoogle ScholarCross RefCross Ref
  9. Orhun Aydin, Mark V Janikas, Renato Assunção, and Ting-Hwan Lee. 2018. SKATER-CON: Unsupervised Regionalization via Stochastic Tree Partitioning Within a Consensus Framework Using Random Spanning Trees. In ACMGeoAI. Association for Computing Machinery, New York, NY, USA, 33–42.Google ScholarGoogle Scholar
  10. Roberto Benedetti, Federica Piersimoni, Giacomo Pignataro, and Francesco Vidoli. 2020. The Identification of Spatially Constrained Homogeneous Clusters of Covid-19 Transmission in Italy. RSPP 12 (2020), 1169–1187.Google ScholarGoogle Scholar
  11. Una Benlic and Jin-Kao Hao. 2011. An Effective Multilevel Tabu Search Approach for Balanced Graph Partitioning. Operations Research 38, 7 (2011), 1066–1075.Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Daniel Bereznyi, Ahmad Qutbuddin, YoungGu Her, and KwangSoo Yang. 2020. Node-attributed Spatial Graph Partitioning. In SIGSPATIAL. Association for Computing Machinery, New York, NY, USA, 58–67.Google ScholarGoogle Scholar
  13. Subhodip Biswas, Fanglan Chen, Zhiqian Chen, Chang-Tien Lu, and Naren Ramakrishnan. 2020. Incorporating Domain Knowledge into Memetic Algorithms for Solving Spatial Optimization Problems. In SIGSPATIAL. Association for Computing Machinery, New York, NY, USA, 25–35.Google ScholarGoogle Scholar
  14. Subhodip Biswas and et. al.2019. REGAL: A Regionalization Framework for School Boundaries. In SIGSPATIAL. Association for Computing Machinery, New York, NY, USA, 544–547.Google ScholarGoogle Scholar
  15. U.S. Census Bureau. 2019. TIGER/Line Shapefile, 2016, Series Information for the Current Census Tract State-based Shapefile. https://catalog.data.gov/dataset/tiger-line-shapefile-2016-series-information-for-the-current-census-tract-state-based-shapefile.Google ScholarGoogle Scholar
  16. Google Chrome. 2023. Talk to ChatGPT. https://github.com/C-Nedelcu/talk-to-chatgpt.Google ScholarGoogle Scholar
  17. David Combe, Christine Largeron, Elöd Egyed-Zsigmond, and Mathias Géry. 2012. Combining Relations and Text in Scientific Network Clustering. In ASONAM. IEEE Computer Society, Washington, D.C., USA, 1248–1253.Google ScholarGoogle Scholar
  18. Mike Conover and et. al.2023. Free Dolly: Introducing the World’s First Truly Open Instruction-Tuned LLM. https://www.databricks.com/blog/2023/04/12/dolly-first-open-commercially-viable-instruction-tuned-llm.Google ScholarGoogle Scholar
  19. Daniel Delling, Daniel Fleischman, Andrew V Goldberg, Ilya Razenshteyn, and Renato F Werneck. 2015. An Exact Combinatorial Algorithm for Minimum Graph Bisection. Mathematical Programming 153, 2 (2015), 417–458.Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Juan C Duque, Luc Anselin, and Sergio J Rey. 2012. The Max-P-Regions Problem. JRS 52 (2012), 397–419.Google ScholarGoogle Scholar
  21. Juan C Duque, Richard L Church, and Richard S Middleton. 2011. The P-Regions Problem. Geographical Analysis 43 (2011), 104–126.Google ScholarGoogle ScholarCross RefCross Ref
  22. Juan C Duque, Jorge E Patino, Luis A Ruiz, and Josep E Pardo-Pascual. 2015. Measuring Intra-urban Poverty Using Land Cover and Texture Metrics Derived from Remote Sensing Data. LUP 135 (2015), 11–21.Google ScholarGoogle Scholar
  23. Juan Carlos Duque, Raúl Ramos, and Jordi Suriñach. 2007. Supervised Regionalization Methods: A Survey. IRSR 30 (2007), 195–220.Google ScholarGoogle ScholarCross RefCross Ref
  24. Ahmed El Kenawy, Juan I López-Moreno, and Sergio M Vicente-Serrano. 2013. Summer Temperature Extremes in Northeastern Spain: Spatial Regionalization and Links to Atmospheric Circulation (1960–2006). TAC 113 (2013), 387–405.Google ScholarGoogle Scholar
  25. Uriel Feige and Robert Krauthgamer. 2002. A Polylogarithmic Approximation of the Minimum Bisection. SICOM 31, 4 (2002), 1090–1118.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Ariel Felner. 2005. Finding Optimal Solutions to the Graph Partitioning Problem with Heuristic Search. AMAI 45, 3 (2005), 293–322.Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Xin Feng, Sergio Rey, and Ran Wei. 2022. The max-p-compact-regions Problem. Transactions in GIS 26, 2 (2022), 717–734.Google ScholarGoogle ScholarCross RefCross Ref
  28. Thomas Feo, Olivier Goldschmidt, and Mallek Khellaf. 1992. One-Half Approximation Algorithms for the k-Partition Problem. Operations Research 40 (1992), S170–S173.Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. David C Folch and Seth E Spielman. 2014. Identifying Regions Based On Flexible User-defined Constraints. IJGIS 28 (2014), 164–184.Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Fred Glover. 1989. Tabu Search—Part I. ORSA Journal on Computing 1 (1989), 190–206.Google ScholarGoogle ScholarCross RefCross Ref
  31. Jeremiah Hurley. 2004. Regionalization and the Allocation of Healthcare Resources to Meet Population Health Needs. Healthcare Papers 5 (2004), 34–39.Google ScholarGoogle ScholarCross RefCross Ref
  32. Yunfan Kang and Amr Magdy. 2022. EMP: Max-P Regionalization with Enriched Constraints. In ICDE. IEEE, 1914–1926.Google ScholarGoogle Scholar
  33. David R Karger. 1993. Global Min-cuts in RNC, and Other Ramifications of a Simple Min-Cut Algorithm.. In SODA. Society for Industrial and Applied Mathematics, USA, 21–30.Google ScholarGoogle Scholar
  34. George Karypis and Vipin Kumar. 1998. Multilevel K-Way Partitioning Scheme for Irregular Graphs. J. Parallel and Distrib. Comput. 48, 1 (1998), 96–129.Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Brian W Kernighan and Shen Lin. 1970. An Efficient Heuristic Procedure for Partitioning Graphs. The Bell System Technical Journal 49, 2 (1970), 291–307.Google ScholarGoogle ScholarCross RefCross Ref
  36. MS Khan and KF Li. 1995. Fast Graph Partitioning Algorithms. In PACRIM. IEEE Computer Society, Washington, D.C., USA, 337–342.Google ScholarGoogle Scholar
  37. Hyun Kim, Yongwan Chun, and Kamyoung Kim. 2015. Delimitation of Functional Regions Using a P-Regions Problem Approach. IRSR 38 (2015), 235–263.Google ScholarGoogle ScholarCross RefCross Ref
  38. Myung Kim and Ningchuan Xiao. 2017. Contiguity-based Optimization Models for Political Redistricting Problems. IJAGR 8, 4 (2017), 1–18.Google ScholarGoogle Scholar
  39. Yunfeng Kong, Yanfang Zhu, and Yujing Wang. 2019. A Center-based Modeling Approach to Solve the Districting Problem. IJGIS 33, 2 (2019), 368–384.Google ScholarGoogle Scholar
  40. Jason Laura, Wenwen Li, Sergio J Rey, and Luc Anselin. 2015. Parallelization of a Regionalization Heuristic in Distributed Computing Platforms–a Case Study of Parallel-P-Compact-Regions Problem. IJGIS 29 (2015), 536–555.Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. Wenwen Li, Richard L Church, and Michael F Goodchild. 2014. An Extendable Heuristic Framework to Solve the P-Compact-Regions Problem for Urban Economic Modeling. CEUS 43 (2014), 1–13.Google ScholarGoogle Scholar
  42. Wenwen Li, Richard L Church, and Michael F Goodchild. 2014. The p-Compact-Regions Problem. Geographical Analysis 46 (2014), 250–273.Google ScholarGoogle ScholarCross RefCross Ref
  43. Yongyi Liu, Ahmed R. Mahmood, Amr Magdy, and Sergio Rey. 2022. PRUC: P-Regions with User-Defined Constraint. In VLDB. 491–503.Google ScholarGoogle Scholar
  44. Joe Marks, Wheeler Ruml, Stuart Shieber, and J Thomas Ngo. 1998. A Seed-Growth Heuristic for Graph Bisection. Proceedings of Algorithms and Experiments’ 98 (1998), 76–87.Google ScholarGoogle Scholar
  45. Microsoft. 2023. DeepSpeed Chat: Easy, Fast and Affordable RLHF Training of ChatGPT-like Models at All Scales. https://github.com/microsoft/DeepSpeed/tree/master/blogs/deepspeed-chat.Google ScholarGoogle Scholar
  46. Jorge E Patino, Juan C Duque, Josep E Pardo-Pascual, and Luis A Ruiz. 2014. Using Remote Sensing to Assess the Relationship Between Crime and the Urban Layout. Applied Geography 55 (2014), 48–60.Google ScholarGoogle ScholarCross RefCross Ref
  47. Tarjan R. 1971. Depth-first Search and Linear Graph Algorithms. SICOM 1 (1971), 114–121.Google ScholarGoogle Scholar
  48. Sergio J Rey, Luc Anselin, David C Folch, Daniel Arribas-Bel, Myrna L Sastré Gutiérrez, and Lindsey Interlante. 2011. Measuring Spatial Dynamics in Metropolitan Areas. EDQ 25 (2011), 54–64.Google ScholarGoogle ScholarCross RefCross Ref
  49. Sergio J Rey and Myrna L Sastré-Gutiérrez. 2010. Interregional Inequality Dynamics in Mexico. SEA 5 (2010), 277–298.Google ScholarGoogle Scholar
  50. Kirk Schloegel, George Karypis, and Vipin Kumar. 2000. Parallel Multilevel Algorithms for Multi-Constraint Graph Partitioning. In European Conference on Parallel Processing. Springer, Berlin, Heidelberg, 296–310.Google ScholarGoogle Scholar
  51. Bing She, Juan C Duque, and Xinyue Ye. 2017. The Network-Max-P-Regions Model. IJGIS 31 (2017), 962–981.Google ScholarGoogle ScholarDigital LibraryDigital Library
  52. V. Sindhu. 2018. Exploring Parallel Efficiency and Synergy for Max-P Region Problem Using Python. Master’s thesis. Georgia State University.Google ScholarGoogle Scholar
  53. Seth E Spielman and David C Folch. 2015. Reducing Uncertainty in the American Community Survey through Data-driven Regionalization. PloS ONE 10 (2015), e0115626.Google ScholarGoogle ScholarCross RefCross Ref
  54. Daoqin Tong and David A Plane. 2014. A New Spatial Optimization Perspective on the Delineation of Metropolitan and Micropolitan Statistical Areas. Geographical Analysis 46, 3 (2014), 230–249.Google ScholarGoogle ScholarCross RefCross Ref
  55. Ran Wei, Sergio Rey, and Elijah Knaap. 2020. Efficient Regionalization for Spatially Explicit Neighborhood Delineation. IJGIS 35 (2020), 1–17.Google ScholarGoogle Scholar
  56. Xinyue Ye, Bing She, and Samuel Benya. 2018. Exploring Regionalization in the Network Urban Space. JGSA 2 (2018), 4.Google ScholarGoogle Scholar
  57. Yang Zhou, Hong Cheng, and Jeffrey Xu Yu. 2009. Graph Clustering Based on Structural/Attribute Similarities. PVLDB 2 (2009), 718–729.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. A Scalable Unified System for Seeding Regionalization Queries
          Index terms have been assigned to the content through auto-classification.

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Other conferences
            SSTD '23: Proceedings of the 18th International Symposium on Spatial and Temporal Data
            August 2023
            204 pages
            ISBN:9798400708992
            DOI:10.1145/3609956

            Copyright © 2023 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 24 August 2023

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article
            • Research
            • Refereed limited
          • Article Metrics

            • Downloads (Last 12 months)90
            • Downloads (Last 6 weeks)11

            Other Metrics

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          HTML Format

          View this article in HTML Format .

          View HTML Format