ABSTRACT
Spatial regionalization is the process of combining a collection of spatial polygons into contiguous regions that satisfy user-defined criteria and objectives. Numerous techniques for spatial regionalization have been proposed in the literature, which employ varying methods for region growing, seeding, optimization and enforce different user-defined constraints and objectives. This paper introduces a scalable unified system for addressing seeding spatial regionalization queries efficiently. The proposed system provides a usable and scalable framework that employs a wide-range of existing spatial regionalization techniques and allows users to submit novel combinations of queries that have not been previously explored. This represents a significant step forward in the field of spatial regionalization as it provides a robust platform for addressing different regionalization queries. The system is mainly composed of three components: query parser, query planner, and query executor. Preliminary evaluations of the system demonstrate its efficacy in efficiently addressing various regionalization queries.
- Open AI. 2023. Introducing ChatGPT. https://openai.com/blog/chatgpt.Google Scholar
- Hussah Alrashid, Yongyi Liu, and Amr Magdy. 2022. SMP: Scalable Max-P Regionalization. In SIGSPATIAL. Association for Computing Machinery, New York, NY, USA, 1–4.Google ScholarDigital Library
- Hussah Alrashid, Yongyi Liu, and Amr Magdy. 2023. PAGE: Parallel Scalable Regionalization Framework. Under minor revision in TSAS (2023), 1–27.Google Scholar
- Hussah Alrashid, Amr Magdy, and Sergio Rey. 2023. Statistical Inference for Spatial Regionalization. In Under submission to SIGSPATIAL. Association for Computing Machinery, New York, NY, USA, 1–10. https://drive.google.com/file/d/1m1C7IYhK6155U0Idsqa4YCBWqPHB22Lf/view.Google Scholar
- Konstantin Andreev and Harald Racke. 2006. Balanced Graph Partitioning. Theoretical Computer Science 39, 6 (2006), 929–939.Google ScholarDigital Library
- Apache. 2023. Apache Sedona. https://sedona.apache.org/latest-snapshot/.Google Scholar
- Daniel Arribas-Bel and Charles R Schmidt. 2013. Self-Organizing Maps and the US Urban Spatial Structure. EPB 40 (2013), 362–371.Google Scholar
- Renato M Assunção, Marcos Corrêa Neves, Gilberto Câmara, and Corina da Costa Freitas. 2006. Efficient Regionalization Techniques for Socio-economic Geographical Units Using Minimum Spanning Trees. IJGIS 20 (2006), 797–811.Google ScholarCross Ref
- Orhun Aydin, Mark V Janikas, Renato Assunção, and Ting-Hwan Lee. 2018. SKATER-CON: Unsupervised Regionalization via Stochastic Tree Partitioning Within a Consensus Framework Using Random Spanning Trees. In ACMGeoAI. Association for Computing Machinery, New York, NY, USA, 33–42.Google Scholar
- Roberto Benedetti, Federica Piersimoni, Giacomo Pignataro, and Francesco Vidoli. 2020. The Identification of Spatially Constrained Homogeneous Clusters of Covid-19 Transmission in Italy. RSPP 12 (2020), 1169–1187.Google Scholar
- Una Benlic and Jin-Kao Hao. 2011. An Effective Multilevel Tabu Search Approach for Balanced Graph Partitioning. Operations Research 38, 7 (2011), 1066–1075.Google ScholarDigital Library
- Daniel Bereznyi, Ahmad Qutbuddin, YoungGu Her, and KwangSoo Yang. 2020. Node-attributed Spatial Graph Partitioning. In SIGSPATIAL. Association for Computing Machinery, New York, NY, USA, 58–67.Google Scholar
- Subhodip Biswas, Fanglan Chen, Zhiqian Chen, Chang-Tien Lu, and Naren Ramakrishnan. 2020. Incorporating Domain Knowledge into Memetic Algorithms for Solving Spatial Optimization Problems. In SIGSPATIAL. Association for Computing Machinery, New York, NY, USA, 25–35.Google Scholar
- Subhodip Biswas and et. al.2019. REGAL: A Regionalization Framework for School Boundaries. In SIGSPATIAL. Association for Computing Machinery, New York, NY, USA, 544–547.Google Scholar
- U.S. Census Bureau. 2019. TIGER/Line Shapefile, 2016, Series Information for the Current Census Tract State-based Shapefile. https://catalog.data.gov/dataset/tiger-line-shapefile-2016-series-information-for-the-current-census-tract-state-based-shapefile.Google Scholar
- Google Chrome. 2023. Talk to ChatGPT. https://github.com/C-Nedelcu/talk-to-chatgpt.Google Scholar
- David Combe, Christine Largeron, Elöd Egyed-Zsigmond, and Mathias Géry. 2012. Combining Relations and Text in Scientific Network Clustering. In ASONAM. IEEE Computer Society, Washington, D.C., USA, 1248–1253.Google Scholar
- Mike Conover and et. al.2023. Free Dolly: Introducing the World’s First Truly Open Instruction-Tuned LLM. https://www.databricks.com/blog/2023/04/12/dolly-first-open-commercially-viable-instruction-tuned-llm.Google Scholar
- Daniel Delling, Daniel Fleischman, Andrew V Goldberg, Ilya Razenshteyn, and Renato F Werneck. 2015. An Exact Combinatorial Algorithm for Minimum Graph Bisection. Mathematical Programming 153, 2 (2015), 417–458.Google ScholarDigital Library
- Juan C Duque, Luc Anselin, and Sergio J Rey. 2012. The Max-P-Regions Problem. JRS 52 (2012), 397–419.Google Scholar
- Juan C Duque, Richard L Church, and Richard S Middleton. 2011. The P-Regions Problem. Geographical Analysis 43 (2011), 104–126.Google ScholarCross Ref
- Juan C Duque, Jorge E Patino, Luis A Ruiz, and Josep E Pardo-Pascual. 2015. Measuring Intra-urban Poverty Using Land Cover and Texture Metrics Derived from Remote Sensing Data. LUP 135 (2015), 11–21.Google Scholar
- Juan Carlos Duque, Raúl Ramos, and Jordi Suriñach. 2007. Supervised Regionalization Methods: A Survey. IRSR 30 (2007), 195–220.Google ScholarCross Ref
- Ahmed El Kenawy, Juan I López-Moreno, and Sergio M Vicente-Serrano. 2013. Summer Temperature Extremes in Northeastern Spain: Spatial Regionalization and Links to Atmospheric Circulation (1960–2006). TAC 113 (2013), 387–405.Google Scholar
- Uriel Feige and Robert Krauthgamer. 2002. A Polylogarithmic Approximation of the Minimum Bisection. SICOM 31, 4 (2002), 1090–1118.Google ScholarDigital Library
- Ariel Felner. 2005. Finding Optimal Solutions to the Graph Partitioning Problem with Heuristic Search. AMAI 45, 3 (2005), 293–322.Google ScholarDigital Library
- Xin Feng, Sergio Rey, and Ran Wei. 2022. The max-p-compact-regions Problem. Transactions in GIS 26, 2 (2022), 717–734.Google ScholarCross Ref
- Thomas Feo, Olivier Goldschmidt, and Mallek Khellaf. 1992. One-Half Approximation Algorithms for the k-Partition Problem. Operations Research 40 (1992), S170–S173.Google ScholarDigital Library
- David C Folch and Seth E Spielman. 2014. Identifying Regions Based On Flexible User-defined Constraints. IJGIS 28 (2014), 164–184.Google ScholarDigital Library
- Fred Glover. 1989. Tabu Search—Part I. ORSA Journal on Computing 1 (1989), 190–206.Google ScholarCross Ref
- Jeremiah Hurley. 2004. Regionalization and the Allocation of Healthcare Resources to Meet Population Health Needs. Healthcare Papers 5 (2004), 34–39.Google ScholarCross Ref
- Yunfan Kang and Amr Magdy. 2022. EMP: Max-P Regionalization with Enriched Constraints. In ICDE. IEEE, 1914–1926.Google Scholar
- David R Karger. 1993. Global Min-cuts in RNC, and Other Ramifications of a Simple Min-Cut Algorithm.. In SODA. Society for Industrial and Applied Mathematics, USA, 21–30.Google Scholar
- George Karypis and Vipin Kumar. 1998. Multilevel K-Way Partitioning Scheme for Irregular Graphs. J. Parallel and Distrib. Comput. 48, 1 (1998), 96–129.Google ScholarDigital Library
- Brian W Kernighan and Shen Lin. 1970. An Efficient Heuristic Procedure for Partitioning Graphs. The Bell System Technical Journal 49, 2 (1970), 291–307.Google ScholarCross Ref
- MS Khan and KF Li. 1995. Fast Graph Partitioning Algorithms. In PACRIM. IEEE Computer Society, Washington, D.C., USA, 337–342.Google Scholar
- Hyun Kim, Yongwan Chun, and Kamyoung Kim. 2015. Delimitation of Functional Regions Using a P-Regions Problem Approach. IRSR 38 (2015), 235–263.Google ScholarCross Ref
- Myung Kim and Ningchuan Xiao. 2017. Contiguity-based Optimization Models for Political Redistricting Problems. IJAGR 8, 4 (2017), 1–18.Google Scholar
- Yunfeng Kong, Yanfang Zhu, and Yujing Wang. 2019. A Center-based Modeling Approach to Solve the Districting Problem. IJGIS 33, 2 (2019), 368–384.Google Scholar
- Jason Laura, Wenwen Li, Sergio J Rey, and Luc Anselin. 2015. Parallelization of a Regionalization Heuristic in Distributed Computing Platforms–a Case Study of Parallel-P-Compact-Regions Problem. IJGIS 29 (2015), 536–555.Google ScholarDigital Library
- Wenwen Li, Richard L Church, and Michael F Goodchild. 2014. An Extendable Heuristic Framework to Solve the P-Compact-Regions Problem for Urban Economic Modeling. CEUS 43 (2014), 1–13.Google Scholar
- Wenwen Li, Richard L Church, and Michael F Goodchild. 2014. The p-Compact-Regions Problem. Geographical Analysis 46 (2014), 250–273.Google ScholarCross Ref
- Yongyi Liu, Ahmed R. Mahmood, Amr Magdy, and Sergio Rey. 2022. PRUC: P-Regions with User-Defined Constraint. In VLDB. 491–503.Google Scholar
- Joe Marks, Wheeler Ruml, Stuart Shieber, and J Thomas Ngo. 1998. A Seed-Growth Heuristic for Graph Bisection. Proceedings of Algorithms and Experiments’ 98 (1998), 76–87.Google Scholar
- Microsoft. 2023. DeepSpeed Chat: Easy, Fast and Affordable RLHF Training of ChatGPT-like Models at All Scales. https://github.com/microsoft/DeepSpeed/tree/master/blogs/deepspeed-chat.Google Scholar
- Jorge E Patino, Juan C Duque, Josep E Pardo-Pascual, and Luis A Ruiz. 2014. Using Remote Sensing to Assess the Relationship Between Crime and the Urban Layout. Applied Geography 55 (2014), 48–60.Google ScholarCross Ref
- Tarjan R. 1971. Depth-first Search and Linear Graph Algorithms. SICOM 1 (1971), 114–121.Google Scholar
- Sergio J Rey, Luc Anselin, David C Folch, Daniel Arribas-Bel, Myrna L Sastré Gutiérrez, and Lindsey Interlante. 2011. Measuring Spatial Dynamics in Metropolitan Areas. EDQ 25 (2011), 54–64.Google ScholarCross Ref
- Sergio J Rey and Myrna L Sastré-Gutiérrez. 2010. Interregional Inequality Dynamics in Mexico. SEA 5 (2010), 277–298.Google Scholar
- Kirk Schloegel, George Karypis, and Vipin Kumar. 2000. Parallel Multilevel Algorithms for Multi-Constraint Graph Partitioning. In European Conference on Parallel Processing. Springer, Berlin, Heidelberg, 296–310.Google Scholar
- Bing She, Juan C Duque, and Xinyue Ye. 2017. The Network-Max-P-Regions Model. IJGIS 31 (2017), 962–981.Google ScholarDigital Library
- V. Sindhu. 2018. Exploring Parallel Efficiency and Synergy for Max-P Region Problem Using Python. Master’s thesis. Georgia State University.Google Scholar
- Seth E Spielman and David C Folch. 2015. Reducing Uncertainty in the American Community Survey through Data-driven Regionalization. PloS ONE 10 (2015), e0115626.Google ScholarCross Ref
- Daoqin Tong and David A Plane. 2014. A New Spatial Optimization Perspective on the Delineation of Metropolitan and Micropolitan Statistical Areas. Geographical Analysis 46, 3 (2014), 230–249.Google ScholarCross Ref
- Ran Wei, Sergio Rey, and Elijah Knaap. 2020. Efficient Regionalization for Spatially Explicit Neighborhood Delineation. IJGIS 35 (2020), 1–17.Google Scholar
- Xinyue Ye, Bing She, and Samuel Benya. 2018. Exploring Regionalization in the Network Urban Space. JGSA 2 (2018), 4.Google Scholar
- Yang Zhou, Hong Cheng, and Jeffrey Xu Yu. 2009. Graph Clustering Based on Structural/Attribute Similarities. PVLDB 2 (2009), 718–729.Google ScholarDigital Library
Index Terms
- A Scalable Unified System for Seeding Regionalization Queries
Recommendations
PAGE: Parallel Scalable Regionalization Framework
Regionalization techniques group spatial areas into a set of homogeneous regions to analyze and draw conclusions about spatial phenomena. A recent regionalization problem, called MP-regions, groups spatial areas to produce a maximum number of regions by ...
Scalable and efficient processing of top-k multiple-type integrated queries
AbstractIn this paper, we define a new class of queries, the top-k multiple-type integrated query (simply, top-k MULTI query). It deals with multiple data types and finds the information in the order of relevance between the query and the object. Various ...
Spatial clustering and outlier analysis for the regionalization of maize cultivation in China
Regionalization has been the foundation of large-scale plantation and local optimization for crop cultivation. Current regionalization approaches practiced mainly rely on qualitative analysis and heuristic methods, which cannot meet the increasingly ...
Comments