Abstract
With the era of Big data, People, companies and devices are all becoming factories of data which generate an incredible amount of information. In several real-life applications, exploiting data with spatial characteristics is of great interest. For this purpose, different approaches have been proposed to deal efficiently with this kind of data (e.g., Spatial-Hadoop, Location-Spark). Spatial index based on R-tree is one of the modern solutions proposed to accelerate access to the desired information. Exploring such index could seriously impact performance. In this paper, we propose a novel approach to explore the R-tree spatial index, making it possible to minimise the number of disk accesses and consequently the execution time. Our experiments show that our approach outperforms the standard approach used by existing systems.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Aji, A., et al.: Hadoop GIS: a high performance spatial data warehousing system over MapReduce. Proc. VLDB Endow. 6(11), 1009–1020 (2013)
Anselin, L., Syabri, I., Kho, Y.: GeoDa: an introduction to spatial data analysis. In: Fischer, M., Getis, A. (eds.) Handbook of Applied Spatial Analysis. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-03647-7_5
Beckmann, N., Kriegel, H.P., Schneider, R., Seeger, B.: The R*-tree: an efficient and robust access method for points and rectangles. ACM SIGMOD Record 19, 322–331 (1990)
Brinkhoff, T., Kriegel, H.P., Seeger, B.: Efficient processing of spatial joins using R-trees, vol. 22. ACM (1993)
Connor, M., Kumar, P.: Fast construction of k-nearest neighbor graphs for point clouds. IEEE Trans. Visual Comput. Graphics 16(4), 599–608 (2010)
Eldawy, A., Mokbel, M.F.: SpatialHadoop: a mapreduce framework for spatial data. In: 2015 IEEE 31st ICDE Conference, pp. 1352–1363. IEEE (2015)
Jain, V., Lennon, J., Gupta, H.: LSM-trees and B-trees: the best of both worlds. In: Proceedings of the 2019 SIGMOD Conference, pp. 1829–1831 (2019)
Kim, K., Cha, S.K., Kwon, K.: Optimizing multidimensional index trees for main memory access. ACM SIGMOD Record 30, 139–150 (2001)
Lemire, D., Kaser, O.: Reordering columns for smaller indexes. Inf. Sci. 181(12), 2550–2570 (2011)
Leutenegger, S.T., Lopez, M.A., Edgington, J.: STR: a simple and efficient algorithm for R-tree packing. In: 13th ICDE Conference, pp. 497–506. IEEE (1997)
Leutenegger, S.T., Nicol, D.M.: Efficient bulk-loading of gridfiles. IEEE Trans. Knowl. Data Eng. 9(3), 410–420 (1997)
Lu, W., Shen, Y., Chen, S., Ooi, B.C.: Efficient processing of K nearest neighbor joins using MapReduce. Proc. VLDB Endow. 5(10), 1016–1027 (2012)
Misauer, L.: IoT, big data and AI - the new ‘superpowers’ in the digital universe (2017). https://curious.stratford.edu/2017/10/10/iot-big-data-and-ai-the-new-superpowers-in-the-digital-universe/
Roumelis, G., Vassilakopoulos, M., Corral, A., Manolopoulos, Y.: An efficient algorithm for bulk-loading xBR+-trees. Comput. Stand. Interfaces 57, 83–100 (2018)
Santos, L., Coutinho-Rodrigues, J., Antunes, C.H.: A web spatial decision support system for vehicle routing using google maps. Decis. Support Syst. 51(1), 1–9 (2011)
Silberschatz, A., Korth, H.F., Sudarshan, S., et al.: Database System Concepts, vol. 4. McGraw-Hill, New York (1997)
Tang, M., Yu, Y., Malluhi, Q.M., Ouzzani, M., Aref, W.G.: Locationspark: a distributed in-memory data management system for big spatial data. Proc. VLDB Endow. 9(13), 1565–1568 (2016)
Verlet, L.: Computer “experiments’’ on classical fluids. i. thermodynamical properties of Lennard-Jones molecules. Phys. Rev. 159(1), 98 (1967)
Vohra, D.: Practical Hadoop Ecosystem: A Definitive Guide to Hadoop-Related Frameworks and Tools. Apress, Berkely (2016)
Wang, F., et al.: A data model and database for high-resolution pathology analytical image informatics. J. Pathol. Inf. 2, 36 (2011)
Xie, D., Li, F., Yao, B., Li, G., Zhou, L., Guo, M.: Simba: efficient in-memory spatial analytics. In: SIGMOD 2016, pp. 1071–1085. ACM (2016)
Zaharia, M., Chowdhury, M., Franklin, M.J., Shenker, S., Stoica, I.: Spark: cluster computing with working sets. HotCloud 10(10–10), 95 (2010)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Yousfi, H., Mesmoudi, A., Hadjali, A., Matallah, H., Lahfa, F. (2022). Efficient R-Tree Exploration for Big Spatial Data. In: Kacprzyk, J., Balas, V.E., Ezziyyani, M. (eds) Advanced Intelligent Systems for Sustainable Development (AI2SD’2020). AI2SD 2020. Advances in Intelligent Systems and Computing, vol 1418. Springer, Cham. https://doi.org/10.1007/978-3-030-90639-9_70
Download citation
DOI: https://doi.org/10.1007/978-3-030-90639-9_70
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-90638-2
Online ISBN: 978-3-030-90639-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)