Abstract
Let a tuple of n objects obeying a query graph (QG) be called the n-tuple. The “D_distance-value” of this n-tuple is the value of a linear function of distances of the n objects that make up this n-tuple, according to the edges of the QG. This paper addresses the problem of finding the K n-tuples between n spatial datasets that have the smallest D_distance-values, the so-called K-multi-way distance join query (K-MWDJQ), where each set is indexed by an R-tree-based structure. This query can be viewed as an extension of K-closest-pairs query (K-CPQ) [8] for n inputs. In addition, a recursive non-incremental branch-and-bound algorithm following a depth-first search for processing synchronously all inputs without producing any intermediate result is proposed. Enhanced pruning techniques are also applied to n R-trees nodes in order to reduce the total response time and the number of distance computations of the query. Due to the exponential nature of the problem, we also propose a time-based approximate version of the recursive algorithm that combines approximation techniques to adjust the quality of the result and the global processing time. Finally, we give a detailed experimental study of the proposed algorithms using real spatial datasets, highlighting their performance and the quality of the approximate results.
Similar content being viewed by others
References
S. Berchtold, B. Ertl, D. Keim, H.P. Kriegel, and T Seidl. “Fast nearest neighbor search in high-dimensional spaces,” Proceedings of 14th International Conference on Data Engineering (ICDE'98), 209–218, 1998.
N. Beckmann, H.P. Kriegel, R. Schneider, and B. Seeger. “The R*-tree: An efficient and robust access method for points and rectangles,” Proceedings of ACM SIGMOD Conference, 322–331, 1990.
T Brinkhoff, H.P. Kriegel, and B. Seeger. “Efficient processing of spatial joins using R-trees,” Proceedings of ACM SIGMOD Conference, 237–246, 1993.
C. Böhm and F. Krebs. “High performance data mining using the nearest neighbor join,” Proceedings of 2nd International Conference on Data Mining (ICDM'02), 43–50, 2002.
P. Brown. Object-Relational Database Development: A Plumber's Guide. Prentice Hall, Upper Saddle River, NJ, 2001.
A. Corral, J. Canadas, and M. Vassilakopoulos. “Approximate algorithms for distance-based queries in high-dimensional data spaces using R-trees,” Proceedings of 6th Conference on Advances in Databases and Information Systems (ADBIS'02), 163–176, 2002.
A. Corral, Y. Manolopoulos, Y. Theodoridis, and M. Vassilakopoulos. “Closest pair queries in spatial databases,” Proceedings of ACM SIGMOD Conference, 189–200, 2000.
A. Corral, Y. Manolopoulos, Y. Theodoridis, and M. Vassilakopoulos. “Algorithms for processing K-closest-pair queries in spatial databases,” Data & Knowledge Engineering, Vol. 49(1): 67–104, 2004.
Digital Chart of the World: Real spatial datasets of the world at 1:1,000,000 scale. 1997. Downloadable from: http://www.maproom.psu.edu/dew.
V. Gaede and O. Gunther. “Multidimensional access methods,” ACM Computing Surveys, Vol. 30(2):170–231, 1998.
A. Guttman. “R-trees: A dynamic index structure for spatial searching,” Proceedings of ACM SIGMOD Conference, 47–57, 1984.
Y.W. Huang, N. Jing, and E.A. Rundensteiner. “Spatial joins using R-trees: Breadth-first traversal with global optimizations,” Proceedings of 23rd VLDB Conference, 396–405, 1997.
G.R. Hjaltason and H. Samet. “Incremental distance join algorithms for spatial databases,” Proceedings of ACM SIGMOD Conference, 237–248, 1998.
G.R. Hjaltason and H. Samet. “Distance browsing in spatial databases,” ACM Transactions on Database Systems, Vol. 24(2):265–318, 1999.
N. Koudas and K.C. Sevcik. “Size separation spatial join,” Proceedings of ACM SIGMOD Conference, 324–335,1997.
N. Koudas and K.C. Sevcik. “High dimensional similarity joins: Algorithms and performance evaluation,” Proceedings of 14th International Conference on Data Engineering (ICDE'98), 466–475, 1998.
R. Laurini and C Thomson. Fundamentals of Spatial Information System. Academic Press, London, 1992.
M.L. Lo and C.V. Ravishankar. “Spatial joins using seeded trees,” Proceedings of ACM SIGMOD Conference, 209–220, 1994.
M.L. Lo and C.V. Ravishankar. “Spatial hash-joins,” Proceedings of ACM SIGMOD Conference, 247–258, 1996.
N. Mamoulis and D. Papadias. “Integration of spatial join algorithms for processing multiple inputs,” Proceedings of ACM SIGMOD Conference, 1–12, 1999.
N. Mamoulis and D. Papadias. “Multiway spatial joins,” ACM Transactions on Database Systems, Vol. 26(4):424–475, 2001.
N. Mamoulis and D. Papadias. “Slot index spatial join,” IEEE Transactions on Knowledge and Data Engineering, Vol. 15(1):211–231, 2003.
Y. Manolopoulos, Y. Theodoridis, and V. Tsotras. Advanced Database Indexing. Kluwer Academic Publishers, Boston, 1999.
Oracle Technology Network: “Oracle Spatial”, an Oracle Technical White Paper, 2001. Downloadable from: http://otn.oracle.com/products/oracle9i/pdf/OracleSpatial.pdf.
J. O'Rourke. Computational Geometry in C. Cambridge University Press, Cambridge, 1998.
D. Papadias and D. Arkoumanis. “Approximate processing of multiway spatial joins in very large databases,” Proceedings of 8th EDBT Conference, 179–196, 2002.
J.M. Patel and D.J. DeWitt. “Partition based spatial-merge join,” Proceedings of ACM SIGMOD Conference, 259–270, 1996.
H.H. Park, G.H. Cha, and C.W. Chung. “Multi-way spatial joins using R-trees: Methodology and performance evaluation,” Proceedings of 6th International Symposium on Spatial Databases (SSD'99), 229–250,1999.
D. Papadias, N. Mamoulis, and Y. Theodoridis. “Processing and optimization of multiway spatial joins using R-trees,” Proceedings of 18th ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems (PODS'99), 44–55, 1999.
F.P. Preparata and M.I. Shamos. Computational Geometry: an Introduction. Springer-Verlag, New York, 1985.
A. Papadopoulos, P. Rigaux, and M. Scholl. “A performance evaluation of spatial join processing strategies,” Proceedings of 6th Symposium on Large Spatial Databases (SSD'99), 286–307, 1999.
N. Roussopoulos, S. Kelley, and F. Vincent. “Nearest neighbor queries,” Proceedings of ACM SIGMOD Conference, 71–79, 1995.
S. Shekhar, S. Chawla, S. Rivada, A. Fetterer, X. Lui, and C. Lu. “Spatial databases, accomplishments and research needs,” IEEE Transactions on Knowledge and Data Engineering, Vol. 11(1):45–55, 1999.
T. Seidl and H.P. Kriegel. “Optimal multi-step K-nearest neighbor search,” Proceedings of ACM SIGMOD Conference, 154–165, 1998.
Y. Shou, N. Mamoulis, H. Cao, D. Papadias, and D.W. Cheung. “Evaluation of iceberg distance joins,” Proceedings of 8th Symposium on Spatial and Temporal Databases (SSTD'03), 270–288, 2003.
H. Shin, B. Moon, and S. Lee. “Adaptive multi-stage distance join processing,” Proceedings of ACM SIGMOD Conference, 343–354, 2000.
C. Yang and K.I. Lin. “An index structure for improving nearest closest pairs and related join queries in spatial databases,” Proceedings of International Database Engineering and Applications Symposium (IDEAS'02), 140–149, 2002.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Corral, A., Manolopoulos, Y., Theodoridis, Y. et al. Multi-Way Distance Join Queries in Spatial Databases. GeoInformatica 8, 373–402 (2004). https://doi.org/10.1023/B:GEIN.0000040832.25622.8d
Issue Date:
DOI: https://doi.org/10.1023/B:GEIN.0000040832.25622.8d