Abstract
Spatial outlier detection is usually done in local neighbourhoods. We use an interpretable similarity measure to calculate the similarity between the regions of influence of spatial objects using spatial and non spatial attributes and relations both in the semantic description and in the similarity measure. A systematic method for similarity threshold selection is presented that can be used to categorize objects by their behavioral pattern in semantically similar neighborhoods. This paper’s main contribution is developing a multivariate two level outlier detection method. Real world data is used to evaluate our method.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ramaswamy S, Rastogi R, Shim K (2000) Efficient algorithms for mining outliers from large data sets. In: Proceedings of the ACM SIGMOD
Bay SD, Schwabacher M (2003) Mining distance-based outliers in near linear time with randomization and a simple pruning. In: Proceedings of the ACM SIGKDD
Hu T, Sung Y (2003) Detecting pattern-based outliers. Science Direct- Elsevier, Amsterdam
Shekhar S, Lu C, Zhang P (2001) Detecting graph-based spatial outlier: algorithms and applications (a summary of results). Technical report Computer Science and Engineering Department, UMN, pp 01–014
Adam NR, Janeja VP, Atluri V (2004) Neighborhood based detection of anomalies in high dimensional spatio-temporal Sensor Datasets. ACM symposium on applied computing
Barua S, Alhajj R (2007) A parallel multi-scale region outlier mining algorithm for meteorological data. In: Proceedings of the ACM GIS
Lu CT, Kou Y, Zhao J, Chen L (2006) Detecting and tracking regional outliers in meteorological data. Science Direct, Elsevier, Amsterdam
Jin W, Jiang Y, Qian W, Tung AKH (2006) Mining outliers in spatial networks. In: Proceedings of international conference on database systems for advanced applications
Kou Y, Lu CT, Chen D (2006) Spatial weighted outlier detection. In: Proceedings of the SDM
Janeja VP, Atluri V, Adam NR (2004) Detecting anomalous geospatial trajectories through spatial characterization and spatio-semantic associations. ACM symposium on applied computing
Janeja VP (2007) Anomaly detection in heterogeneous datasets. Rutgers, The State University of New Jersey, Newark
McCane B, Alber M (2007) Distance functions for categorical and mixed variables. Patt Recog Lett 29:985–993
Chandola V, Boriah SH, Kumar V (2008) Understanding categorical similarity measures for outlier detection. CS technical report 08-008, Computer Science Department, University of Minnesota
Yoo C, MacInnis DJ (2004) Same or different? how distance and variation affect similarity judgments. Psychol Mark 21:209–277. doi:10.1002/mar.20002
Okabe A, Boots B, Sugihara K, Chiu S (2000) Spatial tessellations: concepts and applications of Voronoi diagrams. Wiley, West Sussex, pp 291–410
Quinlan J (1993) C4.5: programs for machine learning. Morgan Kaufmann, San Fransisco
Barnett V, Lewis T (1994) Outliers in statistical data, 3rd edn. Wiley, West Sussex
Agrawal R, Srikant R (1994) Fast algorithms for mining association rules. In: Proceedings of very large databases
Jain A, Dubes R (1988) Algorithms for clustering data. Prentice Hall, Upper Saddle River
Kullback S, Leibler R (1951) On information and sufficiency. Ann Math Stat 22:786
Lin J (1991) Divergence measures based on the Shannon entropy. IEEE Trans Inf Theo 37(1):145–151
Rached Z, Alajaji F, Campbell L (2001) Rényis divergence and entropy rates for finite alphabet markov sources. IEEE Trans Inf Theo 47(4):1553–1561
Le SQ, Ho TB (2005) An association-based dissimilarity measure for categorical data. Elsevier Patt Recog Lett 26:2549–2557
Krumhansl CL (1978) Concerning the applicability of geometric models to similarity data: the interrelationship between similarity and spatial density. Psychol Rev 85:445–63
Appleman IB, Mayzner P (1982) Application of geometric models to letter recognition: Distance and density. J Exp Psychol Gen 111:60–100
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer Science+Business Media B.V.
About this paper
Cite this paper
Azam, F., Baraani-Dastjerdi, A. (2012). A Multivariate Spatial Outlier Detection Method Based on Semantic Similarity. In: J. (Jong Hyuk) Park, J., Chao, HC., S. Obaidat, M., Kim, J. (eds) Computer Science and Convergence. Lecture Notes in Electrical Engineering, vol 114. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-2792-2_21
Download citation
DOI: https://doi.org/10.1007/978-94-007-2792-2_21
Published:
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-007-2791-5
Online ISBN: 978-94-007-2792-2
eBook Packages: EngineeringEngineering (R0)