Skip to main content

A Multivariate Spatial Outlier Detection Method Based on Semantic Similarity

  • Conference paper
  • First Online:
Computer Science and Convergence

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 114))

  • 1404 Accesses

Abstract

Spatial outlier detection is usually done in local neighbourhoods. We use an interpretable similarity measure to calculate the similarity between the regions of influence of spatial objects using spatial and non spatial attributes and relations both in the semantic description and in the similarity measure. A systematic method for similarity threshold selection is presented that can be used to categorize objects by their behavioral pattern in semantically similar neighborhoods. This paper’s main contribution is developing a multivariate two level outlier detection method. Real world data is used to evaluate our method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 329.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Ramaswamy S, Rastogi R, Shim K (2000) Efficient algorithms for mining outliers from large data sets. In: Proceedings of the ACM SIGMOD

    Google Scholar 

  2. Bay SD, Schwabacher M (2003) Mining distance-based outliers in near linear time with randomization and a simple pruning. In: Proceedings of the ACM SIGKDD

    Google Scholar 

  3. Hu T, Sung Y (2003) Detecting pattern-based outliers. Science Direct- Elsevier, Amsterdam

    Google Scholar 

  4. Shekhar S, Lu C, Zhang P (2001) Detecting graph-based spatial outlier: algorithms and applications (a summary of results). Technical report Computer Science and Engineering Department, UMN, pp 01–014

    Google Scholar 

  5. Adam NR, Janeja VP, Atluri V (2004) Neighborhood based detection of anomalies in high dimensional spatio-temporal Sensor Datasets. ACM symposium on applied computing

    Google Scholar 

  6. Barua S, Alhajj R (2007) A parallel multi-scale region outlier mining algorithm for meteorological data. In: Proceedings of the ACM GIS

    Google Scholar 

  7. Lu CT, Kou Y, Zhao J, Chen L (2006) Detecting and tracking regional outliers in meteorological data. Science Direct, Elsevier, Amsterdam

    Google Scholar 

  8. Jin W, Jiang Y, Qian W, Tung AKH (2006) Mining outliers in spatial networks. In: Proceedings of international conference on database systems for advanced applications

    Google Scholar 

  9. Kou Y, Lu CT, Chen D (2006) Spatial weighted outlier detection. In: Proceedings of the SDM

    Google Scholar 

  10. Janeja VP, Atluri V, Adam NR (2004) Detecting anomalous geospatial trajectories through spatial characterization and spatio-semantic associations. ACM symposium on applied computing

    Google Scholar 

  11. Janeja VP (2007) Anomaly detection in heterogeneous datasets. Rutgers, The State University of New Jersey, Newark

    Google Scholar 

  12. McCane B, Alber M (2007) Distance functions for categorical and mixed variables. Patt Recog Lett 29:985–993

    Google Scholar 

  13. Chandola V, Boriah SH, Kumar V (2008) Understanding categorical similarity measures for outlier detection. CS technical report 08-008, Computer Science Department, University of Minnesota

    Google Scholar 

  14. Yoo C, MacInnis DJ (2004) Same or different? how distance and variation affect similarity judgments. Psychol Mark 21:209–277. doi:10.1002/mar.20002

    Google Scholar 

  15. Okabe A, Boots B, Sugihara K, Chiu S (2000) Spatial tessellations: concepts and applications of Voronoi diagrams. Wiley, West Sussex, pp 291–410

    Google Scholar 

  16. Quinlan J (1993) C4.5: programs for machine learning. Morgan Kaufmann, San Fransisco

    Google Scholar 

  17. Barnett V, Lewis T (1994) Outliers in statistical data, 3rd edn. Wiley, West Sussex

    MATH  Google Scholar 

  18. Agrawal R, Srikant R (1994) Fast algorithms for mining association rules. In: Proceedings of very large databases

    Google Scholar 

  19. Jain A, Dubes R (1988) Algorithms for clustering data. Prentice Hall, Upper Saddle River

    MATH  Google Scholar 

  20. Kullback S, Leibler R (1951) On information and sufficiency. Ann Math Stat 22:786

    Article  MathSciNet  Google Scholar 

  21. Lin J (1991) Divergence measures based on the Shannon entropy. IEEE Trans Inf Theo 37(1):145–151

    Article  MATH  Google Scholar 

  22. Rached Z, Alajaji F, Campbell L (2001) Rényis divergence and entropy rates for finite alphabet markov sources. IEEE Trans Inf Theo 47(4):1553–1561

    Article  MathSciNet  MATH  Google Scholar 

  23. Le SQ, Ho TB (2005) An association-based dissimilarity measure for categorical data. Elsevier Patt Recog Lett 26:2549–2557

    Google Scholar 

  24. Krumhansl CL (1978) Concerning the applicability of geometric models to similarity data: the interrelationship between similarity and spatial density. Psychol Rev 85:445–63

    Article  Google Scholar 

  25. Appleman IB, Mayzner P (1982) Application of geometric models to letter recognition: Distance and density. J Exp Psychol Gen 111:60–100

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Fatemeh Azam .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer Science+Business Media B.V.

About this paper

Cite this paper

Azam, F., Baraani-Dastjerdi, A. (2012). A Multivariate Spatial Outlier Detection Method Based on Semantic Similarity. In: J. (Jong Hyuk) Park, J., Chao, HC., S. Obaidat, M., Kim, J. (eds) Computer Science and Convergence. Lecture Notes in Electrical Engineering, vol 114. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-2792-2_21

Download citation

  • DOI: https://doi.org/10.1007/978-94-007-2792-2_21

  • Published:

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-94-007-2791-5

  • Online ISBN: 978-94-007-2792-2

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics