Scalable landmark recognition using EXTENT

Multimedia Tools and Applications

Abstract

We have proposed the EXTENT system for automated photograph annotation using image content and context analysis. A key component of EXTENT is a landmark recognition system called LandMarker. In this paper, we present the architecture of LandMarker. The content of a query photograph is analyzed and compared against a database of sample landmark images to recognize any landmarks it contains. We present an algorithm for comparing a query image with a sample image. Context information may also be used to assist landmark recognition. We further show how LandMarker addresses scalability to allow recognition of a large number of landmarks. We have implemented a prototype of the system and present empirical results on a large dataset.



Author information

Corresponding author

Correspondence to Arun Qamra.

About this article

Cite this article

Qamra, A., Chang, E.Y. Scalable landmark recognition using EXTENT. Multimed Tools Appl 38, 187–208 (2008). https://doi.org/10.1007/s11042-007-0178-8

  • DOI: https://doi.org/10.1007/s11042-007-0178-8
