skip to main content
10.1145/1459359.1459421acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Annotating personal albums via web mining

Authors Info & Claims
Published:26 October 2008Publication History

ABSTRACT

Nowadays personal albums are becoming more and more popular due to the explosive growth of digital image capturing devices. An effective automatic annotation system for personal albums is desired for both efficient browsing and search. Existing research on image annotation evolves through two stages: learning-based methods and web-based methods. Learning-based methods attempt to learn classifiers or joint probabilities between images and concepts, which are difficult to handle large-scale concept sets due to the lack of training data. Web-based methods leverage web image data to learn relevant annotations, which greatly expand the scale of concepts. However, they still suffer two problems: the query image lacks prior knowledge and the annotations are often noisy and incoherent. To address the above issues, we propose a web-based annotation approach to annotate a collection of photos simultaneously, instead of annotating them independently, by leveraging the abundant correlations among the photos. A multi-graph similarity propagation based semi-supervised learning (MGSP-SSL) algorithm is proposed to suppress the noises in the initial annotations from the Web. Experiments on real personal albums show that the proposed approach outperforms existing annotation methods.

References

  1. J, Sivic., A, Zisserman. Video Google: A text retrieval approach to object matching in video. In Proceedings of International Conference on Computer Vision (ICCV), 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. J. Li and J. Z. Wang, Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach. IEEE Trans. On Pattern Analysis and Machine Intelligence, 2003. 25(19): p.1075--1088. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. C. Cusano, G. Ciocca, and R. Schettini. Image Annotation Using SVM. Proceedings of Internet Imaging, Vol. SPIE 5304. 2004.Google ScholarGoogle Scholar
  4. S. L. Feng, R. Manmatha, and V. Lavrenko. Multiple bernoulli relevance models for image and video annotation. In Proc. of CVPR, Washington, DC, June, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. D. M. Blei and M. I. Jordan. Modeling annotated data. In Proc. SIGIR, Toronto, July. 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. C. Wang, F. Jing, L. Zhang and H.J. Zhang. Scalable search-based image annotation of personal images. In Proceedings of the 8th ACM international workshop on Multimedia information retrieval. ACM Press New York, NY, USA, 2006, 269--278 Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. X. Wang, L. Zhang, F. Jing and W. Y. Ma. AnnoSearch: Image Auto-Annotation by Search. International Conference on Computer Vision and Pattern Recognition. Washington, DC, 2006, 1483--1490. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. X. Rui, M. Li, Z. Li, W. Y. Ma, Yu, N. Bipartite Graph Reinforcement Model for web image annotation. In Proceedings of the 15th Annual ACM International Conference on Multimedia, 2007 Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. J. Cui, F. Wen, R. Xiao, Y. Tian, X. Tang. EasyAlbum: An Interactive Photo Annotation System Based on Face Clustering and Re-ranking. In Proceedings of SIGCHI, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. L. Zhang, L. Chen, M. Li, H. Zhang. Automated annotation of human faces in family albums. In Proceedings of ACM Multimedia, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. D. Zhou, J. Huang, and B. Schölkopf. Learning with Local and global consistency. 18th Annual Conference on Neural Information Processing System, 2003.Google ScholarGoogle Scholar
  12. J. Jeon, V. Lavrenko and R. Manmatha. Automatic Image Annotation and Retrieval Using Cross-media Relevance Models. In Proc. of SIGIR, Toronto, July 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. J Philbin, O Chum, M Isard, J Sivic, A Zisserman. Object retrieval with large vocabularies and fast spatial matching. CVPR, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  14. M. Wang, X. Hua, X. Yuan, Y. Song and L. Dai. Optimizing multi-graph learning: towards a unified video annotation scheme. Proceedings of the 15th international conference on Multimedia, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. J. Tang, X. Hua, T. Mei, G. Qi, S. Li and X. Wu. Temporally Consistent Gaussian Random Field for Video Semantic Analysis. IEEE International Conference on Image Processing, 2007.Google ScholarGoogle Scholar
  16. V. Lavrenko, R. Manmatha and J. Jeon. A Model for Learning the Semantics of Pictures. In Proc. NIPS, 2003.Google ScholarGoogle Scholar
  17. X. He, W. Y. Ma and H. J. Zhang. Learning an image manifold for retrieval. In Proc. of ACM international Conference On Multimedia, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. R. L. Cilibrasi, P. M. B. Vitányi. The google similarity distance. IEEE Trans. on Knowledge and data engineering, 2007 Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. J. Liu, M. Li, W. Y. Ma, Q. Liu and H. Lu An Adaptive Graph Model for Automatic Image Annotation. ACM Workshop on Multimedia Information Retrieval (MIR), 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. P. Duygulu and K. Barnard. Object recognition as machine translation: learning a lexicon for a fixed image vocabulary. In Proc. of ECCV, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. J. Kandola, J. Shawe-Taylor, N. Cristianini. Learning Semantic Similarity. Annual Conference on Neural Information Processing System, 2003.Google ScholarGoogle Scholar
  22. X. Wang, W. Y. Ma, G. Xue, X. Li. Multi-Model Similarity Propagation and its Application for Web Image Retrieval. In Proceedings of ACM International Conference on Multimedia, 2004 Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. D. Lowe. Local feature view clustering for 3D object recognition. In Proc. CVPR, 2001.Google ScholarGoogle ScholarCross RefCross Ref
  24. H. Lejsek, F. Ásmundsson, B. Jónsson, L. Amsaleg. Scalability of local image descriptors: a comparative study. ACM Multimedia 2006: 589--598. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. S. Boll, P. Sandhaus, A. Scherp, U. Westermann. Semantics, content, and structure of many for the creation of personal photo albums. ACM Multimedia 2007: 641--650 Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. X. Lian, L. Chen, J. X. Yu, G. Wang, G. Yu. Similarity Match Over High Speed Time-Series Streams. ICDE 2007: 1086--1095Google ScholarGoogle Scholar
  27. F. Golshani. EIC's Message: Multimedia is Correlated Media. IEEE MultiMedia 11(1): (2004) Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. A. Zunjarwad, H. Sundaram, L. Xie. Contextual wisdom: social relations and correlations for multimedia event annotation. ACM Multimedia 2007: 615--624 Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Y. Lin, H. Sundaram, Y. Chi, J. Tatemura, B. Tseng Detecting splogs via temporal dynamics using self-similarity analysis. TWEB 2(1): (2008)q Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. M. Cooper, J. Foote, A. Girgensohn, L. Wilcox. Temporal event clustering for digital photo collections. TOMCCAP 1(3): 269--288 (2005) Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. R. Zhang, Z. Zhang, M. Li, W. Y. Ma, H. Zhang. A Probabilistic Semantic Model for Image Annotation and Multi-Modal Image Retrieval. ICCV 2005: 846--851 Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. W. Klas, R. King. Context-Aware Multimedia. Encyclopedia of Multimedia 2006Google ScholarGoogle Scholar
  33. L. Hardman, J. Ossenbruggen. Creating meaningful multimedia presentations. ISCAS 2006Google ScholarGoogle Scholar
  34. J. Liu, B. Wang, M. Li, Z. Li, W. Y. Ma, H. Lu and S. Ma. Dual Cross-Media Relevance Model for Image Annotation. In Proceedings of the 15th Annual ACM International Conference on Multimedia 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. E. Chang, et al. CBSA: Content-Based Soft Annotation for Multimodal Image Retrieval Using Bayes Point Machines. CirSysVideo, 2003. 13(1): p. 26--38. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Annotating personal albums via web mining

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      MM '08: Proceedings of the 16th ACM international conference on Multimedia
      October 2008
      1206 pages
      ISBN:9781605583037
      DOI:10.1145/1459359

      Copyright © 2008 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 26 October 2008

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate995of4,171submissions,24%

      Upcoming Conference

      MM '24
      MM '24: The 32nd ACM International Conference on Multimedia
      October 28 - November 1, 2024
      Melbourne , VIC , Australia

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader