ABSTRACT
In this paper, we proposed a perspective Hierarchical Dirichlet Process (pHDP) model to deal with user-tagged image modeling. The contribution is two-fold. Firstly, we associate image features with image tags. Secondly, we incorporate the user's perspectives into the image tag generation process and introduce new latent variables to determine if an image tag is generated from user's perspectives or from the image content. Therefore, the model is able to extract both embedded semantic components and user's perspectives from user-tagged images. Based on the proposed pHDP model, we achieve automatic image tagging with users' perspective. Experimental results show that the pHDP model achieves better image tagging performance compared to state-of-the-art topic models.
- D.M. Blei, and M.I. Jordan, Modeling annotated data The 26th annual international ACM SIGIR conference on Research and development in informaion retrieval, ACM, Toronto, Canada, 2003, pp. 127--134. Google ScholarDigital Library
- Henderson, J.M. and Hollingworth, A. High level scene perception. Annual Review of Psychology, 50:243--271, 1999.Google ScholarCross Ref
- C. Siagian and L. Itti, Rapid Biologically-Inspired Scene Classification Using Features Shared with Visual Attention, IEEE TPAMI, pp. 300--312, 2007. Google ScholarDigital Library
- A. W. M. Smeulders, M. Worring, S. Santini, A. Gupta, and R. Jain, Content-based image rerieval at the end of the early years, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 22, no. 12, pp. 1349--1380, 2000. Google ScholarDigital Library
- Y. Teh, M. Jordan, M. Beal, and D. Blei. Hierarchical Dirichlet process. Journal of the American Statistical Association, 101(476):1566--1581, 2006Google ScholarCross Ref
- K. Bischoff, C.S. Firan, W. Nejdl, and R. Paiu, Can All Tags be Used for Search?, CIKM'08, Napa Valley, California, USA, 2008, pp. 203--212. Google ScholarDigital Library
- S. Sen, S.K.T. Lam, A.M. Rashid, D. Cosley, D. Frankowski, J. Osterhouse, F.M. Harper, and J. Riedl, Tagging, communities, vocabulary, evolution, CSCW'06, Banff, Alberta, Canada, 2006. Google ScholarDigital Library
- Amr Ahmed, Eric P. Xing, William W. Cohen, Robert F. Murphy, Structured Correspondence topic models for mining captioned figures in biomedical literature, Proceedings of the 15th ACM SIGKDD International conference on Knowledge discovery and data mining, June 28-July 01, 2009, Paris, France. Google ScholarDigital Library
- X. Chen, C. Lu, Y. An, and P. Achananuparp. Probabilistic Models for Topic Learning from Images and Captions in Online Biomedical Literatures. In the Proceedings of 18th ACM Conference on Information and Knowledge Management (CIKM'09) Google ScholarDigital Library
- D. Zhou, J. Bian, S. Zheng, H. Zha, and C.L. Giles, Exploring Social Annotations for Information Retrieval, WWW 2008, Beijing, China, 2008, pp. 715--724. Google ScholarDigital Library
- C. Lu, X. Hu, X. Chen and J. Park. The topic-perspective model for social tagging systems, The 16th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'10), July 25--28, 2010, Washington D.C., USA. pp. 683--692. Google ScholarDigital Library
- Sivic, J., Zisserman, A.: Video Google: A Text Retrieval Approach to Object Matching in Videos. International Conference on Computer Vision. (2003) 1470-- 1477 Google ScholarDigital Library
- J. Matas, O. Chum, U. M., T. Pajdla. Robust wide baseline stereo from maximally stable extremal regions. In BMVC, 2002.Google ScholarCross Ref
Index Terms
- Perspective hierarchical dirichlet process for user-tagged image modeling
Recommendations
Tagging tagged images: on the impact of existing annotations on image tagging
CrowdMM '12: Proceedings of the ACM multimedia 2012 workshop on Crowdsourcing for multimediaCrowdsourcing has been widely used to generate metadata for multimedia resources. By presenting partially described resources to human annotators, resources are tagged yielding better descriptions. Although significant improvements in metadata quality ...
Image retagging
MM '10: Proceedings of the 18th ACM international conference on MultimediaOnline social media repositories such as Flickr and Zooomr allow users to manually annotate their images with freely-chosen tags, which are then used as indexing keywords to facilitate image search and other applications. However, these tags are ...
The accuracy and value of machine-generated image tags: design and user evaluation of an end-to-end image tagging system
CIVR '10: Proceedings of the ACM International Conference on Image and Video RetrievalAutomated image tagging is a problem of great interest, due to the proliferation of photo sharing services. Researchers have achieved considerable advances in understanding motivations and usage of tags, recognizing relevant tags from image content, and ...
Comments