ABSTRACT
Although tagging has become increasingly popular in online image and video sharing systems, tags are known to be noisy, ambiguous, incomplete and subjective. These factors can seriously affect the precision of a social tag-based web retrieval system. Therefore improving the precision performance of these social tag-based web retrieval systems has become an increasingly important research topic. To this end, we propose a shared subspace learning framework to leverage a secondary source to improve retrieval performance from a primary dataset. This is achieved by learning a shared subspace between the two sources under a joint Nonnegative Matrix Factorization in which the level of subspace sharing can be explicitly controlled. We derive an efficient algorithm for learning the factorization, analyze its complexity, and provide proof of convergence. We validate the framework on image and video retrieval tasks in which tags from the LabelMe dataset are used to improve image retrieval performance from a Flickr dataset and video retrieval performance from a YouTube dataset. This has implications for how to exploit and transfer knowledge from readily available auxiliary tagging resources to improve another social web retrieval system. Our shared subspace learning framework is applicable to a range of problems where one needs to exploit the strengths existing among multiple and heterogeneous datasets.
Supplemental Material
- http://code.google.com/apis/youtube/overview.html. Accessed in Oct, 2009.Google Scholar
- http://www.flickr.com/services/api/. Accessed in July, 2009.Google Scholar
- H.D. Abdulla, M. Polovincak, and V. Snasel. Search results clustering using nonnegative matrix factorization (nmf). ASONAM '09, pages 320--323, July 2009. Google ScholarDigital Library
- M.W. Berry and M. Browne. Email surveillance using non-negative matrix factorization. Computational & Mathematical Organization Theory, 11(3):249--264, 2005. Google ScholarDigital Library
- R. Caruana. Multitask learning. Machine Learning, 28(1):41--75,1997. Google ScholarDigital Library
- A.P. Dempster, N.M. Laird, D.B. Rubin, et al. Maximum likelihood from incomplete data via the em algorithm. Journal of the Royal Statistical Society. Series B (Methodological), 39(1):1--38, 1977.Google ScholarCross Ref
- L. Fei-Fei, R. Fergus, and P. Perona. One-shot learning of object categories. PAMI, 28(4):594--611, 2006. Google ScholarDigital Library
- S.A. Golder and B.A. Huberman. Usage patterns of collaborative tagging systems. Journal of Information Science, 32(2):198, 2006. Google ScholarDigital Library
- D.R. Hardoon, S. Szedmak, and J. Shawe-Taylor. Canonical correlation analysis: an overview with application to learning methods. Neural Computation, 16(12):2639--2664, 2004. Google ScholarDigital Library
- P.O. Hoyer. Non-negative matrix factorization with sparseness constraints. The Journal of Machine Learning Research, 5:1457--1469, 2004. Google ScholarDigital Library
- M.S. Kankanhalli and Y. Rui. Application potential of multimedia information retrieval. Proceedings of the IEEE, 96(4):712, 2008.Google ScholarCross Ref
- J.R. Kettenring. Canonical analysis of several sets of variables. Biometrika, 58(3):433--451, 1971.Google ScholarCross Ref
- D.D. Lee and H.S. Seung. Algorithms for non-negative matrix factorization. In Advances in Neural Information Processing, 2000.Google Scholar
- X. Li, C. G. M. Snoek, and M.Worring. Learning social tag relevance by neighbor voting. IEEE Transactions on Multimedia, in press, 2009. Google ScholarDigital Library
- X. Li, C.G.M. Snoek, and M. Worring. Annotating images by harnessing worldwide user-tagged photos. ICASSP. Taipei, Taiwan,2009. Google ScholarDigital Library
- Y.R. Lin, H. Sundaram, M. De Choudhury, and A. Kelliher. Temporal patterns in social media streams: Theme discovery and evolution using joint analysis of content and context. In ICME 2009, pages 1456--1459, 2009. Google ScholarDigital Library
- K. V. Mardia, J. M. Bibby, and J. T. Kent. Multivariate analysis.Academic Press, New York, 1979.Google Scholar
- C. Marlow, M. Naaman, D. Boyd, and M. Davis. Ht06, tagging paper, taxonomy, flickr, academic article, toread. Proceedings of the seventeenth Conference on Hypertext and Hypermedia, pages 31--40,2006. Google ScholarDigital Library
- S.J. Pan and Q. Yang. A survey on transfer learning. Technical Report HKUST-CS08-08, Department of Computer Science and Engineering, HKUST, Hong Kong, China, 2008.Google Scholar
- R. Raina, A. Battle, H. Lee, B. Packer, and A.Y. Ng. Self-taught learning: Transfer learning from unlabeled data. Proceedings of the 24th International Conference on Machine Learning, page 766, 2007. Google ScholarDigital Library
- B.C. Russell, A. Torralba, K.P. Murphy, and W.T. Freeman. Labelme:a database and web-based tool for image annotation. International Journal of Computer Vision, 77(1):157--173, 2008. Google ScholarDigital Library
- G. Salton and C. Buckley. Term-weighting approaches in automatic text retrieval. Information Processing & Management, 24(5):513--523, 1988. Google ScholarDigital Library
- F. Shahnaz, M.W. Berry, V.P. Pauca, and R.J. Plemmons. Document clustering using nonnegative matrix factorization. Information Processing and Management, 42(2):373--386, 2006. Google ScholarDigital Library
- B. Sigurbjörnsson and R. Van Zwol. Flickr tag recommendation based on collective knowledge. Proceeding of ACM International World Wide Web Conference, 2008. Google ScholarDigital Library
- C. Wang, F. Jing, L. Zhang, and H.J. Zhang. Scalable search-based image annotation. Multimedia Systems, 14(4):205--220, 2008.Google ScholarDigital Library
- X. Wang, C. Pal, and A. McCallum. Generalized component analysis for text with heterogeneous attributes. Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, page 803, 2007. Google ScholarDigital Library
- L. Wu, L. Yang, N. Yu, and X.S. Hua. Learning to tag. Proceedings of the 18th International Conference on World Wide Web, pages 361--370, 2009. Google ScholarDigital Library
- Z.Wu, C.W. Cheng, and C. Li. Social and semantics analysis via nonnegative matrix factorization. Proceedings of the 17th International Conference on World Wide Web, 2008. Google ScholarDigital Library
Index Terms
- Nonnegative shared subspace learning and its application to social media retrieval
Recommendations
Automatic creation of photo books from stories in social media
WSM '10: Proceedings of second ACM SIGMM workshop on Social mediaPhotos are a special way to tell stories of our best memories. The representation of those photos in appealing physical photo books is highly appreciated by many people. Today, many photos are shared via social networking sites, where people upload ...
Automatic creation of photo books from stories in social media
Special section on ACM multimedia 2010 best paper candidates, and issue on social mediaPhotos are a special way to tell stories of our best memories and moments. The representation of those photos in appealing physical photo books is highly appreciated by many people. Today, many photos are shared via social networking sites, where people ...
Social media engagement
This study unearths the motives for YouTube user engagement that has been conceptualized as active participation and passive content consumption. In light of the Uses and Gratifications framework, a sample of 1143 registered YouTube users completed ...
Comments