Abstract
In social annotation systems, users label digital resources with tags, which are freely chosen textual descriptions. Tags serve as additional metadata and are used to index, annotate and retrieve resources. Poor retrieval performance remains a major problem of most social tagging systems, owing to the ambiguity, redundancy and weak semantics of tags. Clustering is a useful tool for addressing these difficulties. Most research on tag clustering directly applies traditional clustering algorithms such as K-means or Hierarchical Agglomerative Clustering to tagging data, and these algorithms have inherent drawbacks such as sensitivity to initialization. In this paper, we instead make use of the approximate backbone of tag clustering results to find better tag clusters. In particular, we propose an APProximate backbonE-based Clustering algorithm for Tags (APPECT). The main steps of APPECT are: (1) we execute the K-means algorithm on a tag similarity matrix M times and collect a set of tag clustering results Z = {C_1, C_2, ..., C_M}; (2) we form the approximate backbone of Z by executing a greedy search; (3) we fix the approximate backbone as the initial tag clustering result and then assign the remaining tags to the corresponding clusters according to their similarity. Experimental results on three real-world datasets, namely MedWorm, MovieLens and Dmoz, demonstrate the effectiveness of the proposed method and its superiority over the traditional approaches.
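The three steps listed in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the greedy search, so step (2) is replaced here by a simpler stand-in that groups tags which every K-means run placed in the same cluster and keeps the k largest groups. The function names (kmeans, approximate_backbone, appect_sketch), the parameters (m, seed, iters) and the use of average similarity in step (3) are all assumptions made for illustration.

```python
import random
from collections import defaultdict


def kmeans(rows, k, rng, iters=20):
    """Plain Lloyd's K-means on row vectors; returns one label per row."""
    centers = [list(rows[i]) for i in rng.sample(range(len(rows)), k)]
    labels = [0] * len(rows)
    for _ in range(iters):
        # Assignment step: nearest center by squared Euclidean distance.
        labels = [
            min(range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(r, centers[c])))
            for r in rows
        ]
        # Update step: mean of members; an empty cluster keeps its old center.
        for c in range(k):
            members = [rows[i] for i, lab in enumerate(labels) if lab == c]
            if members:
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def approximate_backbone(runs, k):
    """Assumed stand-in for the greedy search in step (2).

    Tags with the same label signature across all runs were never separated;
    the k largest such groups are returned as the backbone.
    """
    groups = defaultdict(list)
    for tag in range(len(runs[0])):
        groups[tuple(run[tag] for run in runs)].append(tag)
    return sorted(groups.values(), key=len, reverse=True)[:k]


def appect_sketch(sim, k, m=10, seed=0):
    """sim is a square tag similarity matrix given as a list of lists."""
    rng = random.Random(seed)
    # Step 1: run K-means m times on the rows of the similarity matrix.
    runs = [kmeans(sim, k, rng) for _ in range(m)]
    # Step 2: form the (simplified) approximate backbone and fix it.
    backbone = approximate_backbone(runs, k)
    clusters = [list(group) for group in backbone]
    placed = {tag for group in backbone for tag in group}
    # Step 3: assign each remaining tag to the backbone group it is,
    # on average, most similar to (assumed assignment rule).
    for tag in range(len(sim)):
        if tag not in placed:
            best = max(
                range(len(backbone)),
                key=lambda c: sum(sim[tag][j] for j in backbone[c]) / len(backbone[c]),
            )
            clusters[best].append(tag)
    return [sorted(c) for c in clusters]


if __name__ == "__main__":
    # Toy similarity matrix with two blocks of three tags each.
    sim = [[1.0 if i == j else (0.9 if i // 3 == j // 3 else 0.1)
            for j in range(6)] for i in range(6)]
    print(appect_sketch(sim, k=2, m=5, seed=1))
```

Because the backbone is built only from agreement across runs, a single badly initialized K-means run cannot pull apart tags that the other runs also kept together; it can only shrink the backbone, leaving more tags to be placed in step (3).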
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zong, Y., Xu, G., Jin, P., Zhang, Y., Chen, E., Pan, R. (2011). APPECT: An Approximate Backbone-Based Clustering Algorithm for Tags. In: Tang, J., King, I., Chen, L., Wang, J. (eds) Advanced Data Mining and Applications. ADMA 2011. Lecture Notes in Computer Science, vol 7120. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25853-4_14
DOI: https://doi.org/10.1007/978-3-642-25853-4_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-25852-7
Online ISBN: 978-3-642-25853-4
eBook Packages: Computer Science, Computer Science (R0)