ABSTRACT
Automatic folksonomy construction from tags has attracted much attention recently. However, inferring hierarchical relations between concepts from tags has a drawback in that it is difficult to distinguish between more popular and more general concepts. Instead of tags we propose to use user-specified relations for learning folksonomy. We explore two statistical frameworks for aggregating many shallow individual hierarchies, expressed through the collection/set relations on the social photosharing site Flickr, into a common deeper folksonomy that reflects how a community organizes knowledge. Our approach addresses a number of challenges that arise while aggregating information from diverse users, namely noisy vocabulary, and variations in the granularity level of the concepts expressed. Our second contribution is a method for automatically evaluating learned folksonomy by comparing it to a reference taxonomy, e.g., the Web directory created by the Open Directory Project. Our empirical results suggest that user-specified relations are a good source of evidence for learning folksonomies.
- M. Abramowitz and I. A. Stegun. Handbook of Mathematical Functions with Formulas, Graphs, and Mathematical Tables. Dover, New York, ninth dover printing, tenth gpo printing edition, 1964. Google ScholarDigital Library
- R. K. Ahuja, T. L. Magnanti, and J. B. Orlin. Network Flows: Theory, Algorithms, and Applications. Prentice Hall, Englewood Cliffs, NJ, 1993. Google ScholarDigital Library
- C. H. Brooks and N. Montanez. Improved annotation of the blogosphere via autotagging and hierarchical clustering. In Proc. of the 15th international conference on World Wide Web, pages 625--632, New York, NY, USA, 2006. ACM. Google ScholarDigital Library
- P. Cimiano, A. Hotho, and S. Staab. Learning concept hierarchies from text corpora using formal concept analysis. J. Artif. Intell. Res. (JAIR), 24:305--339, 2005. Google ScholarDigital Library
- S. A. Golder and B. A. Huberman. Usage patterns of collaborative tagging systems. J. Inf. Sci., 32(2):198--208, April 2006. Google ScholarDigital Library
- M. A. Hearst. Automatic acquisition of hyponyms from large text corpora. In Proc. of ACL-92, pages 539--545, Morristown, NJ, USA, 1992. Google ScholarDigital Library
- P. Heymann and H. Garcia-Molina. Collaborative creation of communal hierarchical taxonomies in social tagging systems. Technical Report 2006-10, Stanford University, Stanford, CA, USA, April 2006.Google Scholar
- C. Kemp, A. Perfors, and J. B. Tenenbaum. Learning domain structures. In Proc. of the 26th Annual Conference of the Cognitive Science Society, 2005.Google Scholar
- K. Lerman. Social information processing in news aggregation. IEEE Internet Computing: special issue on Social Search, 11(6):16--28, November 2007. Google ScholarDigital Library
- A. Maedche and S. Staab. Measuring similarity between ontologies. In EKAW, pages 251--263, 2002. Google ScholarDigital Library
- B. Markines, L. Stoilova, and F. Menczer. Bookmark hierarchies and collaborative recommendation. In Proc. of AAAI, 2006. Google ScholarDigital Library
- C. Marlow, M. Naaman, D. Boyd, and M. Davis. Ht06, tagging paper, taxonomy, flickr, academic article, to read. In HYPERTEXT '06: Proceedings of the seventeenth conference on Hypertext and hypermedia, pages 31--40, New York, NY, USA, 2006. ACM. Google ScholarDigital Library
- P. Mika. Ontologies are us: A unified model of social networks and semantics. J. Web Sem., 5(1):5--15, 2007. Google ScholarDigital Library
- M. Pasca. Acquisition of categorized named entities for web search. In Proc. of the 13rd ACM international conference on Information and knowledge management, pages 137--145, New York, NY, USA, 2004. Google ScholarDigital Library
- A. Plangprasopchok and K. Lerman. Exploiting social annotation for automatic resource discovery. In Proc. of AAAI workshop on Information Integration, 2007.Google Scholar
- M. Sanderson and W. B. Croft. Deriving concept hierarchies from text. In SIGIR, pages 206--213, 1999. Google ScholarDigital Library
- P. Schmitz. Inducing ontology from flickr tags. In Proc. of the Collaborative Web Tagging Workshop (WWW ÇS06), May 2006.Google Scholar
- B. Shneiderman. Computer science: Science 2.0. Science, 319(5868):1349--1350, March 2008.Google ScholarCross Ref
- R. Snow, D. Jurafsky, and A. Y. Ng. Semantic taxonomy induction from heterogenous evidence. In Proc. of ACL-06, pages 801--808, Morristown, NJ, USA, 2006. Association for Computational Linguistics. Google ScholarDigital Library
- L. Steels and E. Tisselli. Social tagging in community memories. In Proc. of AAAI symposium on Social Information Processing. AAAI, 2008.Google Scholar
- O. Udrea, L. Getoor, and R. J. Miller. Leveraging data and structure in ontology integration. In SIGMOD Conference, pages 449--460, 2007. Google ScholarDigital Library
- M. Zhou, S. Bao, X.Wu, and Y. Yu. An unsupervised model for exploring hierarchical semantics from social annotations. In ISWC/ASWC, pages 680--693, 2007. Google ScholarDigital Library
Index Terms
- Constructing folksonomies from user-specified relations on flickr
Recommendations
Growing a tree in the forest: constructing folksonomies by integrating structured metadata
KDD '10: Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data miningMany social Web sites allow users to annotate the content with descriptive metadata, such as tags, and more recently to organize content hierarchically. These types of structured metadata provide valuable evidence for learning how a community organizes ...
An Approach to Building High-Quality Tag Hierarchies from Crowdsourced Taxonomic Tag Pairs
SocInfo 2013: Proceedings of the 5th International Conference on Social Informatics - Volume 8238Building taxonomies for web content is costly. An alternative is to allow users to create folksonomies, collective social classifications. However, folksonomies lack structure and their use for searching and browsing is limited. Current approaches for ...
Structuring folksonomies with implicit tag relations
HT '12: Proceedings of the 23rd ACM conference on Hypertext and social mediaTagging systems allow users to assign arbitrary text labels (i.e., tags) to various types of resources, such as photos or web pages, to facilitate future retrieval and selective sharing of contents. The resulting system of classification is referred to ...
Comments