Abstract
Opportunities to acquire text information have grown as the quantity of electronic information increases. Data classification and clustering methods are widely adopted to acquire varied information effectively from an enormous active text data set. However, ordinary clustering methods link texts to one another, so many texts become concentrated in a single cluster and the variety of information in the data is hidden.
In this study, we propose a recursive clustering method that avoids such bias by integrating the set of texts in each cluster into a single text. Grasping an overview of the data and being led to new ideas also requires an interface through which clustering results can be understood intuitively and information can be explored. In our experiments, the proposed method constructed clusters that were not biased, and test subjects could find a wide range of information by using a map that visualizes the clustering results.
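The abstract does not give the algorithm's details, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes bag-of-words vectors, cosine similarity, a fixed similarity threshold, and a rule that each cluster may merge at most once per round; all of these, along with the names `cosine`, `recursive_cluster`, `threshold`, and `max_rounds`, are hypothetical choices made for illustration. The step it tries to show is the integration: after each round, the texts in a merged cluster are combined into a single bag of words, and the next round compares those integrated texts rather than the individual members.

```python
from collections import Counter
from math import sqrt


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words vectors."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def recursive_cluster(texts, threshold=0.3, max_rounds=10):
    """Return clusters as lists of indices into `texts`.

    Assumed scheme (not taken from the chapter): each round pairs up the
    most similar clusters, then integrates every pair into one bag of
    words that stands in for the whole cluster in the next round.
    """
    clusters = [([i], Counter(t.lower().split())) for i, t in enumerate(texts)]
    for _ in range(max_rounds):
        # Score all pairs of current clusters, most similar first.
        pairs = sorted(
            ((cosine(clusters[i][1], clusters[j][1]), i, j)
             for i in range(len(clusters))
             for j in range(i + 1, len(clusters))),
            reverse=True,
        )
        used, merged = set(), []
        for sim, i, j in pairs:
            if sim < threshold:
                break  # remaining pairs are even less similar
            if i in used or j in used:
                continue  # each cluster merges at most once per round
            used.update((i, j))
            # Integrate the two clusters into a single combined text.
            merged.append((clusters[i][0] + clusters[j][0],
                           clusters[i][1] + clusters[j][1]))
        if not merged:
            break  # nothing similar enough is left to merge
        clusters = merged + [c for k, c in enumerate(clusters) if k not in used]
    return [sorted(members) for members, _ in clusters]


if __name__ == "__main__":
    docs = [
        "apple banana fruit",
        "banana fruit juice",
        "car engine wheel",
        "engine wheel road",
        "quantum physics",
    ]
    print(recursive_cluster(docs))
```

Because a merged cluster is compared as one integrated text, a new text has to resemble the cluster as a whole to join it, rather than only one of its members. Under the assumptions above, that is what limits the chaining that concentrates many texts in a single cluster.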
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Sunayama, W., Hamaoka, S., Okuda, K. (2013). Map Interface for a Text Data Set by Recursive Clustering. In: Ohsawa, Y., Abe, A. (eds) Advances in Chance Discovery. Studies in Computational Intelligence, vol 423. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-30114-8_5
DOI: https://doi.org/10.1007/978-3-642-30114-8_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-30113-1
Online ISBN: 978-3-642-30114-8
eBook Packages: Engineering, Engineering (R0)