Map Interface for a Text Data Set by Recursive Clustering

Sunayama, Wataru; Hamaoka, Shuhei; Okuda, Kiyoshi

doi:10.1007/978-3-642-30114-8_5

Part of the book series: Studies in Computational Intelligence (SCI, volume 423)

Abstract

Recently, as the quantity of electronic information increases, there are many opportunities to acquire text information. Data classification and clustering methods are widely adopted to acquire diverse information effectively from an enormous, active text data set. However, ordinary clustering methods link texts together, so many texts become concentrated in a single cluster and the variety of information in the data is hidden.

In this study, we propose a recursive clustering method that avoids such bias by integrating the set of texts included in a cluster into a single text. To grasp an overview of the data and to be led to new ideas, an interface is also required through which the clustering result can be understood intuitively and information can be explored. According to the experimental results, the proposed method constructed clusters that are not biased, and test subjects could find a wide range of information by using a map that visualizes the clustering results.
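The idea of integrating the texts of a cluster into a single text and then clustering again can be illustrated with a small sketch. This is not the authors' implementation, which the abstract does not specify. It assumes texts are represented as word sets, uses Jaccard similarity, and merges the most similar pair until no pair reaches a threshold. The names `jaccard`, `recluster_with_merging`, and `threshold` are hypothetical.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard similarity between two word sets (assumed measure)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def recluster_with_merging(texts, threshold=0.3):
    """Repeatedly merge the most similar pair into a single text.

    Each cluster is (member indices, word set). After a merge, the
    cluster is treated as one integrated text, so its similarity to
    the remaining texts is recomputed from the combined word set.
    """
    clusters = [([i], set(t.lower().split())) for i, t in enumerate(texts)]
    while len(clusters) > 1:
        # Find the most similar pair among the current (merged) texts.
        i, j = max(
            combinations(range(len(clusters)), 2),
            key=lambda p: jaccard(clusters[p[0]][1], clusters[p[1]][1]),
        )
        if jaccard(clusters[i][1], clusters[j][1]) < threshold:
            break  # no remaining pair is similar enough
        merged = (clusters[i][0] + clusters[j][0],
                  clusters[i][1] | clusters[j][1])
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
    return sorted(sorted(ids) for ids, _ in clusters)


if __name__ == "__main__":
    sample = [
        "apple banana fruit",
        "banana fruit juice",
        "car engine wheel",
        "engine wheel tire",
        "quantum physics",
    ]
    print(recluster_with_merging(sample))
```

Because a merged cluster's word set grows, its Jaccard similarity to an unrelated text tends to fall, which is one plausible way that integration could keep a single cluster from absorbing everything. Whether the chapter's method behaves this way is an assumption of this sketch.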

Author information

Authors and Affiliations

Graduate School of Information Sciences, Hiroshima City University, 3-4-1 Ozuka-Higashi, Asa-Minami-Ku, Hiroshima, 731-3194, Japan
Wataru Sunayama, Shuhei Hamaoka & Kiyoshi Okuda

Corresponding author

Correspondence to Wataru Sunayama.

Editor information

Editors and Affiliations

The University of Tokyo, Hongo 7-3-1, Tokyo, 113-8656, Japan
Yukio Ohsawa
Faculty of Letters, Chiba University, Chiba, Japan
Akinori Abe

About this chapter

Cite this chapter

Sunayama, W., Hamaoka, S., Okuda, K. (2013). Map Interface for a Text Data Set by Recursive Clustering. In: Ohsawa, Y., Abe, A. (eds) Advances in Chance Discovery. Studies in Computational Intelligence, vol 423. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-30114-8_5

DOI: https://doi.org/10.1007/978-3-642-30114-8_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-30113-1
Online ISBN: 978-3-642-30114-8
eBook Packages: Engineering, Engineering (R0)
