Skip to main content

Map Interface for a Text Data Set by Recursive Clustering

  • Chapter
Advances in Chance Discovery

Part of the book series: Studies in Computational Intelligence ((SCI,volume 423))

  • 489 Accesses

Abstract

Recently, there have been many opportunities to acquire text information as the quantity of electronic information increases. Data classification or clustering methods are widely adapted in order to acquire various information effectively from an enormous active text data set. However, ordinal clustering methods connect texts and many texts are concentrated into a single cluster so that we cannot see various information.

In this study, we propose a recursive clustering method to avoid such bias by integrating a set of texts, included in a cluster, into a single text. An interface that we can comprehend a result of clustering intuitively and can explore information is required to grasp an overview of data and to be led to a new idea. According to the experimental results, the proposed method could construct clusters that are not biased. Test subjects could find information widely by using a map visualizing clustering results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Ohsawa, Y., Nara, Y.: Decision Process Modeling across Internet and Real World by Double Helical Model of Chance-Discovery. New Generation Computing 21(2), 109–122 (2003)

    Article  MATH  Google Scholar 

  2. McCallum, A., Nigam, K.: A Comparison of Event Models for Naive Bayes Text Classification. In: Proc. of AAAI-1998 Workshop on Learning for Text Categorization, pp. 41–48 (1998)

    Google Scholar 

  3. Nigam, K., McCallum, A.K., Thrun, S., Mitchell, T.: Text Classification from Labeled and Unlabeled Documents using EM. Machine Learning 39, 103–134 (2000)

    Article  MATH  Google Scholar 

  4. Tong, S., Koller, D.: Support Vector Machine Active Learning with Applications to Text Classification. Journal of Machine Learning Research 1, 45–66 (2001)

    Google Scholar 

  5. Lodhi, H., Saunders, C., Shawe-Taylor, J., Cristianini, N., Watkins, C.: Text Classification using String Kernels. Journal of Machine Learning Research 2, 419–444 (2002)

    MATH  Google Scholar 

  6. Begelman, G., Keller, P., Smadja, F.: Automated Tag Clustering: Improving search and exploration in the tag space. In: Proc. of Collaborative Web Tagging Workshop at WWW (2006)

    Google Scholar 

  7. Kobayashi, T., Misue, K., Shizuki, B., Tanaka, J.: Information Gathering Support Interface by the Overview Presentation of Web Search Results. In: Proc. of Asia-Pacific Symposium on Information Visualisation, vol. 60, pp. 103–108 (2006)

    Google Scholar 

  8. Sprenger, T.C., Brunella, R., Gross, M.H.: H-BLOB: A Hierarchical Visual Clustering Method Using Implicit Surfaces. In: Proc. of the 11th IEEE Conference on Visualization (2000)

    Google Scholar 

  9. Akhavi, M.S., Rahmati, M., Amini, N.N.: 3D Visualization of Hierarchical Clustered Web Search Results. In: Proc. of International Conference on Computer Graphics, Imaging and Visualisation, pp. 441–446 (2007)

    Google Scholar 

  10. Muhr, M., Sabol, V., Granitzer, M.: Scalable Recursive Top-Down Hierarchical Clustering Approach with implicit Model Selection for Textual Data Sets. In: IEEE Workshops on Database and Expert Systems Applications, pp. 15–19 (2010)

    Google Scholar 

  11. Nishikido, T., Sunayama, W., Nishihara, Y.: Valuable Change Detection in Keyword Map Animation. In: Gao, Y., Japkowicz, N. (eds.) AI 2009. LNCS, vol. 5549, pp. 233–236. Springer, Heidelberg (2009)

    Chapter  Google Scholar 

  12. Newman, M.E.: Fast algorithm for detecting community structure in networks. Phys. Rev. E 69 066133, 1–5 (2004)

    Google Scholar 

  13. Nohuddin, P.N.E., Christley, R., Coenen, F., Setzkorn, C., Sunayama, W.: Trend Mining and Visualisation in Social Networks. In: Proc. of the Thirty-first SGAI International Conference on Innovative Techniques and Application of Artificial Intelligence(AI 2011), pp. 269–282. Springer (2011)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Wataru Sunayama .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Sunayama, W., Hamaoka, S., Okuda, K. (2013). Map Interface for a Text Data Set by Recursive Clustering. In: Ohsawa, Y., Abe, A. (eds) Advances in Chance Discovery. Studies in Computational Intelligence, vol 423. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-30114-8_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-30114-8_5

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-30113-1

  • Online ISBN: 978-3-642-30114-8

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics