Abstract
The goal of this paper is to provide a visual structure to view the temporal and thematic content of English news documents. This paper combines both information extraction techniques and visualization techniques to visualize the documents content. Topical information present in the documents are extracted by applying Booksteins model and by calculating the various measures associated with the model. Temporal information is extracted as date information from the documents. Later the extracted information is converted into a signal and both discrete and continuous wavelet transfor-mations are applied over the signal. The content of the document is visualized at varying levels of detail by multi resolution analysis techniques available in wavelets. This gives a wavelet based visualization model which enables us to view the spread of topics across a document, the portions of the document that are most involved in topic description, and the contribution of documents in the corpus from the temporal perspective.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Miller, N.E., Wong, P.C., Brewste, M., Foote, H.: TOPIC ISLANDSTM - A Wavelet-Based Text Visualization System, Pacific Northwest National Laboratoryt’ tt’, IEEE (1998)
Zwa, A., Ebert, D.S., Miller, E.L.: Multiresolution Document Analysis with Wavelets November 22. In: Conference on Information and knowledge management, Rockville, MD, December 1996, pp. 50–53 (1996)
Bookstein, A., Klein, S.T., Rait, T.: Clumping properties of content bearing words-white paper
Blei, D.M., Ng, A.Y., Michael, I.: Latent dirichlet allocation. Journal of machine learning research 3, 993–1022 (2003)
Luhn, H.P.: A statistical approach to mechanized encoding and searching of literary information. IBM Journal of Research and Development 1(4), 309–317 (1957)
Chinchor, N.: MUC7 Named entity task definition (September 1997), http://www.itl.nist.gov/894.02/related_projects/muc/proceedings/ne_task.html
MUC-6 named entity task definition (May 1995), http://www.cs.nyu.edu/cs/faculty/grishman/NEtask20.book_1.html
Laurence, A.F.: Park Spectral based information retrieval –White paper
Carlberger, J., Dalianis, H., Hassel, M., Knutsson, O.: Improving Precision in Information Retrieval for Swedish using Stemming
Mala, T., Geetha, T.V.: isualization of Biological Patterns in Event Detection and Tracking Based on SRP algorithm. In: Proceedings of Ninth International conference on Information Visualization, London, England, July 6-8 2005, pp. 587–592. IEEE CS press, Los Alamitos (2005)
Katz, G., Pustejovsky, J., Schilder, F.: Annotation extracting and reasoning about time and events. In: Dagstahl seminar proceedings 05151
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Mala, T., Geetha, T.V., Kumar, S. (2006). Topical and Temporal Visualization Using Wavelets. In: Yang, Q., Webb, G. (eds) PRICAI 2006: Trends in Artificial Intelligence. PRICAI 2006. Lecture Notes in Computer Science(), vol 4099. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-36668-3_91
Download citation
DOI: https://doi.org/10.1007/978-3-540-36668-3_91
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-36667-6
Online ISBN: 978-3-540-36668-3
eBook Packages: Computer ScienceComputer Science (R0)