Abstract
In the text document analysis process keywords are often represented in bag-of-words or vector space model. This representation is high-dimensional and sparse. Keyword extraction is considered as core technology of all automatic processing for text materials. Keywords represent in condensed from the essential content of a document. In this paper we used keyword extraction techniques for find an index terms that contain most important information and unique identify the documents. We proposed keyword extraction based text summarization techniques helps to reduce dimensionality of the vector space model at initial level.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Salton, G., Buckley, C.: Term weighting approaches in automatic text retrieval. Cornell University, Tech. Rep. (1987)
Bracewell, D.B., Ren, F.: Multilingual Single Document Keyword Extraction For Information Retrieval. In: Proceedings of NLP-KE, pp. 517–522 (2005)
Frank, E., Paynter, G.W., Witten, I.H., Gutwin, C., Nevill-Manning, C.G.: Domain-specific keyphrase extraction. In: Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence, pp. 668–673 (1999)
Azcarraga, A.P., Yap Jr., T.N.: Comparing Keyword Extraction Techniques for WEBSOM Text Archives Singapore 117543 (65) 874-6563 dcsapa@nus.edu.sg
Chandra, M., Gupta, V., Pal, S.K.: A Statistical approach for Automatic Text Summarization by Extraction. In: 2011 International Conference on Communication Systems and Network Technologies (2011)
Wan, X., Yang, J., Xiao, J.: Towards an Iterative Reinforcement Approach for Simultaneous Document Summarization and Keyword Extraction. In: Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, Prague, Czech Republic, pp. 552–559. Association for Computational Linguistics (2007)
Zha, Y.: Generic summarization and keyphrase extraction using mutual reinforcement principle and sentence clustering. In: Proceedings of SIGIR 2002, pp. 113–120 (2002)
Rafiqul Islam, M., Rakibul Islam, M.: An Improved Keyword Extraction Method Using Graph Based Random Walk Model. In: Proceedings of 11th International Conference on Computer and Information Technology (ICCIT 2008), December 25-27 (2008)
Matsuo, Y., Ishizuka, M.: Keyword Extraction from a Single Document using Word Co-occurrence Statistical Information. In: International Journal on Artificial Intelligence Tools World Scientific Publishing Company (2003)
Andrade, M., Valencia: Automatic extraction of keywords from scientific text: application to the knowledge domain of protein families. Bioinformatics 14(7), 600–607 (1998)
Jones, S., Paynter, G.: Automatic extraction of document keyphrases for use in digital libraries: evaluation and applications. Journal of the American Society for Information Science and Technology (2002)
Matsuo, Y., Ishizuka, M.: Keyword extraction from a single document using word co-occurrence statistical information. International Journal on Artificial Intelligence Tools 13(1), 157–169 (2004)
Sun, Y.-H., He, P.-L., Chen, Z.-G.: An Improved Term Weighting Scheme For Vector Space Model. In: Proceedings of the Third International Conference on Machine Learning & Cybernatics, Shanghai, August 26-29 (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ramakrishna Murty, M., Murthy, J.V.R., Prasada Reddy, P.V.G.D., Satapathy, S.C. (2012). Statistical Approach Based Keyword Extraction Aid Dimensionality Reduction. In: Satapathy, S.C., Avadhani, P.S., Abraham, A. (eds) Proceedings of the International Conference on Information Systems Design and Intelligent Applications 2012 (INDIA 2012) held in Visakhapatnam, India, January 2012. Advances in Intelligent and Soft Computing, vol 132. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-27443-5_51
Download citation
DOI: https://doi.org/10.1007/978-3-642-27443-5_51
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-27442-8
Online ISBN: 978-3-642-27443-5
eBook Packages: EngineeringEngineering (R0)