Statistical Approach Based Keyword Extraction Aid Dimensionality Reduction

Ramakrishna Murty, M.; Murthy, J. V. R.; Prasada Reddy, P. V. G. D.; Satapathy, Suresh Chandra

doi:10.1007/978-3-642-27443-5_51

M. Ramakrishna Murty⁵,
J. V. R. Murthy⁶,
P. V. G. D. Prasada Reddy⁷ &
…
Suresh Chandra Satapathy⁸

Part of the book series: Advances in Intelligent and Soft Computing ((AINSC,volume 132))

1220 Accesses
1 Citations

Abstract

In the text document analysis process keywords are often represented in bag-of-words or vector space model. This representation is high-dimensional and sparse. Keyword extraction is considered as core technology of all automatic processing for text materials. Keywords represent in condensed from the essential content of a document. In this paper we used keyword extraction techniques for find an index terms that contain most important information and unique identify the documents. We proposed keyword extraction based text summarization techniques helps to reduce dimensionality of the vector space model at initial level.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 259.00; Price excludes VAT (USA)

Softcover Book: USD 329.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Salton, G., Buckley, C.: Term weighting approaches in automatic text retrieval. Cornell University, Tech. Rep. (1987)
Google Scholar
Bracewell, D.B., Ren, F.: Multilingual Single Document Keyword Extraction For Information Retrieval. In: Proceedings of NLP-KE, pp. 517–522 (2005)
Google Scholar
Frank, E., Paynter, G.W., Witten, I.H., Gutwin, C., Nevill-Manning, C.G.: Domain-specific keyphrase extraction. In: Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence, pp. 668–673 (1999)
Google Scholar
Azcarraga, A.P., Yap Jr., T.N.: Comparing Keyword Extraction Techniques for WEBSOM Text Archives Singapore 117543 (65) 874-6563 dcsapa@nus.edu.sg
Google Scholar
Chandra, M., Gupta, V., Pal, S.K.: A Statistical approach for Automatic Text Summarization by Extraction. In: 2011 International Conference on Communication Systems and Network Technologies (2011)
Google Scholar
Wan, X., Yang, J., Xiao, J.: Towards an Iterative Reinforcement Approach for Simultaneous Document Summarization and Keyword Extraction. In: Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, Prague, Czech Republic, pp. 552–559. Association for Computational Linguistics (2007)
Google Scholar
Zha, Y.: Generic summarization and keyphrase extraction using mutual reinforcement principle and sentence clustering. In: Proceedings of SIGIR 2002, pp. 113–120 (2002)
Google Scholar
Rafiqul Islam, M., Rakibul Islam, M.: An Improved Keyword Extraction Method Using Graph Based Random Walk Model. In: Proceedings of 11th International Conference on Computer and Information Technology (ICCIT 2008), December 25-27 (2008)
Google Scholar
Matsuo, Y., Ishizuka, M.: Keyword Extraction from a Single Document using Word Co-occurrence Statistical Information. In: International Journal on Artificial Intelligence Tools World Scientific Publishing Company (2003)
Google Scholar
Andrade, M., Valencia: Automatic extraction of keywords from scientific text: application to the knowledge domain of protein families. Bioinformatics 14(7), 600–607 (1998)
Article Google Scholar
Jones, S., Paynter, G.: Automatic extraction of document keyphrases for use in digital libraries: evaluation and applications. Journal of the American Society for Information Science and Technology (2002)
Google Scholar
Matsuo, Y., Ishizuka, M.: Keyword extraction from a single document using word co-occurrence statistical information. International Journal on Artificial Intelligence Tools 13(1), 157–169 (2004)
Article Google Scholar
Sun, Y.-H., He, P.-L., Chen, Z.-G.: An Improved Term Weighting Scheme For Vector Space Model. In: Proceedings of the Third International Conference on Machine Learning & Cybernatics, Shanghai, August 26-29 (2004)
Google Scholar

Download references

Author information

Authors and Affiliations

Dept of CSE, GMR Institute of Technology, Rajam, Srikakulam(Dist), A.P., India
M. Ramakrishna Murty
Dept of CSE, JNTU, Kakinada, A.P., India
J. V. R. Murthy
Dept of CS&SE, A.U, Visakhapatnam, A.P., India
P. V. G. D. Prasada Reddy
Dept of CSE, ANITS, Visakhapatna, A.P., India
Suresh Chandra Satapathy

Authors

M. Ramakrishna Murty
View author publications
You can also search for this author in PubMed Google Scholar
J. V. R. Murthy
View author publications
You can also search for this author in PubMed Google Scholar
P. V. G. D. Prasada Reddy
View author publications
You can also search for this author in PubMed Google Scholar
Suresh Chandra Satapathy
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dept of Computer Science and Engineering ANITS, Andhra University, Sangivalasa, 530003, Vishakapatnam, India
Suresh Chandra Satapathy
College of Engineering Dept. of CS&SE ANITS, Andhra University, Sangivalasa, 530003, Vishakapatnam, India
P. S. Avadhani
Machine Intelligence Research Labs, Auburn, WA, USA
Ajith Abraham

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ramakrishna Murty, M., Murthy, J.V.R., Prasada Reddy, P.V.G.D., Satapathy, S.C. (2012). Statistical Approach Based Keyword Extraction Aid Dimensionality Reduction. In: Satapathy, S.C., Avadhani, P.S., Abraham, A. (eds) Proceedings of the International Conference on Information Systems Design and Intelligent Applications 2012 (INDIA 2012) held in Visakhapatnam, India, January 2012. Advances in Intelligent and Soft Computing, vol 132. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-27443-5_51

Download citation

DOI: https://doi.org/10.1007/978-3-642-27443-5_51
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-27442-8
Online ISBN: 978-3-642-27443-5
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics