Skip to main content

Incorporating Contextual Information in Prediction Based Word Embedding Models

  • Conference paper
  • First Online:
Rising Threats in Expert Applications and Solutions

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 434))

  • 466 Accesses

Abstract

A clustering-based language model is proposed for analyzing the performance of context sensitive word embedding models that uses contextual information. The construction of text readability and prediction models faces several shortcomings due to the complex nature and structure of language. The language structure is complicated all the more by words having vastly different meanings and interpretation based on the context in which it is used. This paper aims to resolve this issue by first clustering the sentences based on similarity and then performing word embedding separately on each of these clusters to obtain an enhanced outcome. This would serve to embed the same word separately in varying context as an improvement over the standard existing word embedding models provided by various prediction based models. Comparing the two approaches, our results have showed that clustering improves the performance of the model and discriminate the contextual information based on sense which leads to more accurate representation in vector form.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 189.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 249.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Y. Sun, N. Rao, W. Ding, A Simple approach to learn polysemous word embeddings (2017)

    Google Scholar 

  2. M. Cha, Y. Gwon, H.T. Kung, Language modeling by clustering with word embeddings for text readability assessment (2017)

    Google Scholar 

  3. Z. Yang, W. Chen, F. Wang, B. Xu, Multi-sense based neural machine translation, in 2017 International Joint Conference on Neural Networks (IJCNN)

    Google Scholar 

  4. Mikolov et al., Efficient Estimation of Word Representations in Vector Space (2013)

    Google Scholar 

  5. Mikolov et al., Distributed Representations of Words and Phrases and their Compositionality (2013)

    Google Scholar 

  6. T. Zhang, R. Ramakrishnan, M. Livny, BIRCH: an efficient data clustering method for very large databases (1996)

    Google Scholar 

  7. J.E. Alvarez, A review of word embedding and document similarity algorithms applied to academic text (2017)

    Google Scholar 

  8. P. Bojanowski, E. Grave, A. Joulin, T. Mikolov, Enriching word vectors with subword information. Trans. Assoc. Comput. Linguist. 5, 135–146 (2017)

    Article  Google Scholar 

  9. W. Zhu, X. Jin, J. Ni, B. Wei, Z. Lu, Improve word embedding using both writing and pronunciation. PLoS ONE 13(12), e0208785 (2018). https://doi.org/10.1371/journal.pone.0208785

    Article  Google Scholar 

  10. Z.-L. Ye, H.-X. Zhao, Syntactic word embedding based on dependency syntax and polysemous analysis. Front. Inf. Technol. Electron. Eng. 19, 524–535 (2018). https://doi.org/10.1631/FITEE.1601846

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to K. Vimal Kumar .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Vimal Kumar, K., Ahuja, S., Choudary, S., Parwekar, P. (2022). Incorporating Contextual Information in Prediction Based Word Embedding Models. In: Rathore, V.S., Sharma, S.C., Tavares, J.M.R., Moreira, C., Surendiran, B. (eds) Rising Threats in Expert Applications and Solutions. Lecture Notes in Networks and Systems, vol 434. Springer, Singapore. https://doi.org/10.1007/978-981-19-1122-4_37

Download citation

Publish with us

Policies and ethics