Skip to main content

Coreference Resolution in the Assamese Language: A Pioneering Attempt

  • Conference paper
  • First Online:
Data Science and Communication (ICTDsC 2023)

Abstract

Coreference resolution is the process of identifying all expressions in a text that refer to the same entities. It is an essential task in several Natural Language Processing (NLP) tasks to interpret the text properly. This is used in many sophisticated NLP tasks like machine translation, information extraction, document summarization, question answering, etc. It is a complex problem with numerous challenges and ramifications for linguistics and computation. There is limited work in Indic languages, and it is hardly found in Assamese languages. Our effort is one of the pioneering attempts in the Assamese coreference resolution that is reported in this paper.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 219.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 279.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://www.newjobsinassam.com/gk/download-best-assamese-books-pdf/.

  2. 2.

    https://assamintro.com/mamoni-raisom-goswami-biography-in-assamese/.

  3. 3.

    https://dainikjanambhumi.co.in/.

  4. 4.

    https://www.jonakaxom.in/2019/08/assamese-short-story.html.

  5. 5.

    https://as.wikipedia.org/sports.

  6. 6.

    https://dte.assam.gov.in/portlets/study-materials.

  7. 7.

    https://as.wikipedia.org/wiki/tourism.

References

  1. Clark K, Manning CD (2016) Deep reinforcement learning for mention-ranking coreference models. In: Proceedings of the 2016 conference on empirical methods in natural language processing. Association for Computational Linguistics, Austin, Texas, pp 2256–2262

    Google Scholar 

  2. Lee K, He L, Lewis M, Moyer LZ (2017) End-to-end neural coreference resolution. In: Proceedings of the 2017 conference on empirical methods in natural language processing. Association for Computational Linguistics, Copenhagen, Denmark, pp 188–197

    Google Scholar 

  3. Mitkov R (1999) Anaphora resolution: the state of the art. Citeseer

    Google Scholar 

  4. Ng V (2010) Supervised noun phrase coreference research: the first fifteen years. In: Proceedings of the 48th annual meeting of the association for computational linguistics. Association for Computational Linguistics, Uppsala, Sweden, pp 1396–1411

    Google Scholar 

  5. Hobbs JR (1978) Resolving pronoun references. Lingua 44:339–352

    Google Scholar 

  6. Lappin S, Leass HJ (1994) An algorithm for pronominal anaphora resolution. Comput Linguist 20(4):535–561

    Google Scholar 

  7. Ge N, Hale J, Charniak E (1998) A statistical approach to anaphora resolution. In: Sixth workshop on very large corpora

    Google Scholar 

  8. Soon WM, Ng HT, Lim DCY (2001) A machine learning approach to coreference resolution of noun phrases. Comput Linguist 27(4):521–544

    Article  Google Scholar 

  9. Wiseman S, Rush AM, Shieber SM (2016) Learning global features for coreference resolution. In: Proceedings of the 2016 conference of the North American chapter of the association for computational linguistics: human language technologies. Association for Computational Linguistics, San Diego, California, pp 994–1004

    Google Scholar 

  10. Young T, Hazarika D, Poria S, Cambria E (2018) Recent trends in deep learning based natural language processing [review article]. IEEE Comput Intell Mag 13(3):55–75

    Article  Google Scholar 

  11. Carter DM (1987) Interpreting anaphors in natural language texts

    Google Scholar 

  12. Rich E, LuperFoy S (1988) An architecture for anaphora resolution. In: Second conference on applied natural language processing. Association for Computational Linguistics, Austin, Texas, USA, pp 18–24

    Google Scholar 

  13. Carbonell JG, Brown RD (1988) Anaphora resolution: a multi-strategy approach. In: Cooling Budapest 1988 volume 1: international conference on computational linguistic

    Google Scholar 

  14. Mitkov R (1994) An integrated model for anaphora resolution. In COLING 1994 volume 2: the 15th international conference on computational linguistics

    Google Scholar 

  15. Mitkov R (1996) Anaphora resolution: a combination of linguistic and statistical approaches. In: Proceedings of the discourse anaphora and anaphor resolution (DAARC’96). Lancaster, UK

    Google Scholar 

  16. Kennedy C, Boguraev B (1996) Anaphora for everyone: Pronominal anaphora resolution without a parser. In: COLING 1996 volume 1: the 16th international conference on computational linguistics

    Google Scholar 

  17. Breck B (1997) CogNIAC: high precision coreference with limited knowledge and linguistic resources. In: Operational factors in practical, robust anaphora resolution for unrestricted texts

    Google Scholar 

  18. Raghunathan K, Lee H, Rangarajan S, Chambers N, Surdeanu M, Jurafsky D, Manning C (2010) A multipass sieve for coreference resolution. In: Proceedings of the 2010 conference on empirical methods in natural language processing. Association for Computational Linguistics, Cambridge, MA, pp 492–501

    Google Scholar 

  19. Lee H, Recasens M, Chang A, Surdeanu M, Jurafsky D (2012) Joint entity and event coreference resolution across documents. In: Proceedings of the 2012 joint conference on empirical methods in natural language processing and computational natural language learning. Association for Computational Linguistics, Jeju Island, Korea, pp 489–500

    Google Scholar 

  20. Lu J, Venugopal D, Gogate V, Ng V (2016) Joint inference for event coreference resolution. In: Proceedings of COLING 2016, the 26th international conference on computational linguistics: technical papers. The COLING 2016 Organizing Committee, Osaka, Japan, pp 3264–3275

    Google Scholar 

  21. Lai TM, Bui T, Kim DS (2022) End-to-end neural coreference resolution revisited: a simple yet effective baseline. In: ICASSP 2022—2022 IEEE international conference on acoustics, speech and signal processing (ICASSP), pp 8147–8151

    Google Scholar 

  22. Gandhi N, Field A, Tsvetkov Y (2021) Improving span representation for domain adapted coreference resolution. In: Proceedings of the fourth workshop on computational models of reference, anaphora and coreference. Association for Computational Linguistics, Punta Cana, Dominican Republic, pp 121–131

    Google Scholar 

  23. Xu L, Choi JD (2020) Revealing the myth of higher order inference in coreference resolution. In: Proceedings of the 2020 conference on empirical methods in natural language processing (EMNLP). Association for Computational Linguistics, pp 8527–8533

    Google Scholar 

  24. Pradhan S, Moschitti A, Xue N, Uryupina O, Zhang Y (2012) CoNLL 2012 shared task: modeling multilingual unrestricted coreference in OntoNotes. In: Joint conference on EMNLP and CoNLL—shared task. Association for Computational Linguistics, Jeju Island, Korea, pp 1–40

    Google Scholar 

  25. Zeldes A, Zhang S (2016) When annotation schemes change rules help: a configurable approach to coreference resolution beyond OntoNotes. In: Proceedings of the workshop on coreference resolution beyond OntoNotes (CORBON 2016). Association for Computational Linguistics, San Diego, California, pp 92–101

    Google Scholar 

  26. Haghighi A, Klein D (2010) Coreference resolution in a modular, entity-centered model. In: Human language technologies: the 2010 annual conference of the North American. Association for Computational Linguistics, Los Angeles, California, pp 385–393

    Google Scholar 

  27. Ram VS, Lalitha Devi S (2017) Coreference resolution in Tamil text. In: Proceedings of the 14th international conference on natural language processing (ICON 2017). NLP Association of India, Kolkata, India, pp 392–401

    Google Scholar 

  28. Sobha LD, Rao PRK, Ram RVS, Malarkodi CS, Akilandeswari A (2011) Hybrid approach for coreference resolution. In: Proceedings of the fifteenth conference on computational natural language learning: shared task. Association for Computational Linguistics, Portland, Oregon, USA, pp 93–96

    Google Scholar 

  29. Sobha L, Patnaik BN (2000) VASISTH: an ellipsis resolution algorithm for Indian languages. In: Proceedings of the international conference on machine translation and multilingual applications in the new millennium: MT 2000. University of Exeter, UK

    Google Scholar 

  30. Lalitha Devi S, Sundar Ram V, Rao PRK (2014) A generic anaphora resolution engine for Indian languages. In: Proceedings of COLING 2014, the 25th international conference on computational linguistics: technical papers. Dublin City University and Association for Computational Linguistics, Dublin, Ireland, pp 1824–1833

    Google Scholar 

  31. Senapati A, Garain U (2013) GuiTAR-based pronominal anaphora resolution in Bengali. In: Proceedings of the 51st annual meeting of the association for computational linguistics (Volume 2: Short Papers). Association for Computational Linguistics, Sofia, Bulgaria, pp 126–130

    Google Scholar 

  32. Senapati A, Garain U (2011) Anaphora resolution system for Bengali by pronoun emitting approach. In: The proceedings of international conference on natural language processing (NLP tool contest), pp 21–26

    Google Scholar 

  33. Sikdar S, Bhattacharya P, Heese K (2021) Integrated directional gradients: feature interaction attribution for neural NLP models. In: Proceedings of the 59th annual meeting of the association for computational linguistics and the 11th international joint conference on natural language processing (Volume 1: long papers), Online. Association for Computational Linguistics, pp 865–878

    Google Scholar 

  34. Mujadia V, Gupta P, Sharma DM (2016) Coreference annotation scheme and relation types for Hindi. In: Proceedings of the tenth international conference on language resources and evaluation (LREC’16). European Language Resources Association (ELRA), Portorož, Slovenia, pp 161–168

    Google Scholar 

  35. Ram RVS, Lalitha Devi S (2012) Coreference resolution using tree CRFS. In: Computational linguistics and intelligent text processing. Springer Berlin Heidelberg, Berlin, Heidelberg, pp 285–296

    Google Scholar 

  36. Pradhan S, Ramshaw L, Marcus M, Palmer M, Weischedel R, Xue N (2011) CoNLL-2011 shared task: modeling unrestricted coreference in OntoNotes. In: Proceedings of the fifteenth conference on computational natural language learning: shared task. Association for Computational Linguistics, Portland, Oregon, USA, pp 1–27

    Google Scholar 

  37. Hingmire S, Ramrakhiyani N, Singh AK, Patil S, Palshikar G, Bhattacharyya P, Varma V (2020) Extracting message sequence charts from Hindi narrative text. In: Proceedings of the first joint workshop on narrative understanding, storylines, and events. Association for Computational Linguistics, pp 87–96

    Google Scholar 

  38. Pakray P, Neogi S, Bhaskar P, Poria S, Bandyopadhyay S, Gelbukh AF (2011) A textual entailment system using anaphora resolution. In: TAC

    Google Scholar 

  39. Chatterji S, Dhar A, Barik B, Moumita PK, Sarkar S, Basu A (2011) Anaphora resolution for Bengali, Hindi, and Tamil using random tree algorithm in Weka. In: Proceedings of the ICON-2011

    Google Scholar 

  40. Sikdar UK, Ekbal A, Saha S (2016) A generalized framework for anaphora resolution in Indian languages. Knowl-Based Syst 109(147–159):645

    Google Scholar 

  41. Sikdar UK, Ekbal A, Saha S, Uryupina O, Poesio M (2013a) Adapting a state-of-the-art anaphora resolution system for resource-poor language. In: Proceedings of the sixth international joint conference on natural language processing, pp 815–821, 651

    Google Scholar 

  42. Sikdar UK, Ekbal A, Saha S, Uryupina O, Poesio M (2013) Anaphora resolution for Bengali: an experiment with domain adaptation. Computación y Sistemas 17(2):137–146

    Google Scholar 

  43. Kummerfeld JK, Klein D (2013) Error driven analysis of challenges in coreference resolution. In: Proceedings of the 2013 conference on empirical methods in natural language processing, pp 265–277

    Google Scholar 

  44. Toutanova K, Klein D, Manning C, Singer Y (2003) Feature-rich part-of-speech tagging with a cyclic dependency network. In: Proceedings of HLT-NAACL, pp 252–259

    Google Scholar 

  45. Bharati A, Sangal R, Sharma DM, Bai L (2006) Annotating corpora guidelines for POS and chunk annotation for Indian languages. In Language Technologies Research Centre, IIIT, Hyderabad

    Google Scholar 

  46. Senapati A, Garain U (2012) Anaphora resolution in Bengali using global discourse knowledge. In: International conference of Asian language processing (IALP). Hanoi, Vietnam

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mridusmita Das .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Das, M., Senapati, A. (2024). Coreference Resolution in the Assamese Language: A Pioneering Attempt. In: Tavares, J.M.R.S., Rodrigues, J.J.P.C., Misra, D., Bhattacherjee, D. (eds) Data Science and Communication. ICTDsC 2023. Studies in Autonomic, Data-driven and Industrial Computing. Springer, Singapore. https://doi.org/10.1007/978-981-99-5435-3_41

Download citation

Publish with us

Policies and ethics