Automatic Hyponymy Identification from Brazilian Portuguese Texts

  • Conference paper
Computational Processing of the Portuguese Language (PROPOR 2012)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 7243)

Abstract

In natural language processing (NLP), Brazilian Portuguese (and Portuguese in general) still suffers from a lack of good-quality base tools (e.g., parsers) and resources (e.g., annotated corpora). Corpora annotated with semantic information are particularly scarce and very costly to produce manually. To help remedy this situation, this paper presents an automatic hyponymy identification method for Brazilian Portuguese texts. The proposed method uses lexical and syntactic data alongside common sense information obtained from the Brazilian Open Mind Common Sense project (OMCS-Br). The results obtained so far are compatible with previous work and encourage further research in other directions.

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Taba, L.S., de Medeiros Caseli, H. (2012). Automatic Hyponymy Identification from Brazilian Portuguese Texts. In: Caseli, H., Villavicencio, A., Teixeira, A., Perdigão, F. (eds) Computational Processing of the Portuguese Language. PROPOR 2012. Lecture Notes in Computer Science, vol 7243. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28885-2_21

  • DOI: https://doi.org/10.1007/978-3-642-28885-2_21

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-28884-5

  • Online ISBN: 978-3-642-28885-2

  • eBook Packages: Computer Science, Computer Science (R0)
