Automatic Hyponymy Identification from Brazilian Portuguese Texts

Taba, Leonardo Sameshima; de Medeiros Caseli, Helena

doi:10.1007/978-3-642-28885-2_21

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 7243)

Included in the following conference series:

International Conference on Computational Processing of the Portuguese Language

Abstract

In natural language processing (NLP), Brazilian Portuguese (and Portuguese in general) still lacks good-quality base tools (e.g., parsers) and resources (e.g., annotated corpora). Corpora annotated with semantic information are particularly scarce and very costly to produce manually. To help remedy this situation, this paper presents an automatic hyponymy identification method for Brazilian Portuguese texts. The proposed method uses lexical and syntactic data alongside common sense information obtained from the Brazilian Open Mind Common Sense project (OMCS-Br). The results obtained so far are compatible with previous work and point to directions for further research.

Author information

Authors and Affiliations

Department of Computer Science, LaLiC/NILC, Federal University of São Carlos (UFSCar), São Carlos, SP, Brazil
Leonardo Sameshima Taba & Helena de Medeiros Caseli

Editor information

Editors and Affiliations

UFSCAR, Rod. Washington Luís, 13565-905, São Carlos, Brazil
Helena Caseli
UFRGS, Av. Bento Gonçalves, 9500, 91501-970, Porto Alegre, Brazil
Aline Villavicencio
DETI/IEETA, Universidade de Aveiro, Campus Universitário de Santiago, 3810-193, Aveiro, Portugal
António Teixeira
UC/ IT, DEEC, Universidade de Coimbra, Polo 2, 3030-290, Coimbra, Portugal
Fernando Perdigão

About this paper

Cite this paper

Taba, L.S., de Medeiros Caseli, H. (2012). Automatic Hyponymy Identification from Brazilian Portuguese Texts. In: Caseli, H., Villavicencio, A., Teixeira, A., Perdigão, F. (eds) Computational Processing of the Portuguese Language. PROPOR 2012. Lecture Notes in Computer Science, vol 7243. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28885-2_21

DOI: https://doi.org/10.1007/978-3-642-28885-2_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-28884-5
Online ISBN: 978-3-642-28885-2
eBook Packages: Computer Science, Computer Science (R0)
