Abstract
In natural language processing (NLP), Brazilian Portuguese (and Portuguese in general) still suffers from a lack of good-quality base tools (e.g. parsers) and resources (e.g. annotated corpora). Corpora annotated with semantic information are particularly scarce and very costly to produce manually. To help remedy this situation, this paper presents an automatic hyponymy identification method for Brazilian Portuguese texts. The proposed method uses lexical and syntactic data alongside common sense information obtained from the Brazilian Open Mind Common Sense project (OMCS-Br). The results obtained so far are consistent with previous work and suggest directions for further research.
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Taba, L.S., de Medeiros Caseli, H. (2012). Automatic Hyponymy Identification from Brazilian Portuguese Texts. In: Caseli, H., Villavicencio, A., Teixeira, A., Perdigão, F. (eds) Computational Processing of the Portuguese Language. PROPOR 2012. Lecture Notes in Computer Science, vol 7243. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28885-2_21
DOI: https://doi.org/10.1007/978-3-642-28885-2_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-28884-5
Online ISBN: 978-3-642-28885-2
eBook Packages: Computer Science (R0)