Abstract
With the increase in the number and size of computer learner corpora in the field of Second Language Acquisition, there is a growing need to automatically analyze the language produced by learners. However, the computational tools developed for natural language processing are generally not considered as appropiate because they are designed to treat native language. This paper analyzes the reliability of two part-of-speech taggers on second language Spanish and investigates the most frequent tagger errors and the impact of learner errors in the performance of the taggers.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Granger, S.: Computer learner corpus research: current status and future prospects. In: Connor, U., Upton, T. (eds.) Applied Corpus Linguistics: A Multidimensional Perspective, pp. 123ā145. Amsterdam & Atlanta, Rodopi (2004)
Nicholls, D.: The cambridge learner corpus ā error coding and analysis for lexicography and elt. In: Wilson, A., et al. (eds.) Proceedings of the Corpus Linguistics 2003 Conference (CL 2003), Technical papers 16, Lancaster University, Archer et al. (2003)
Rooy, B.V., SchƤfer, L.: Automatic POS tagging of a learner corpus: The influence of learner error on tagger accuracy. In: Archer, D., Rayson, P., Wilson, A., McEnery, T. (eds.) Proceedings of the Corpus Linguistics 2003 Conference, CL 2003 (2003)
Thouesny, S.: Increasing the reliability of a part-of-speech tagging tool for use with learner language. In: Automatic Analysis of Learner Language, AALL 2009 (2009)
Mancera, A.M.C., MartĆnez, I.P., Canales, A.B., FernĆ”ndez, L.C., Granda, J.F.S.: Corpus para el anĆ”lisis de errores de aprendices de E/LE (CORANE). In: Sanz, A.G. (ed.) Actas del XII Congreso Internacional de ASELE: tecnologĆas de la informaciĆ³n y de las comunicaciones en la enseƱanza de la E/LE, pp. 527ā534 (2001)
Atserias, J., Casas, B., Comelles, E., GonzĆ”lez, M., PadrĆ³, L., PadrĆ³, M.: Freeling 1.3: Syntactic and semantic services in an open-source nlp library. In: Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC 2006), ELRA (2006)
Bick, E.: A constraint grammar-based parser for spanish. In: Proceedings of TIL 2006 - 4th Workshop on Information and Human Language Technology, RibeirĆ£o, Preto (2006)
Schmidt, H.: Probabilistic part-of-speech tagging using decision trees. In: Proceedings of the International Conference on New Methods in Language Processing (1994)
DĆaz-Negrillo, A., Meurers, D., Valera, S., Wunsch, H.: Towards interlanguage POS annotation for effective learner corpora in SLA and FLT. Language Forum 36(1ā2) (2010)
Granger, S.: Error-tagged learner corpora and CALL: A promising synergy. CALICO JournalĀ 20(3), 465ā480 (2003)
Aarts, J., Granger, S.: Tag sequences in learner corpora: a key to interlanguage grammar and discourse. In: Learner English on Computer, pp. 132ā141. Longman, Redwood City (1998)
Tono, Y.: A corpus-based analysis of interlanguage development: analysing POS tag sequences of EFL learner corpora. In: Practical Applications in Language Corpora (1999)
Heift, T., Schulze, M.: Errors and Intelligence in Computer-Assisted Language Learning: Parsers and Pedagogues. Routledge Studies in Computer Assisted Language Learning (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
Ā© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Valverde IbaƱez, M.P. (2011). An Evaluation of Part of Speech Tagging on Written Second Language Spanish. In: Gelbukh, A.F. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2011. Lecture Notes in Computer Science, vol 6608. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19400-9_17
Download citation
DOI: https://doi.org/10.1007/978-3-642-19400-9_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19399-6
Online ISBN: 978-3-642-19400-9
eBook Packages: Computer ScienceComputer Science (R0)