Abstract
The design of data collection experiments involving human participants is a common task in Referring Expression Generation (REG) and related fields. Many (or most) REG data collection tasks are implemented by making use of a human–computer (e.g., web-based) communicative setting, in which participants do not have any particular addressee in mind and do not receive any feedback regarding the appropriateness (e.g., uniqueness) of the descriptions that they produce. Others, at a possibly higher cost, make use of participant pairs engaged in some form of dialogue in which hearers may provide feedback allowing speakers to rephrase ambiguous or otherwise ill-formed descriptions. Leaving the issue of cost aside, however, it remains unclear whether the two methods elicit similar referring expressions for the purpose of REG research. To shed light on this issue, this paper presents a REG corpus built under three experimental conditions: a standard human–computer (or web-based) setting in which no feedback is available to the speaker, and two settings in which feedback regarding the appropriateness of the description may be provided either by an automated parsing tool or by a second participant at the receiving end of the communication. The corpus contains fully annotated descriptions in two domains—simple geometric objects and realistic human face images—and it is provided as a resource for the training and testing of REG algorithms in these communicative settings.
Similar content being viewed by others
Notes
When there is no risk of confusion, we will hereby use the terms ‘speaker’ and ‘hearer’ for both spoken and written communication.
However, in certain conditions participants were allowed to refer to screen positions as in, e.g., ‘the green chair on the top row’.
Stimulus images courtesy of Michael J. Tarr, Center for the Neural Basis of Cognition and Department of Psychology, Carnegie Mellon Univ. Funding provided by NSF award 0339122.
Images 03 to 13 in Dale and Viethen (2009).
Images 01 to 11 in Paraboni et al. (2017)
References
Clarke, A. D. F., Elsner, M., & Rohde, H. (2013). Where’s Wally: The influence of visual salience on referring expression generation. Frontiers in Psychology, 4, 329.
Dale, R., & Reiter, E. (1995). Computational interpretations of the Gricean maxims in the generation of referring expressions. Cognitive Science, 19, 233–263.
Dale, R., & Viethen, J. (2009). Referring expression generation through attribute-based heuristics. In 12th European workshop on natural language generation (pp. 58–65). Athens, Greece: Association for Computational Linguistics.
dos Santos Silva, D., & Paraboni, I. (2015). Generating spatial referring expressions in interactive 3D worlds. Spatial Cognition & Computation, 15(03), 186–225.
Ferreira, T. C., Moussallem, D., Kádár, Á., Wubben, S., & Krahmer, E. (2018). NeuralREG: An end-to-end approach to referring expression generation. In 56th annual meeting of the association for computational linguistics (1959–1969). Melbourne, Australia: Association for Computational Linguistics.
Ferreira, T. C., & Paraboni, I. (2014). Classification-based referring expression generation. In Computational linguistics and intelligent text processing (CICLing-2014), Lecture Notes in Computer Science 8403 (pp. 481–491). Kathmandu, Nepal: Springer.
Gargett, A., Garoufi, K., Koller, A., & Striegnitz, K. (2010). The GIVE-2 corpus of giving instructions in virtual environments. In Proceedings of LREC-2010. Valletta, Malta: ELRA.
Gatt, A., Belz, A., & Kow, E. (2008). The TUNA challenge 2008: Overview and evaluation results. In Fifth international natural language generation conference (INLG-2008) (pp. 198–206). Salt Fork, Ohio, USA: The Association for Computer Linguistics.
Gatt, A., & Krahmer, E. (2018). Survey of the state of the art in natural language generation: Core tasks, applications and evaluation. Journal of Artificial Intelligence Research, 61, 65–170.
Gatt, A., Sluis, I., & van der Deemter, K. van. (2007). Evaluating algorithms for the generation of referring expressions using a balanced corpus. In Proceedings of ENLG-07. Schloss Dagstuhl, Germany: Association for Computational Linguistics.
Goldberg, L. R. (1990). An alternative description of personality: The Big-Five factor structure. Journal of Personality and Social Psychology, 59, 1216–1229.
Hervás, R., Arroyo, J., Francisco, V., Peinado, F., & Gervás, P. (2015), 12. Influence of personal choices on lexical variability in referring expressions. Natural Language Engineering, 1–34 (in press).
Howcroft, D. M., Vogels, J., Demberg, V. (2017). G-TUNA: A corpus of referring expressions in German including duration information. In Proceedings of INLG-2017 (pp. 149–153). Santiago de Compostela, Spain: Association for Computational Linguistics.
Kazemzadeh, S., Ordonez, V., Matten, M., & Berg, T. (2014). ReferItGame: Referring to objects in photographs of natural scenes. In Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP) (pp. 787–798). Doha, Qatar: Association for Computational Linguistics.
Koolen, R., Gatt, A., Goudbeek, M., & Krahmer, E. (2011). Factors causing overspecification in definite descriptions. Journal of Pragmatics, 43, 3231–3250.
Koolen, R., & Krahmer, E. (2010). The D-TUNA corpus: A Dutch dataset for the evaluation of referring expression generation algorithms. Valletta: ELRA.
Krahmer, E., & van Deemter, K. (2012). Computational generation of referring expressions: A survey. Computational Linguistics, 38(1), 173–218.
Krahmer, E., van Erk, S., & Verleg, A. (2003). Graph-based generation of referring expressions. Computational Linguistics, 29(1), 53–72.
Paraboni, I., Galindo, M., & Iacovelli, D. (2017). Stars2: A corpus of object descriptions in a visual domain. Language Resources and Evaluation, 51(2), 439–462.
Paraboni, I., Monteiro, D. S., & Lan, A. G. J. (2017). Personality-dependent referring expression generation. In Text, Speech and Dialogue (TSD-2017) Lecture Notes in Artificial Intelligence. vol. 10415 (pp. 20–28). Prague, Czech Republic: Springer-Verlag.
Paraboni, I., & van Deemter, K. (1999). Issues for the generation of document deixis. In Procs. of workshop on deixis, demonstration and deictic belief in multimedia contexts, in association with the 11th Eeuropean summers school in logic, language and information (esslli99) (pp. 44–48).
Ramos, R. M. S., Neto, G. B. S., Silva, B. B. C., Monteiro, D. S., Paraboni, I., & Dias, R. F. S. (2018). Building a corpus for personality-dependent natural language understanding and generation. In 11th international conference on language resources and evaluation (LREC-2018) (pp. 1138–1145). Miyazaki, Japan: ELRA.
Righi, G., Peissig, J. J., & Tarr, M. J. (2012). Recognizing disguised faces. Visual Cognition, 20(2), 143–169.
Silva, B. B. C., & Paraboni, I. (2018). Learning personality traits from Facebook text. IEEE Latin America Transactions, 16(4), 1256–1262.
Tarenskeen, S., Broersma, M., & Geurts, B. (2014). Referential overspecification: Colour is not that special. In Refnet workshop on psychological and computational models of reference comprehension and production. Edinburgh, UK.
Teixeira, C. V. M., Paraboni, I., Silva, A. S. R., & da Yamasaki, A. K. (2014). Generating relational descriptions involving mutual disambiguation. In Computational linguistics and intelligent text processing (CICLing-2014), Lecture Notes in Computer Science 8403 (pp. 492–502). Kathmandu, Nepal: Springer.
van Deemter, K. (2002). Generating referring expressions: Boolean extensions of the incremental algorithm. Computational Linguistics, 28(1), 37–52.
van Deemter, K. (2016). Computational models of referring. A study in cognitive science. Cambridge: MIT Press.
van Gompel, R., Gatt, A., Krahmer, E., & Deemter, K. V. (2014). Testing computational models of reference generation as models of human language production: The case of size contrast. In Refnet workshop on psychological and computational models of reference comprehension and production. Edinburgh, UK.
Viethen, J., & Dale, R. (2010). Speaker-dependent variation in content selection for referring expression generation. In Proceedings of the Australasian language technology association workshop 2010 (pp. 81–89). Melbourne, Australia: Australasian Language Technology Association.
Viethen, J., Mitchell, M., & Krahmer, E. (2013). Graphs and spatial relations in the generation of referring expressions. In 14th European workshop on natural language generation (pp. 72–81). Sofia, Bulgaria: Association for Computational Linguistics.
Acknowledgements
This work has been supported by Grant # 2016/14223-0, São Paulo Research Foundation (FAPESP).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
da Silva Rocha, D., Paraboni, I. Building referring expression corpora with and without feedback. Lang Resources & Evaluation 54, 875–891 (2020). https://doi.org/10.1007/s10579-020-09497-2
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10579-020-09497-2