This shoe, that tiger: Semantic properties reflecting manual affordances of the referent modulate demonstrative use

Roberta Rocca; Kristian Tylén; Mikkel Wallentin

doi:10.1371/journal.pone.0210333

Abstract

Demonstrative reference is central to human communication. But what influences our choice of demonstrative forms such as “this” and “that” in discourse? Previous literature has mapped the use of such “proximal” and “distal” demonstratives onto spatial properties of referents, such as their distance from the speaker. We investigated whether object semantics, and specifically functional properties of referents, also influence speakers’ choices of either demonstrative form. Over two experiments, we presented English, Danish and Italian speakers with words denoting animate and inanimate objects, differing in size and harmfulness, and asked them to match them with a proximal or a distal demonstrative. Objects that offer more affordances for manipulation (smaller and harmless) elicited significantly more proximal demonstratives. These effects were stronger for inanimate referents, in line with the predictions of sensory-functional views on object semantics. These results suggest that demonstrative use may be partly grounded in manual affordances, and hint at the possibility of using demonstratives as a proxy to investigate the organization of semantic knowledge.

Citation: Rocca R, Tylén K, Wallentin M (2019) This shoe, that tiger: Semantic properties reflecting manual affordances of the referent modulate demonstrative use. PLoS ONE 14(1): e0210333. https://doi.org/10.1371/journal.pone.0210333

Editor: Søren Wichmann, Leiden University, NETHERLANDS

Received: July 11, 2018; Accepted: December 20, 2018; Published: January 7, 2019

Copyright: © 2019 Rocca et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All the data and code are publicly available on the Open Science Framework at osf.io/gnh5s.

Funding: RR is funded by the DCOMM project (Deictic Communication - www.dcomm.eu). This project has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie Actions grant agreement No 676063. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Introduction

Spatial demonstratives: Function and deictic nature

Spatial demonstratives are referencing expressions present in all languages [1]. They are among the first items to be acquired in early development [1–6]. Moreover, demonstrative determiners (e.g. this, that in English) and adverbials (here, there) are by far the most prominent items in the lexicon of children between 1 and 2 years of age, with that being the most frequent word for this age group in the CHILDES database [7]. Alongside their role in language development, demonstratives are also among the most frequent words in adults’ lexicon [8]. While not explicitly encoding any specific information about the referent, words like this and that are used to establish a joint focus of attention on concrete objects in the physical space (exophoric usage), or on items in discourse space (endophoric usage) [1, 9–12]. The ability to coordinate attention on a referent is a central building block of human social interaction and enables mutual engagement in shared practices [13]. This pivotal function in communication grants demonstratives a special status among the variety of referencing tools languages are endowed with.

Interestingly, both production and comprehension of spatial demonstratives hinge upon the perceptual context of the utterance and the multimodal communicative signals they co-occur with. When used to denote entities in the physical environment, demonstratives are usually coupled with a pointing gesture which enables disambiguation of the intended referents among competing objects [14–17]. Given their tight link with visuo-motor communicative signals and their role in supporting fundamental cognitive processes, such as managing joint attention, it has been hypothesized that demonstratives might be primordial elements in the emergence of language [7]. Indeed, differently from all other function words, the deictic roots of demonstratives cannot be traced back to any content words, which provides evidence in favor of demonstratives having emerged very early in language evolution [7].

Crucially, spatial demonstratives are deictic expressions, i.e. lexical items that differ from other referencing expressions (such as nouns) in that they do not unambiguously denote the intended object, but rather provide instructions on how to locate it among various competing referents. Without any contextual information, the meaning of words such as this or that is opaque: knowledge of some elements of the context of utterance is therefore required in order to single out what they are meant to refer to. In the case of spatial demonstratives, such knowledge can either consist of information on the perceptual context of the utterance (e.g. position of speaker, addressee and/or other objects), or on discourse context (e.g. common ground between interlocutors) [18].

Demonstratives do not carry univocal information on the intended referent, but demonstrative systems across all languages have multiple lexical forms, which encode some deictic contrast facilitating reference resolution [1]. All languages encode at least one distinction between so-called proximal and distal demonstratives, either by explicit lexical forms, such as in the contrast between this and that in English, or via reinforcing elements, such as in the contrast between the Danish expressions “den her” and “den der” [1]. Some languages have more complex systems with additional deictic distinctions. In the simplest, dyadic case, the use of either a proximal or a distal form is likely to convey information about the distance of the position of the referent within the search space (e.g. close or far from the speaker). Complex systems including three or more distinct demonstrative forms either lexicalize more fine-grained information on the distance of the referent relative to the speaker (e.g. medial distances), or the location of the referent relative to the addressee, or may even explicitly encode perceptual features of the referent such as its visibility [4].

This or that: Distance-based or functional contrast?

It is thus widely accepted that the distance between the referent and the speaker and/or the addressee is a crucial factor when it comes to describing the usage criteria of proximal and distal demonstrative forms (but see [19–21] for complementary perspectives). However, the detailed usage profile of the proximal/distal contrast is still a subject of debate, receiving more and more attention in experimental literature.

In a series of experiments, Coventry and colleagues [22–23] have uncovered a mapping between the proximal/distal contrast and the functional organization of space into peripersonal and extrapersonal space, that is, into space within and outside manual reach. Results are consistent across several studies targeting a number of genetically heterogeneous languages. Interestingly, the mapping between referent location and demonstratives seems to be sensitive to the same dynamic adjustments that can modify the boundaries between extrapersonal and peripersonal space. The use of tools, ownership of the referent, and familiarity seem to affect the scope of the proximal deictic space [22–24].

In a reaching task involving hand movements, Bonfiglioli and colleagues [25] have further explored the relationship between functional organization of space and demonstrative reference using semantic priming. In the study, participants were primed with either a proximal or a distal demonstrative, and then asked to reach for objects placed at two possible distances within the participant’s reach. By looking at reaction times in the initiation of reaching movements, they observed semantic interference effects when the demonstrative used for priming and the object’s location were incongruent. This suggests that the contrastive nature of demonstrative systems is sensitive to relative distances between competing referents even within peripersonal space. Additionally, in a recent study [26], we have shown that, when presented with competing referents in a two-dimensional plane extending away from them, participants display a lateralized bias for proximal demonstratives in favor of the pointing hand. This asymmetry in the organization of space in spatial deixis suggests that the frame of reference for distance-based demonstrative contrasts might be centered on the dominant hand, rather than on the head or the locus of foveal fixation.

Taken together, this corpus of experimental evidence converges on showing that the use of demonstratives could be grounded in representations of objects in terms of their functional properties, including the extent to which they allow manual interaction. In this respect, distance from the speaker is only one of the relevant factors. As we will explore in the next section, a range of semantic features can shape the functional profile of a referent, and therefore possibly modulate lexical preferences for different demonstrative forms.

Spoons, rockets and other dangerous things: Object semantics and demonstrative use

In spite of a growing interest in the variety of factors that can modulate preferences for proximal or distal demonstratives (e.g. object token properties such as familiarity, visibility and ownership [23]), previous studies have largely considered semantic features of the referent (i.e. related to object type) as irrelevant to the usage profile of spatial demonstratives.

As outlined above, however, there is increasing evidence suggesting that the distinction between proximal and distal demonstratives maps onto differences in the extent to which referents afford grasp and/or manipulation. Arguably, such affordances do not only depend on whether an object is located within reach, but also on its physical properties, such as object size, and on more abstract properties, such as its harmfulness, or its familiarity.

Therefore, if the assumption that demonstrative use is sensitive to gradient manual affordances holds true, it can be expected that a preference for proximal demonstratives would be observed for referents whose semantic features result in greater affordances for manipulation, regardless of their position in space.

As mentioned, lower-level properties such as size are intuitively some of the core dimensions in shaping an object’s functional profile. The ability to manipulate an object obviously depends on the object being small enough to be graspable by a human hand. If demonstrative use is indeed tied to manual affordances, then smaller objects should be more likely to be referred to via a proximal demonstrative than bigger objects, as smaller objects tend to afford manipulation more easily than bigger ones.

Alongside lower-level properties, additional cognitive dimensions are also directly relevant to manual affordances. It has been pointed out that the functional organization of space into peripersonal and extrapersonal space responds not only to the need for representing action possibilities, but also to defensive purposes [27]. Indeed, it has been found that object valence can interact with perceptual judgements of reachability [28–29]. Moreover, harmfulness can modulate space and object perception in peripersonal space. Peripersonal space tends to shrink for defensive purposes when harmful or undesirable objects are present, and it has been argued that common neural resources might be responsible for modulating spatial attention in peripersonal space, monitoring harmful events, and selecting and coordinating defensive behavior [30–34]. Furthermore, reachability judgements tend to be influenced by the on-line relationship between the object and the subject. The extension of peripersonal space is reduced more substantially when harmful objects are oriented towards the subject compared to when oriented away, regardless of the perceived degree of harmfulness [35]. As demonstrative use is grounded in a dynamic functional organization of space oriented to manual grasp [22–24, 26], it is reasonable to expect that the degree of harmfulness of a referent could modulate the usage profile of spatial demonstratives. More specifically, if these hypotheses hold true, stimuli perceived as dangerous should be more likely to trigger the use of a distal demonstrative than harmless referents.

The animate-inanimate distinction: A sensory-functional prediction

In addition to properties such as size and harmfulness, existing literature suggests that the distinction between animate and inanimate objects could be another relevant window into the relation between semantics and demonstrative contrasts.

The distinction between animals and artefacts has gained considerable attention in the scientific literature, as it seems to be one of the main dimensions along which human semantic knowledge is organized. Since the 1980s, several neuropsychological studies have shown that semantic knowledge of animate and inanimate objects can be selectively impaired as a consequence of focal brain lesions [36–38]. However, the exact profile of such dissociations and the underlying causes are far from uncontroversial.

In a domain-specific interpretation, the dissociation in cognitive deficits reported in the literature is claimed to reflect an evolved modular organization of semantic knowledge with domain-specific systems, supported by separate neurobiological mechanisms [39].

On the other hand, sensory-functional theories (SFT) hypothesize that the observed dissociations in semantic knowledge for living and non-living things can be explained in terms of lower-level sensory and functional properties of objects [38, 40–43], and more precisely in terms of the relative importance of these properties in the representation of living and non-living things. While sensory features are relatively more important in discriminating between living things, functional properties of objects are attributed more importance in discriminating between different types of artefacts. The observed animate/inanimate dissociation in semantic knowledge is thus explained by a distinction between sensory processing (damage to which primarily results in impaired knowledge for animate beings) and functional representations, where lesions disproportionally impair semantic knowledge for tools and artefacts [38]. More recent formulations of the sensory-functional approach to semantic knowledge have reframed the distinction between animate and inanimate beings in terms of differences in the ratio between sensory and functional features relevant to the representation of category tokens [44]. This ratio is claimed to be larger for animate beings compared to artefacts, as functional features tend to be more prominent in the representation of and discrimination between inanimate objects, while sensory features are equally relevant for the representation of both categories.

Sensory-functional approaches to semantic knowledge have interesting implications with regard to the relationship between object semantics and spatial demonstratives. First, the distinction between animate and inanimate things can itself be thought of as a distinction between objects affording manual interaction to a smaller or greater extent respectively. Intuitively, inanimate objects are often more readily represented in terms of their possibility for manipulation, compared to animate beings. A reasonable hypothesis thus is that a higher proportion of proximal demonstratives would be observed for inanimate referents, compared to animate referents. However, sensory-functional theories additionally posit that representations of inanimate objects are more sensitive to variability along functional dimensions of the feature space than representations of animate objects. If this hypothesis holds true, then differences in functional features, such as size and harmfulness, should determine larger differences in the proportion of proximal vs distal demonstratives between tokens of the inanimate category, compared to those observed between animate beings. In more concrete terms, the possibility for interaction with animals is less determined by their size and harmfulness than is the case for inanimate objects (small, harmless animals also tend to run away…). Crucially, testing such predictions in an experimental fashion would not only contribute to elucidating the usage profile of spatial demonstratives, but also provide insights on core questions of the debate on the organization of human semantic knowledge.

The present study

In the present study, we employed a simple elicitation paradigm in order to investigate whether semantic properties of objects, i.e. differences in affordance for manual interaction, systematically influence speakers’ preferences for proximal or distal demonstrative forms.

We tested speakers of English, Danish and Italian over two experiments. These three languages all have dyadic demonstrative systems, that is, they explicitly encode a simple binary contrast between so-called proximal and distal demonstrative forms. The choice of a cross-linguistic sample was motivated by the aim of countering language-specific phenomena, rather than by expectations for cross-linguistic differences in patterns of demonstrative use.

The experiments were distributed in the form of a multiple-choice online survey. Participants were presented with concrete nouns and asked to pair the words with either a proximal or a distal demonstrative, based on their first and most immediate preference. No further linguistic or perceptual context was provided. This was meant to rule out possible confounds due to contextual or co-textual effects, so that observed systematic preferences could only be driven by properties intrinsic to the stimulus words.

Stimulus words differed along three main semantic dimensions denoting either: 1) animate or inanimate referents; 2) big or small referents; 3) harmful or harmless referents.

Levels of these three semantic dimensions involve a common distinction in the degree to which referents afford manipulation. Small referents are likely to offer more manual affordances than big referents. The same holds for harmless compared to harmful referents, and for inanimate compared to animate referents.

Consequently, we expected a preference for proximal demonstratives: 1) for nouns denoting small referents compared to big referents; 2) for nouns denoting harmless referents, compared to harmful referents; 3) for nouns denoting inanimate referents compared to animate referents. We expected to observe these effects as main effects of each of the variables of interest.

Moreover, in line with the sensory-functional view on semantic knowledge, we predicted that the effect of harmfulness and size would be stronger for inanimate objects, compared to animate beings. We therefore expected to observe two-way interactions between animacy and harmfulness, and between animacy and size. We expected our predictions to hold across the three languages tested in the experiment.

The aim of the study was two-fold. On the one hand, we aimed to help elucidate the usage profile of the proximal/distal demonstrative contrast. On the other hand, we aimed to explore the possibility of using demonstratives to probe the organization of human semantic knowledge.

As mentioned, the study is articulated into two experiments. The first study tested speakers of all three languages (Experiment 1). We then replicated it in Italian and Danish (Experiment 2) using the same design and procedure, but with a different stimulus set.

Data and code are available on the Open Science Framework at osf.io/gnh5s.

Experiment 1

Methods

Participants.

In Experiment 1, we collected data from 131 English speakers (96 native), 102 Danish speakers (101 native), and 131 Italian speakers (126 native).

Participants were recruited online. The survey was advertised via social media and institutional web platforms. No information on the purpose of the experiment was provided in advance. Participants took part in the experiment voluntarily, and they did not receive monetary compensation for their participation.

At the beginning of the study, participants were provided with instructions and a detailed consent form, and they consented to the conditions of participation by proceeding to the first experimental trial. At the end of the experiment, participants were asked to provide information on their gender and age. Moreover, they were asked to specify whether they were native speakers of the language of the survey. If this was not the case, they were further asked to specify their native language. In the English version of the experiment, those who reported English as their native language were asked to specify which variety of English they spoke, choosing between American English, British English, or other. Due to the limitations of our convenience sampling method, this information was not included in the analysis.

Participants were further asked to specify whether they knew the meaning of all the words presented in the survey. If not, they were asked to tick the words they did not know, and the corresponding data points were then excluded from the analysis. Only data from participants who completed the survey were included in the analysis.

As two participants from the English dataset reported not knowing the meaning of eight out of thirty-two experimental words, we excluded them from the analysis. All other participants reported knowing the meaning of the vast majority of words (median: 40 out of 40 words, range: 36–40), so no further data were discarded. Both L1 and L2 speakers were included in the analysis.

The study received ethical approval from the Ethical Committee of the Cognition and Behaviour Lab at Aarhus University, Denmark.

Platform.

The experiment was hosted by the online platform Qualtrics Experience Management Platform (Qualtrics, Provo, UT, USA). Access to the platform was provided by the School of Business and Social Sciences (BSS) at Aarhus University.

Task and procedure.

Participants in both experiments were presented with 40 individual words. Participants had to match each word with either a proximal or a distal demonstrative, that is, this or that in English, den/det her or den/det der in Danish, and questo/a or quello/a in Italian. Target words were displayed one at a time at the top of the screen together with the two possible demonstrative forms positioned on two separate lines below the word. Participants chose between the two demonstrative forms by ticking the box corresponding to the preferred demonstrative form. They were asked to make their choice based on their first and immediate preference. No further context or additional information was displayed on screen.

After having made their choice, participants could proceed to the next word. It was not possible to go back to previous trials once the response was given. A response was required in order to proceed to the next trial. The order of presentation of nouns was randomized across participants. The order in which the two demonstratives were displayed on screen was randomized across trials and participants.
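The two levels of randomization described above can be illustrated with a short sketch. It is not the survey's actual implementation (randomization was presumably handled by the survey platform); the function name, the seed, and the two placeholder nouns are assumptions made for illustration.

```python
import random

def build_trials(nouns, proximal, distal, rng):
    """Return one participant's trial list: noun order and option order shuffled."""
    order = list(nouns)
    rng.shuffle(order)                 # noun order randomized per participant
    trials = []
    for noun in order:
        options = [proximal, distal]
        rng.shuffle(options)           # on-screen order randomized per trial
        trials.append({"noun": noun, "options": options})
    return trials

# Placeholder nouns, not the actual stimulus list.
nouns = ["shoe", "tiger"]
trials = build_trials(nouns, "this", "that", random.Random(1))
for trial in trials:
    print(trial["noun"], trial["options"])
```

Passing an explicit `random.Random` instance keeps each participant's order reproducible, which is a design choice of this sketch rather than something reported in the text.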

Stimuli.

All the words were singular nouns. Out of 40 nouns, 32 were experimental words, and 8 were fillers. All the words used for Experiment 1 are reported in Table 1 (in English). The full stimulus list in all three languages is reported in the Supporting Information (S1 Table).

Table 1. Stimulus words for Experiment 1, English.

https://doi.org/10.1371/journal.pone.0210333.t001

Experimental words always referred to concrete objects. Animacy (animate/inanimate), Size (big/small) and Harmfulness (harmful/harmless) of the referent were the binary variables of interest, yielding 2 × 2 × 2 = 8 combinations per language. Participants were presented with four words from each possible combination (32 experimental words in total), which resulted in a within-subject repeated measures design. Fillers denoted abstract entities.
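The factorial structure can be made concrete with a short sketch. The snippet is illustrative and not part of the original materials; the variable names are assumptions. It enumerates the Animacy × Size × Harmfulness cells and checks that four words per cell plus eight fillers add up to the 40 items each participant saw.

```python
from itertools import product

# Levels of the three binary factors, as named in the text.
FACTORS = {
    "animacy": ("inanimate", "animate"),
    "size": ("small", "big"),
    "harmfulness": ("harmless", "harmful"),
}
WORDS_PER_CELL = 4   # stated in the text
N_FILLERS = 8        # stated in the text

cells = list(product(*FACTORS.values()))      # every combination of levels
n_experimental = len(cells) * WORDS_PER_CELL
n_total = n_experimental + N_FILLERS

print(len(cells), n_experimental, n_total)
```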

Target words were chosen from a semantic knowledge database (in English) including 1000 concrete nouns rated along 218 semantic dimensions on an integer scale from 1 to 5 [45].

For the purpose of the study, words were selected according to their ratings on the dimensions “Is it an animal?”, corresponding to the variable Animacy in our experimental design, “Is it bigger than a loaf of bread?”, corresponding to the variable Size, and “Is it dangerous?”, corresponding to the variable Harmfulness. Different dimensions were available which could be relevant for the variable Size relying on comparisons in size between the target objects and different referent objects. Among them, however, we chose the dimension that best mirrored a distinction between objects whose size allows manual grasp and objects whose size does not.

All nouns labelled as non-animate in our experiment were rated less than 3 along the Animacy dimension, while animate referents were rated 3 or more. Nouns labelled as small referents were rated less than 3 on the Size dimension, while big referents were rated 3 or more. Nouns denoting dangerous referents were rated 3 or more on the Harmfulness dimension, while harmless referents were rated less than 3. All words were translated into Italian and Danish by native speakers of the two languages (the paper’s first and last authors). Danish translations of a subset of the words present in the database had been previously used and validated in the context of a neuroimaging study on word processing [46].
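The labelling rule amounts to thresholding each rating at 3. A minimal sketch follows, assuming integer ratings on the 1 to 5 scale; the function name and the example ratings are invented for illustration and are not taken from the database [45].

```python
THRESHOLD = 3  # cut-off stated in the text: ratings of 3 or more count as "high"

def label_noun(animal, bigger_than_loaf, dangerous):
    """Map three 1-5 ratings onto the binary factor levels used in the design."""
    for rating in (animal, bigger_than_loaf, dangerous):
        if not 1 <= rating <= 5:
            raise ValueError("ratings are expected on the 1-5 integer scale")
    return {
        "animacy": "animate" if animal >= THRESHOLD else "inanimate",
        "size": "big" if bigger_than_loaf >= THRESHOLD else "small",
        "harmfulness": "harmful" if dangerous >= THRESHOLD else "harmless",
    }

# Invented ratings, for illustration only.
example = label_noun(animal=5, bigger_than_loaf=4, dangerous=5)
print(example)
```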

Demonstrative expressions.

In English, participants could choose to match the noun with either “this” or “that”.

In Danish, demonstratives in their adjectival use were created by combining the articles “den” or “det” (roughly equivalent to “it” in English) with the demonstrative adverbs “her” (“here”, in English) or “der” (“there”, in English). As Danish has two grammatical genders, demonstratives were matched to the grammatical gender of the noun. For nouns in common gender, participants could choose between “den her” and “den der”, while for nouns in neuter gender, “det her” and “det der” were presented as possible options.

Italian also has two grammatical genders, and demonstratives in their adjectival use have to be matched for the gender of the noun. For nouns starting with a consonant sound, “questo” and “quello” / “quel” are respectively the proximal and distal masculine forms, while “questa” and “quella” are the feminine forms. The masculine demonstrative “quel” is used for all nouns except those starting with semi-consonantic sounds (i, y and j), with sibilants (s and z), and with other consonant clusters (gn, sc, pn and ps), where “quello” is the correct form. For nouns starting with a vowel, the final vowel of both feminine and masculine demonstratives “questo” / “questa” and “quello” / “quella” is elided, yielding the forms “quest’” and “quell’” respectively. For example, when the feminine demonstratives “questa” and “quella” are to be coupled with the feminine noun “anatra” (duck), the resulting form would be “quest’anatra” or “quell’anatra”. For Italian, we decided to uniformly present the forms “questo” / “quello” for masculine words and “questa” / “quella” for feminine words regardless of the first letter of the noun. We did so in order to obtain uniformity of methods across languages, and because we anticipated that, given the presentation format, presenting the elided forms “quest’” and “quell’” not followed by a noun would have appeared strange to a proficient speaker, even though such forms would have been the grammatically correct ones.

Word frequency.

In order to control for potential confounds due to effects of word frequency, we compared frequency of experimental words across levels of each of the experimental factors. For English, we extracted word frequencies from the British National Corpus [8]. For Italian and Danish, we extracted word frequencies from the respective corpora in the TenTen Corpus Family [47] from 2017. Details on the distribution of word frequencies for Experiment 1 are provided as Supporting Information (S1 Fig).

Linear regressions with animacy, harmfulness and size and their interactions as fixed effects and word frequency as outcome variable revealed no significant differences (p > .05) across levels of the experimental variables in any of the three languages.

Analysis.

Animacy, harmfulness and size were used as binary independent variables.

For the independent variable Animacy, nouns were coded as either inanimate or animate. Inanimate was set as the reference level. For the independent variable Size, nouns were coded as either small or big. Small was set as the reference level. For the independent variable Harmfulness, nouns were coded as either harmless or harmful. Harmless was set as the reference level. For the independent variable Language, Danish was set as the reference level.

The outcome variable coded for the demonstrative chosen at each trial. Responses were coded as either distal or proximal. Distal was set as reference level, while proximal was coded as success outcome. All instances of “this” in English, “den/det her” in Danish, and “questo” / “questa” in Italian were coded as proximal demonstratives, while all instances of “that” in English, of “den/det der” in Danish, and “quello” / “quella” in Italian were re-coded as distal demonstratives.
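The response coding described above can be sketched as a simple lookup. The names below are assumptions made for illustration; only the demonstrative forms and the proximal-as-success convention come from the text.

```python
# Demonstrative forms as presented in the three language versions.
PROXIMAL_FORMS = {"this", "den her", "det her", "questo", "questa"}
DISTAL_FORMS = {"that", "den der", "det der", "quello", "quella"}

def code_response(form):
    """Return 1 for a proximal choice (success) and 0 for a distal choice (reference)."""
    if form in PROXIMAL_FORMS:
        return 1
    if form in DISTAL_FORMS:
        return 0
    raise ValueError(f"unknown demonstrative form: {form!r}")

print([code_response(f) for f in ("this", "den der", "questa")])
```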

Data visualization and analysis were performed using RStudio, version 1.1.383 (RStudio Team 2016).

Data were analyzed using mixed-effects logistic regression implemented via the function glmer from the package lme4 [48]. Parameters for the logistic regression model were estimated using maximum likelihood estimation with Laplace approximation, and inferences were drawn via likelihood ratio tests. R² estimates reported in the analysis and discussion are computed using the function r2 from the R package sjstats [49].
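As a reminder of how a likelihood ratio test works, the sketch below compares two nested models that differ by a single parameter. It is not the authors' code: the function name and the log-likelihood values are hypothetical, and the closed-form p-value used here is valid only for one degree of freedom.

```python
import math

def likelihood_ratio_test_df1(loglik_reduced, loglik_full):
    """Likelihood ratio test for nested models differing by one parameter.

    Returns the test statistic and its p-value under a chi-square
    distribution with 1 degree of freedom.
    """
    stat = max(2.0 * (loglik_full - loglik_reduced), 0.0)
    # Survival function of chi-square(1): P(X > x) = erfc(sqrt(x / 2)).
    p_value = math.erfc(math.sqrt(stat / 2.0))
    return stat, p_value

# Hypothetical log-likelihoods, for illustration only.
stat, p_value = likelihood_ratio_test_df1(loglik_reduced=-100.0, loglik_full=-98.0)
print(stat, round(p_value, 4))
```

Comparisons involving more than one additional parameter would need the general chi-square survival function, which the Python standard library does not provide.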

Data from all languages were analyzed in a single logistic regression model. The fixed effects structure included all the three variables of interest, a categorical predictor coding for language, and the full interaction structure. The random effects structure included a random intercept for participants.
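For reference, the model described above can be written out as follows. The notation is ours and is meant as a sketch of the stated structure rather than a reproduction of the authors' code: $y_{ij} = 1$ if participant $j$ chose the proximal form for word $i$, $\mathbf{x}_{ij}$ collects the dummy-coded predictors (Animacy, Size, Harmfulness, two dummies for Language) together with all of their products, and $u_j$ is the by-participant random intercept.

```latex
\begin{aligned}
y_{ij} &\sim \mathrm{Bernoulli}(p_{ij}), \\
\operatorname{logit}(p_{ij}) &= \beta_0 + u_j + \mathbf{x}_{ij}^{\top}\boldsymbol{\beta}, \\
u_j &\sim \mathcal{N}(0, \sigma_u^2).
\end{aligned}
```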

Results

Fig 1 provides an overview of the data from Experiment 1.

Fig 1. Proportion of proximal demonstratives across languages, Experiment 1.

Small, inanimate and harmless nouns tend to be denoted with a proximal demonstrative across languages. The effects of size and harmfulness tend to be stronger for inanimate, compared to animate referents.

https://doi.org/10.1371/journal.pone.0210333.g001

The analysis revealed an overall preference for distal demonstratives, in line with corpus frequency data for each of the three languages, β = -1.75, se = 0.16, z = -10.98, p < .001.
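To make the size of this estimate easier to read, the sketch below converts it from the log-odds scale to a probability. It assumes that the reported value is the model intercept with proximal coded as the success outcome, as the Analysis section describes, so that it refers to a participant with a random intercept of zero at the reference levels of all predictors; the function name is ours.

```python
import math

def inv_logit(log_odds):
    """Convert log-odds to a probability."""
    return 1.0 / (1.0 + math.exp(-log_odds))

intercept = -1.75                   # estimate reported in the text
p_proximal = inv_logit(intercept)   # implied probability of a proximal choice
print(round(p_proximal, 3))         # 0.148
```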

There was a significant main effect of language, indicating that the proportion of distal demonstratives was overall larger in Danish compared to English, β = 0.43, se = 0.20, z = 2.13, p < .05, and compared to Italian, β = 1.65, se = 0.2, z = 8.41, p < .001. The model revealed a significant main effect of harmfulness, β = 0.74, se = 0.18, z = 4.1, p < .001, and a significant main effect of size, β = 0.56, se = 0.18, z = 3.07, p < .01, both in the predicted direction. Additionally, there was a main effect of animacy, β = 0.52, se = 0.18, z = 2.82, p < .01, as well as significant interactions between animacy and harmfulness, β = 0.91, se = 0.24, z = 3.76, p < .001, and animacy and size, β = 0.68, se = 0.24, z = 2.79, p < .01, suggesting that the effects of harmfulness and size were more pronounced for inanimate objects than for animate beings.

The analysis also hinted at unpredicted cross-linguistic differences in the effects of the experimental variables. There was a significant interaction between language and animacy, showing that the effect of animacy was stronger in Italian compared to Danish, β = -0.49, se = 0.22, z = -2.21, p < .05. Moreover, the interaction between animacy and harmfulness was significantly stronger in Danish, compared to English, β = -0.64, se = 0.31, z = -2.07, p < .05, and Italian, β = -0.98, se = 0.31, z = -3.21, p < .01, as shown by the three-way interaction between animacy, harmfulness and language. A three-way interaction between animacy, size and language indicated that the interaction between animacy and size was marginally less pronounced in English, compared to Danish, β = -0.62, se = 0.31, z = -1.97, p < .05.

The model also displayed an unpredicted three-way interaction between animacy, size, and harmfulness, β = -0.81, se = 0.33, z = -2.46, p < .05, as well as a four-way interaction between animacy, size, harmfulness and language, when comparing Danish to Italian, β = 1.09, se = 0.43, z = 2.56, p < .05. The model had a marginal R² of 0.126 and a conditional R² of 0.265.
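The distinction between the two R² values can be made explicit with the definitions commonly used for mixed models. The formulas below assume that the estimates follow the Nakagawa and Schielzeth approach, with σ²_f the variance of the fixed-effects predictions on the link scale, σ²_u the variance of the participant intercepts, and σ²_d the distribution-specific variance (π²/3 for a logit link in the theoretical variant); whether the r2 function used here applied exactly this variant is an assumption.

```latex
R^2_{\text{marginal}} = \frac{\sigma^2_f}{\sigma^2_f + \sigma^2_u + \sigma^2_d},
\qquad
R^2_{\text{conditional}} = \frac{\sigma^2_f + \sigma^2_u}{\sigma^2_f + \sigma^2_u + \sigma^2_d}
```

Under these definitions, the difference between the two values, 0.265 − 0.126 = 0.139, is the share of variance attributed to differences between participants.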

A summary of the estimated fixed effects coefficients is reported in Table 2.

Table 2. Fixed effects for Experiment 1.

https://doi.org/10.1371/journal.pone.0210333.t002

Interim discussion

Data from the three languages converge in showing strong, significant main effects of harmfulness and size in the direction predicted in the introduction. The proportion of proximal demonstratives was consistently higher for harmless than for harmful referents, and for smaller than for bigger referents. This is in line with our hypothesis that the proximal/distal demonstrative contrast encodes a gradient distinction in the extent to which objects lend themselves to manual interaction.

A main effect of animacy was also detected, but the strength of this effect seemed to vary across languages. Additionally, the model provided strong evidence for the predicted interactions between animacy and size and between animacy and harmfulness, though these interactions were significantly more robust in Danish than in English and Italian. Taken together, the results lend support to all our predictions, suggesting that several motivated semantic dimensions influence speakers’ choice of demonstratives.

The results from Experiment 1, however, leave a number of questions open to discussion.

First, we did not expect to observe strong cross-linguistic differences in the effects of interest. Based on these findings, it cannot be excluded that the cross-linguistic discrepancies reflect actual differences in patterns of demonstrative use across the three languages, but a number of alternative explanations are compatible with the observed patterns. Larger samples, which increase statistical power, might provide more robust insights into cross-linguistic differences in patterns of demonstrative use.

Secondly, as explained in section 2.1.5, in the Italian version of the experiment the pairing of the default distal demonstrative forms presented in the survey with the stimulus words was sometimes not entirely grammatical. A separate analysis of the Italian data, reported in S1 Appendix, showed that this affected participants’ choices: the proportion of proximal demonstratives was significantly higher when the pairing of stimulus word and distal demonstrative was experienced as strange. It therefore cannot be ruled out that the differences between the Danish and Italian data detected in the analysis were driven by this confound.

Furthermore, the unexpected three-way interaction between animacy, size and harmfulness is difficult to relate to the predictions of the present study or to expectations drawn from the literature. It could be explained in terms of the psychological prominence of the harmfulness dimension, with the interaction between size and animacy being amplified for harmful referents. Even so, the effect remains difficult to interpret and highlights the potential for false positives in a model of this complexity. Given the relatively low number of stimulus words, it also cannot be excluded that the effect was driven by particular words rather than by the manipulations of semantic features.

To test the reliability and replicability of the effects detected in Experiment 1, we collected data on a second set of stimulus words. In the second experiment, we aimed for larger samples in order to increase the statistical power of the analysis. The second dataset also enabled us to conduct a cumulative analysis of the two datasets. With more stimulus words available, the cumulative analysis could include a more complex and more conservative random effects structure, with intercepts for each stimulus word, to rule out the possibility that the observed effects were driven by specific stimulus words.
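A random effects structure with intercepts for both participants and stimulus words can be written schematically as follows. The notation is introduced here for illustration only: i indexes participants, j indexes stimulus words, y_ij = 1 denotes the choice of a proximal demonstrative (an assumption about the outcome coding), and x_ij collects the fixed-effects predictors.

```latex
\operatorname{logit} P(y_{ij} = 1) = \mathbf{x}_{ij}^{\top}\boldsymbol{\beta} + u_i + w_j,
\qquad
u_i \sim \mathcal{N}(0, \sigma^2_u),
\quad
w_j \sim \mathcal{N}(0, \sigma^2_w)
```

The word-level term w_j absorbs idiosyncratic tendencies of individual stimulus words, so that effects of the semantic manipulations are evaluated against variation between words as well as between participants.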