Skip to main content
Log in

A practical primer on processing semantic property norm data

  • Research Article
  • Published:
Cognitive Processing Aims and scope Submit manuscript

Abstract

Semantic property listing tasks require participants to generate short propositions (e.g., \({<}{} barks {>}\), \({<}{} has\,\, fur {>}\)) for a specific concept (e.g., DOG). This task is the cornerstone of the creation of semantic property norms which are essential for modeling, stimuli creation, and understanding similarity between concepts. Despite the wide applicability of semantic property norms for a large variety of concepts across different groups of people, the methodological aspects of the property listing task have received less attention, even though the procedure and processing of the data can substantially affect the nature and quality of the measures derived from them. The goal of this paper is to provide a practical primer on how to collect and process semantic property norms. We will discuss the key methods to elicit semantic properties and compare different methods to derive meaningful representations from them. This will cover the role of instructions and test context, property preprocessing (e.g., lemmatization), property weighting, and relationship encoding using ontologies. With these choices in mind, we propose and demonstrate a processing pipeline that transparently documents these steps, resulting in improved comparability across different studies. The impact of these choices will be demonstrated using intrinsic (e.g., reliability, number of properties) and extrinsic measures (e.g., categorization, semantic similarity, lexical processing). This practical primer will offer potential solutions to several long-standing problems and allow researchers to develop new property listing norms overcoming the constraints of previous studies.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2

Similar content being viewed by others

Notes

  1. Throughout this article, <features> will be distinguished from CUES using angular brackets and italic font.

  2. A packrat project compilation is available on GitHub for reproducibility (Ushey et al. 2018), and this manuscript was written in Rmarkdown with papaja (Aust and Barth 2017).

  3. For transparency, the updated csv file should be renamed, which also practically keeps one from overwriting their adjustments if they rerun their code. The csv should be loaded as spelling.dict to continue with the code below.

  4. We mainly focus on lemmatization and do not proceed stemming the word because it introduces additional ambiguity. More specifically, stemming involves processing words using heuristics to remove affixes or inflections, such as ing or s. The stem or root word may not reflect an actual word in the language, as simply removing an affix does not necessarily produce the lemma. For example, in response to AIRPLANE, <flying> can be easily converted to <fly> by removing the ing inflection. However, this same heuristic converts the feature <wings> into <w> after removing both the s for a plural marker and the ing for a participle marker.

  5. These results were lemmatized by creating a lookup dictionary from the features listed in the Buchanan et al. (2019) norms.

References

Download references

Acknowledgements

We would like to thank the editor and two anonymous reviewers for their helpful comments in shaping this manuscript.

Funding

This work was supported by the European Union’s Horizon 2020 research and innovation program under the Marie Skłodowska-Curie Grant Agreement No. 702655 and by the University of Padua (SID 2018) to MM.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Erin M. Buchanan.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Ethical approval

All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee (include name of committee + reference number) and with the 1964 Helsinki Declaration and its later amendments or comparable ethical standards.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Guest-editor: Barry Devereux (Queen’s University Belfast); Reviewers: Anna Rogers (University of Massachusetts Lowell) and a second researcher who prefers to remain anonymous In addition to information on editor and reviewers.

This manuscript is part of the special topic on ‘Eliciting Semantic Properties: Methods and Applications’ guest-edited by Enrico Canessa, Sergio Chaigneau, Barry Devereux, and Alessandro Lenci.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Buchanan, E.M., De Deyne, S. & Montefinese, M. A practical primer on processing semantic property norm data. Cogn Process 21, 587–599 (2020). https://doi.org/10.1007/s10339-019-00939-6

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10339-019-00939-6

Keywords

Navigation