A methodological note on ordered Q-Sort ratings

doi:10.1016/j.jrp.2013.08.013

Journal of Research in Personality

Volume 47, Issue 6, December 2013, Pages 853-858

https://doi.org/10.1016/j.jrp.2013.08.013

Highlights

• Response patterns for Q-Sort and Likert-type ratings are compared.
• Item order effects are found in Q-Sort but not in Likert-type ratings.
• Items sorted later have less variance and are placed closer to the midpoint.
• Item order effects attenuate some relationships with dependent measures.
• Randomizing item presentation in Q-Sort ratings is recommended in some situations.

Abstract

Patterns of ratings using the Q-Sort method and the Likert-type method are compared. Ordering effects are found in Q-Sort ratings that are not present in Likert-type ratings. Specifically, item order is related to both item variance and item placement, such that items appearing near the end of the Q-Sort have less variance and more central placement. This finding is verified across three measures in several datasets spanning nearly 20 years of research. Such item order effects appear to attenuate average absolute relationships (covariances and correlations) between items appearing near the end of the Q-Sort and other measures. In some situations, randomizing item order may be a viable way to minimize these effects at the sample level.

Introduction

The Q-Sort method is widely used in psychological research. This research includes comparing patients in a clinical setting (Block, 1978), evaluating distinctive and normative similarity of personality (Furr, 2008), and evaluating the relationships between personality and perception (Sherman, Nave, & Funder, 2013) to name only a few.¹ This paper reports the presence of substantial item ordering effects occurring in the Q-Sort method. We begin by briefly describing the Q-Sort method and several of its advantages. Then, we examine several large datasets using Q-Sorts for item order effects. After that, relationships between Q-Sort measures and dependent variables are examined for possible attenuation of relationships due to these item order effects. Lastly, implications for future research using the Q-Sort method are discussed.

Q-Sorts afford several advantages over traditional Likert-type measurements. For one, Q-Sorts are not susceptible to many response biases that can occur in Likert-type ratings, such as nay-saying, acquiescence, extreme responding, and midpoint responding (Block, 1978). Using the Q-Sort method, it is not possible for people to simply agree with every question (acquiescence bias) or pick moderate responses for every item (midpoint responding). Rather, characteristics are sorted into a fixed distribution. The Q-Sort method is also considered to be a more taxing procedure, taking longer to complete than a Likert-type rating. Presumably this leads to more valid characterizations of the target being rated. In addition, the Q-Sort method allows for comparisons of two or more targets on a large number of characteristics using a simple profile correlation (Block, 1978).
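As a minimal sketch of the profile correlation mentioned above, the similarity of two targets can be computed as the Pearson correlation between their item placements, taken across items. The function name `profile_correlation` and the six-item placements are hypothetical and chosen only for illustration; they are not data from this study.

```python
from math import sqrt

def profile_correlation(sort_a, sort_b):
    """Pearson correlation across items between two Q-Sort profiles."""
    if len(sort_a) != len(sort_b) or len(sort_a) < 2:
        raise ValueError("profiles must rate the same items (at least two)")
    mean_a = sum(sort_a) / len(sort_a)
    mean_b = sum(sort_b) / len(sort_b)
    # Sums of cross-products and squares of the deviations from each mean.
    s_ab = sum((a - mean_a) * (b - mean_b) for a, b in zip(sort_a, sort_b))
    s_aa = sum((a - mean_a) ** 2 for a in sort_a)
    s_bb = sum((b - mean_b) ** 2 for b in sort_b)
    if s_aa == 0 or s_bb == 0:
        raise ValueError("a profile with no variance has no defined correlation")
    return s_ab / sqrt(s_aa * s_bb)

# Hypothetical placements (categories 1-9) of the same six items for two targets.
target_1 = [1, 3, 5, 5, 7, 9]
target_2 = [2, 3, 5, 4, 8, 9]
similarity = profile_correlation(target_1, target_2)
```

Because every sort uses the same fixed distribution, each profile has the same mean and variance, so the correlation reflects only how similarly the items are ordered for the two targets.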

Using Block’s (1978) procedure, a Q-Sort works as follows: The rater is given a set of cards (items) describing possible characteristics of a target. The rater then sorts each item into one of three initial categories (e.g., uncharacteristic, neutral, or characteristic). After the items are sorted into these three categories, the rater further sorts the items into a pre-specified fixed distribution for the scale that the researcher is using. Often these ratings form a quasi-normal distribution whereby only a limited number of items fit into the most extreme categories (e.g., 1 and 9) while a larger portion of the items fit into the middle categories (e.g., 5). Traditionally, the Q-Sort method was conducted by hand using index-sized cards (for more detail on the Q-Sort method, see Block, 1978). In more recent research, computer programs assist with the administration of this task (e.g., Sherman et al., 2010; Sherman et al., 2012; Sherman et al., 2013). In the computer-facilitated Q-Sort data reported here, the Q-Sorter program (Riverside Accuracy Project, 2013) is used, although a web-based version is now available (Funder, Guillaume, Kumagai, Kawamoto, & Sato, 2012).
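The fixed-distribution constraint can be sketched in a few lines. The category counts below are an assumption used for illustration (a quasi-normal split of 100 items over nine categories); the distribution actually required depends on the Q-Set and is not specified in the text above. The helper name `distribution_violations` is likewise hypothetical.

```python
from collections import Counter

# Assumed quasi-normal forced distribution: category -> number of items allowed.
FORCED_DISTRIBUTION = {1: 5, 2: 8, 3: 12, 4: 16, 5: 18, 6: 16, 7: 12, 8: 8, 9: 5}

def distribution_violations(placements, distribution=FORCED_DISTRIBUTION):
    """Return {category: (observed, required)} for every category whose count is off."""
    counts = Counter(placements)
    unknown = set(counts) - set(distribution)
    if unknown:
        raise ValueError(f"placements use undefined categories: {sorted(unknown)}")
    return {
        category: (counts.get(category, 0), required)
        for category, required in distribution.items()
        if counts.get(category, 0) != required
    }

# A sort that fills each category exactly, and an invalid all-midpoint response.
valid_sort = [c for c, n in FORCED_DISTRIBUTION.items() for _ in range(n)]
all_midpoint = [5] * 100

valid_report = distribution_violations(valid_sort)      # no violations
midpoint_report = distribution_violations(all_midpoint)  # every category is off
```

A check of this kind is what rules out a pure midpoint response: placing all 100 items in category 5 overfills that category and leaves every other category short.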

In the course of conducting a preliminary analysis on some personality data collected using the aforementioned computer program, we noticed specific item order effects in participants’ responses to a Q-Sort measure of personality. In particular, we noticed that the item variances were substantially lower for items occurring later in the Q-Sort (i.e., item order was strongly and negatively correlated with item variance). We further noticed that item order was associated with an increased likelihood of having a score near the scale midpoint, such that items coming later in the sort were more likely to be placed in the middle categories. We wondered if these patterns were simply an artifact of the dataset we collected, or if something more was happening. Was this pattern of responding unique to the content in this measure or would the same pattern be found using other measures as well? Would these patterns be found in Likert-type ratings, or was the Q-Sort method the issue? Would these patterns also be found in Q-Sorts conducted by hand or was it only an issue with the computer-facilitated Q-Sorts? The result of our inquiry is an in-depth analysis of response patterns from several thousand Q-Sort and Likert-type ratings using numerous Q-Sets² gathered from many studies over the course of almost two decades.

This study examines the presence (or absence) of such item order effects in Q-Sort and Likert-type rating measures of personality, behaviors, and situations. We contrast the procedures used to collect each dataset in an effort to isolate the potential cause of item order effects. Then, this study examines relationships between Q-Sort measures and dependent variables, evaluating possible attenuation of these relationships due to item order effects.

Section snippets

Methods

The data presented here come from seven different studies. These include the Riverside Accuracy Project (RAP I; see Funder, 1995), the Riverside Accuracy Project – II (RAP II; see Letzring, Wells, & Funder, 2006), the Riverside Situation Project (RSP; see Sherman et al., 2010; Sherman et al., 2012; Sherman et al., 2013), and the Perceptions of the Thematic Apperception Test (TAT; see Serfass & Sherman, 2013). Data are also included from three as-yet-unpublished studies: Amazon’s M-Turk

Results

First, item order effects on Variance were calculated. For each set of Q-Set ratings, the item standard deviations were correlated with the item numbers (e.g., 1, 2, 3, …, 100 for the CAQ).⁵ Second, item order effects on item Placement were calculated. For each set of Q-Set ratings, the average absolute distance from the scale midpoint (5 for all measures used) for each item was
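The two statistics described in this snippet can be sketched as follows. This is an illustration under stated assumptions, not the authors' analysis code: the snippet is truncated, so the sketch assumes the per-item mean absolute distance from the midpoint is then correlated with item number in the same way as the standard deviations. The function names and the three-rater toy ratings are hypothetical and are not data from the study.

```python
from math import sqrt
from statistics import mean, stdev

def pearson(x, y):
    """Pearson correlation between two equal-length numeric sequences."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def order_effects(ratings, midpoint=5):
    """ratings: one list per rater, placements listed in item order (item 1 first).

    Returns (r_variance, r_placement): the correlations of item number with
    the item standard deviation and with the item's mean absolute distance
    from the scale midpoint.
    """
    columns = list(zip(*ratings))  # one tuple of placements per item
    item_numbers = list(range(1, len(columns) + 1))
    item_sds = [stdev(col) for col in columns]
    item_distances = [mean(abs(v - midpoint) for v in col) for col in columns]
    return pearson(item_numbers, item_sds), pearson(item_numbers, item_distances)

# Made-up toy ratings (3 raters x 4 items) in which later items sit nearer 5.
toy_ratings = [
    [1, 3, 4, 5],
    [9, 7, 6, 5],
    [2, 7, 5, 5],
]
r_variance, r_placement = order_effects(toy_ratings)
```

Negative values of both correlations correspond to the pattern the article reports: items presented later vary less across raters and sit closer to the midpoint.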

Discussion

This study examined data from a variety of samples using thousands of Q-Sort and Likert-type ratings of Q-Set items collected over nearly two decades. The conclusion from these data is quite clear. When items are sorted rather than rated on a Likert-type scale, item order is associated with item variance and the propensity to be placed near the middle categories of the sort distribution. In other words, items presented near the end of a Q-Sort rating have lower variance and are more likely to be placed in

Acknowledgments

We thank David Funder for his feedback on a previous draft of this article. The collection of some data analyzed here was supported by NIMH grant MH-42427 to David C. Funder, Principal Investigator and National Science Foundation grant BNS BCS-0642243 to David C. Funder, Principal Investigator. Any opinions, findings, conclusions, or recommendations expressed in this article are those of the individual researchers and do not necessarily reflect the views of the National Institute of Mental

References (22)

D.G. Serfass et al. (2013). Personality and perceptions of situations from the thematic apperception test. Journal of Research in Personality.
R.A. Sherman et al. (2012). Properties of persons and situations related to overall and distinctive personality-behavior congruence. Journal of Research in Personality.
R.A. Sherman et al. (2013). Situational construal is related to personality and gender. Journal of Research in Personality.
D.J. Bem et al. (1978). Predicting more of the people more of the time: Assessing the personality of situations. Psychological Review.
J. Block (1978). The Q-sort method in personality assessment and psychiatric research.
S.R. Brown (1993). A primer on Q methodology. Operant Subjectivity.
J. Cohen et al. (2003). Applied multiple regression/correlation analysis for the behavioral sciences.
P.T. Costa et al. (1985). The NEO personality manual.
R.M. Cross (2005). Exploring attitudes: The case for Q methodology. Health Education Research.
D.C. Funder (1995). On the accuracy of personality judgment: A realistic approach. Psychological Review.
D.C. Funder et al. (2000). The riverside behavioral Q-sort: A tool for the description of social behavior. Journal of Personality.

Cited by (25)

Green infrastructure in water management: Stakeholder perceptions from South East Queensland, Australia
2023, Cities
Green infrastructure (GI) originated in landscape architecture and landscape ecology and is widely used as an approach to sustainable water management. However, there is no commonly accepted definition of GI for water management in the literature.
This research was undertaken in South East Queensland (SEQ), Australia, which has experienced a long-term cycle of floods and droughts. The research employed the Q-sort methodology supplemented with semi-structured interviews to understand perceptions of GI amongst various stakeholders. Twenty-seven research participants included design, planning, and engineering practitioners, government officers, scientists and community members familiar with GI. Our findings indicate these participants regard GI as a broad concept containing both natural and engineered semi-natural assets offering multiple benefits and functions, yet rarely recognised its economic benefits. Participants were divided on GI's effectiveness for drought management.
We propose a new, consolidated definition of GI for stormwater management: “GI is a strategically planned network of high-quality natural and semi-natural assets that mimics natural processes, with multiple benefits and multifunctionality, such as enhancing stormwater management and providing environmental quality, with social and economic benefits”. We recommend that water management-related policies, strategies, plans, and design guidelines in SEQ and elsewhere, should include a consistent definition of GI for water management to assist professional and community understanding and inform decision-making about flood and drought.
Is it just about me? A comparison between individual and cultural strategies of learning from failure
2022, International Journal of Educational Research Open
Citation Excerpt :
Due to the requirement to think and decide in an authentic way, students are allowed to dive deep in the subject on hand and also reflect on their true preferences, whereas with application of Likert-type surveys they may stop at the level of liking or disliking certain topics or situations. Likert-type surveys are often completed without much reflection (Serfass & Sherman, 2013) and thus provide responses of less variance, discriminatory power, or meaning (Rieber, 2020). Such problems can be avoided when Q methodology is applied, as one of its strengths is the forced sorting of statements into the sorting grid, requiring participants to rank statements relative to all other statements.
The aim of this exploratory study is to research individual and cultural strategies of learning from failure amongst German, Indian and Swedish university students. Our research provides (1) a framework of typal similarities of failure learning within the national cultures of Germany, India and Sweden, as well as (2) understanding of cultural effects on failure learning and (3) insights for entrepreneurship educators to develop programs that steer discussions and reflections on the event of failure as a likely part of the entrepreneurial process. Thus, this research provides a new brick of understanding as our results show that both culture-based strategies as well as culturally independent typical subjectivities in learning from failure exist for the three nations Germany, India and Sweden. The defined typologies can broaden our understanding of learning from failure at an intermediate level, bridging the gap between cultural and individual factors. Furthermore, our paper showcases the suitability of Q methodology to bring to front individual beliefs as well as group-specific opinions in higher education by discussing the methodological capabilities and challenges as experienced during our study.
Structuring educational decisions using the multiple sorting task: An example focusing on international placements in nursing
2017, Nurse Education in Practice
Citation Excerpt :
Card sorting is an interesting, cost-effective and fun way to learn about how people think. Its origins can be found in personal construct theory and repertory grid technique (Winter 2013), and the Q-sort (Ellingsen et al., 2010; Serfass and Sherman, 2013). Sixsmith (1986) noted how the MST could also be used within a phenomenological framework to study individual accounts and personal meanings.
Practical examples of the steps involved in the planning and execution of the multiple sorting task are frequently lacking in published reports. This article demonstrates how the multiple sorting task can be used to structure conversations with a group of health professionals planning an international placement for nursing students. Sixteen participants were drawn from diverse professional backgrounds, including academia, clinical practice, government policy, and placement administrators. Participants sorted 17 statements written on cards into categories of their choice and noted why they sorted the cards into these particular groups. Data were analysed using multidimensional scaling and qualitative perspectives. The analysis identified four key themes that detailed the participants’ views about international placements. These findings demonstrate how the multiple sorting task can be used to generate information that facilitates the examination of important facets of health care practice that universities could cover in preparing students for international placements.
A situational construal approach to healthcare experiences
2015, Social Science and Medicine
Citation Excerpt :
In the present studies, participants instead rated each RSQ item on a 9-point, Likert–type scale (1 = extremely uncharacteristic, 9 = extremely characteristic). Block (1957) compared normative Likert ratings to ipsative Q-sort ratings and described the results of the two as “fully equivalent” (p. 52), although others have found that the Q-sort approach reduces item variance for items appearing towards the end of the scale (Serfass and Sherman, 2013), and Likert ratings may produce a positivity bias on RSQ items (Frascona, 2014). Concerns over the time and specialized programming required to complete a Q-sort online outweighed concerns over a broad positivity bias.
The Situational Construal Model proposes that characteristics of persons and situations interact to influence construal of situations and resultant behavior. We apply this framework to the study of healthcare experiences in two studies.
In Study 1, mTurk users (N = 670) read vignettes of positive, neutral, or negative healthcare experiences, described their construal of the vignette, and completed individual difference measures. In Study 2, mTurk users (N = 292) recalled a recent healthcare visit and reported individual differences, visit characteristics, and outcomes following the visit.
Across both studies, personality was related to the valenced construal of healthcare experiences. In Study 2, patient and visit characteristics predicted situational construal and self-reported visit outcomes, and situational construal statistically mediated relationships between patient and visit characteristics and outcomes.
The current work supports the application of the Situational Construal Model to healthcare situations and demonstrates the importance of both person and situation variables for understanding key healthcare outcomes.
Predicting interpersonal behavior using the Inventory of Individual Differences in the Lexicon (IIDL)
2014, Journal of Research in Personality
Citation Excerpt :
Research assistants used a Q-sort computer program to rate the degree in which each RBQ item was characteristic of the behavior exhibited in that situation using nine categories (1 = extremely uncharacteristic, 9 = extremely characteristic) forming a forced-choice, quasi-normal distribution. Because data collected using Q-sorts may be susceptible to item order effects (Serfass & Sherman, 2013a), we calculated two statistics for the 24 RBQ items: the order effects on variance (i.e., the correlation between the item number and the standard deviation for each item) and item placement (the average absolute distance from the midpoint which reflects the tendency for a coder to place an item near the midpoint). Both effects were medium, r = −.39 (variance) and r = −.40 (placement).
Personality psychology relies on well-validated measures of individual differences to describe and predict behavior. A newer comprehensive measure, the Inventory of Individual Differences in the Lexicon (IIDL) has been developed, but its ability to predict actual behavior has not been examined. The present article uses the IIDL to predict directly observed behavior, as categorized by the Interpersonal Circumplex (IPC). Video recorded interviews with participants in a laboratory setting were coded for directly observable behavior. Forty-eight IIDL items had meaningful associations with the IPC. Most importantly, 25 items provided unique predictive information above and beyond a factor-level measure of personality. This suggests that comprehensive measures of personality should be considered for their additive validity in predicting interpersonal behavior.
To trust or to restrict?–mapping professional perspectives on intelligence powers and oversight in the Netherlands using Q-methodology
2024, Intelligence and National Security

Brief Report
A methodological note on ordered Q-Sort ratings