Abstract
Purpose
The shoulder pain and disability index (SPADI) has been extensively evaluated for its psychometric properties using classical test theory (CTT). The purpose of this study was to evaluate its structural validity using Rasch model analysis.
Methods
Responses to the SPADI from 1030 patients referred for physiotherapy with shoulder pain and enrolled in a prospective cohort study were available for Rasch model analysis. Overall fit, individual person and item fit, response format, dependence, unidimensionality, targeting, reliability and differential item functioning (DIF) were examined.
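For readers unfamiliar with the model, the following is a minimal sketch of how a polytomous Rasch model (written here in its partial credit form) assigns probabilities to the ordered response categories of one item. It is not the authors' analysis, which was run in dedicated Rasch software; the function names, the person location and the threshold values are hypothetical and chosen only for illustration.

```python
import math


def category_probabilities(theta, thresholds):
    """Category probabilities for one polytomous Rasch item.

    theta: person location in logits (hypothetical value).
    thresholds: threshold locations in logits (hypothetical values);
    an item with m thresholds has m + 1 ordered categories, 0..m.
    """
    # Category k has exponent sum_{j<=k} (theta - tau_j); category 0 has 0.
    exponents = [0.0]
    for tau in thresholds:
        exponents.append(exponents[-1] + (theta - tau))
    # Subtract the largest exponent before exponentiating for stability.
    peak = max(exponents)
    weights = [math.exp(e - peak) for e in exponents]
    total = sum(weights)
    return [w / total for w in weights]


def expected_score(theta, thresholds):
    """Model-expected item score for a person located at theta."""
    probs = category_probabilities(theta, thresholds)
    return sum(k * p for k, p in enumerate(probs))


# Illustrative item with four thresholds (five response categories).
example_thresholds = [-1.5, -0.5, 0.5, 1.5]
probs = category_probabilities(0.5, example_thresholds)
```

Item fit residuals of the kind reported in this study compare observed responses with expected scores such as those returned by `expected_score`; the standardisation applied by the software is not reproduced here.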
Results
The SPADI pain subscale initially demonstrated misfit due to DIF by age and gender. After iterative analysis it showed good fit to the Rasch model, with acceptable targeting and unidimensionality (overall fit Chi-square statistic 57.2, p = 0.1; mean item fit residual 0.19 (1.5); mean person fit residual 0.44 (1.1); person separation index (PSI) 0.83). The disability subscale, however, showed significant misfit due to uniform DIF, even after iterative analyses were used to explore different solutions to the sources of misfit (overall fit Chi-square statistic 57.2, p = 0.1; mean item fit residual 0.54 (1.26); mean person fit residual 0.38 (1.0); PSI 0.84).
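The person separation index reported for each subscale is a reliability coefficient. As a hedged illustration, assuming the definition commonly used in the Rasch literature (the share of variance in person estimates that is not attributable to measurement error), it could be computed as sketched below. The function name and the input values are hypothetical and are not the study data.

```python
from statistics import mean, variance


def person_separation_index(estimates, standard_errors):
    """PSI = (observed variance - mean error variance) / observed variance.

    estimates: person location estimates in logits (hypothetical).
    standard_errors: standard error of each estimate (hypothetical).
    """
    observed_var = variance(estimates)  # sample variance of the estimates
    error_var = mean([se ** 2 for se in standard_errors])
    return (observed_var - error_var) / observed_var


# Hypothetical person estimates and standard errors, for illustration only.
psi = person_separation_index([-2.0, -1.0, 0.0, 1.0, 2.0], [0.5] * 5)
```

Values closer to 1 indicate that the scale separates respondents more reliably along the measured construct.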
Conclusions
Rasch model analysis of the SPADI identified strengths and limitations not previously observed using CTT methods. The SPADI should be treated as two separate subscales. It is a widely used outcome measure in clinical practice and research; however, the scores derived from it must be interpreted with caution. The pain subscale fits the Rasch model expectations well. The disability subscale does not fit the Rasch model, and in its current format it does not meet the criteria for true interval-level measurement required for use as a primary endpoint in clinical trials. Clinicians should therefore exercise caution when interpreting score changes on the disability subscale and, where possible, compare scores with age- and sex-stratified data.
Notes
RUMM Laboratory Pty Ltd, Perth.
Funding
CJH and RC were funded by the National Institute for Health Research (NIHR Senior Research Fellowship and NIHR Clinical Doctoral Research Fellowship, respectively). The funders of the study had no role in study design, data collection, data analysis, data interpretation or writing of the report. The views and opinions expressed herein are those of the authors and do not necessarily reflect those of the NIHR, NHS or the Department of Health. The authors certify that they have no affiliations with or financial involvement in any organisation or entity with a direct financial interest in the subject matter or materials discussed in the article. Funding was provided by the Research Trainees Coordinating Centre (Grant Nos. SRF-2012-05-119 and CAT-CDRF 10-008).
Ethics declarations
Conflict of interest
The authors declare that they have no conflicts of interest.
Ethical approval
This paper is based on a secondary analysis of data. The original study was approved by the National Research Ethics Service, East of England - Norfolk, UK, July 2011 (Reference 11/EE/0212). All procedures performed in the study involving human participants were in accordance with the ethical standards of the 1964 Helsinki Declaration and its later amendments or comparable ethical standards.
Cite this article
Jerosch-Herold, C., Chester, R., Shepstone, L. et al. An evaluation of the structural validity of the shoulder pain and disability index (SPADI) using the Rasch model. Qual Life Res 27, 389–400 (2018). https://doi.org/10.1007/s11136-017-1746-7