skip to main content
10.1145/3290605.3300299acmconferencesArticle/Chapter ViewAbstractPublication PageschiConference Proceedingsconference-collections
research-article
Open Access
Honorable Mention

Pictorial System Usability Scale (P-SUS): Developing an Instrument for Measuring Perceived Usability

Published:02 May 2019Publication History

ABSTRACT

We have developed a pictorial multi-item scale, called P-SUS (Pictorial System Usability Scale), which aims to measure the perceived usability of mobile devices. The scale is based on the established verbal usability questionnaire SUS (System Usability Scale). A user-centred design process was employed to develop and refine its 10 pictorial items. The scale was tested in a first validation study (N=60) using student participants. Psychometric properties (convergent validity, criterion-related validity, sensitivity, and reliability), as well as the motivation to fill in the scale were assessed. The results indicated satisfactory convergent validity for about two-thirds of the items. Furthermore, strong correlations were obtained for the sum scores between verbal and pictorial SUS, and the pictorial scale was perceived as more motivating than the verbal questionnaire. The P-SUS represents a first attempt to provide a pictorial usability scale for the evaluation of (mobile) devices.

Skip Supplemental Material Section

Supplemental Material

paper069.mp4

mp4

334.7 MB

References

  1. Leonardo Angelini, Francesco Carrino, Stefano Carrino, Maurizio Caon, Omar Abou Khaled, Jurgen Baumgartner, Andreas Sonderegger, Denis Lalanne, and Elena Mugellini. 2014. Gesturing on the Steering Wheel: a User-elicited taxonomy. In Proceedings of the 6th International Conference on Automotive User Interfaces and Interactive Vehicular Applications, 1--8. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Aaron Bangor, Philip Kortum, and James Miller. 2009. Determining what individual SUS scores mean: Adding an adjective rating scale. Journal of usability studies 4, 3: 114--123. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Aaron Bangor, Philip T. Kortum, and James T. Miller. 2008. An empirical evaluation of the system usability scale. Intl. Journal of Human--Computer Interaction 24, 6: 574--594.Google ScholarGoogle ScholarCross RefCross Ref
  4. Juergen Baumgartner, Andreas Sonderegger, and Juergen Sauer. 2019. No need to read: Developing a pictorial single-item scale for measuring perceived usability. International Journal of Human-Computer Studies 122: 78--89.Google ScholarGoogle ScholarCross RefCross Ref
  5. Alberto Betella and Paul F. M. J. Verschure. 2016. The Affective Slider: A Digital Self-Assessment Scale for the Measurement of Human Emotions. PLOS ONE 11, 2: e0148037.Google ScholarGoogle ScholarCross RefCross Ref
  6. Eli Blevis, Youn-kyung Lim, Erik Stolterman, Tracee Vetting Wolf, and Keichi Sato. 2007. Supporting design studio culture in HCI. In CHI '07 extended abstracts on Human factors in computing systems - CHI '07, 2821. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Margaret M. Bradley and Peter J. Lang. 1994. Measuring emotion: the self-assessment manikin and the semantic differential. Journal of behavior therapy and experimental psychiatry 25, 1: 49--59.Google ScholarGoogle ScholarCross RefCross Ref
  8. Margaret M. Bradley and Peter J. Lang. 1994. Measuring emotion: the self-assessment manikin and the semantic differential. Journal of behavior therapy and experimental psychiatry 25, 1: 49--59.Google ScholarGoogle ScholarCross RefCross Ref
  9. Joost Broekens and Willem-Paul Brinkman. 2013. AffectButton: A method for reliable and valid affective self-report. International Journal of Human-Computer Studies 71, 6: 641--667. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. John Brooke. 1996. SUS-A quick and dirty usability scale. Usability evaluation in industry 189, 194: 4--7.Google ScholarGoogle Scholar
  11. John Brooke. 2013. SUS: A Retrospective. J. Usability Studies 8, 2: 29--40. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Heather Buchanan and N. Niven. 2002. Validation of a Facial Image Scale to assess child dental anxiety. International journal of paediatric dentistry 12, 1: 47--52.Google ScholarGoogle Scholar
  13. Kevin Capota, Marco van Hout, and Thea van der Geest. 2007. Measuring the emotional impact of websites: a study on combining a dimensional and discrete emotion approach in measuring visual appeal of university websites. In Proceedings of the 2007 conference on Designing pleasurable products and interfaces, 135--147. Retrieved April 27, 2017 from http://dl.acm.org/citation.cfm?id=1314173 Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Timo Christophersen and Udo Konradt. 2011. Reliability, validity, and sensitivity of a single-item measure of online store usability. International Journal of Human-Computer Studies 69, 4: 269--280. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. J. Cohen. 1988. The effect size. Statistical power analysis for the behavioral sciences: 77--83.Google ScholarGoogle Scholar
  16. Pieter Desmet. 2003. Measuring Emotion: Development and Application of an Instrument to Measure Emotional Responses to Products. In Funology, Mark A. Blythe, Kees Overbeeke, Andrew F. Monk and Peter C. Wright (eds.). Springer Netherlands, 111-- 123.Google ScholarGoogle Scholar
  17. Pieter Desmet, Kees Overbeeke, and Stefan Tax. 2001. Designing products with added emotional value: Development and appllcation of an approach for research through design. The design journal 4, 1: 32--47.Google ScholarGoogle Scholar
  18. Pieter Desmet, Martijn H. Vastenburg, and Natalia Romero. 2016. Mood measurement with Pick-A-Mood: review of current methods and design of a pictorial self-report scale. Journal of Design Research 14, 3: 241--279.Google ScholarGoogle ScholarCross RefCross Ref
  19. Will Eisner. 1985. Theory of Comics and Sequential Art. F.: Poorhouse press.Google ScholarGoogle Scholar
  20. Kraig Finstad. 2010. Response interpolation and scale sensitivity: Evidence against 5-point scales. Journal of Usability Studies 5, 3: 104--110. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Dawn P. Flanagan, Samuel O. Ortiz, and Vincent C. Alfonso. 2013. Essentials of cross-battery assessment. John Wiley & Sons.Google ScholarGoogle Scholar
  22. Mirta Galesic and Michael Bosnjak. 2009. Effects of questionnaire length on participation and indicators of response quality in a web survey. Public opinion quarterly 73, 2: 349--360.Google ScholarGoogle Scholar
  23. Gunther Gediga, Kai-Christoph Hamborg, and Ivo Duntsch. 1999. The IsoMetrics usability inventory: An operationalization of ISO 9241--10 supporting summative and formative evaluation of software systems. Behaviour & Information Technology 18, 3: 151--164.Google ScholarGoogle ScholarCross RefCross Ref
  24. Ramesh Ghiassi, Kevin Murphy, Andrew R. Cummin, and Martyn R. Partridge. 2011. Developing a pictorial Epworth Sleepiness Scale. Thorax 66, 2: 97--100.Google ScholarGoogle ScholarCross RefCross Ref
  25. Shamila Haddad, Steve King, Paul Osmond, and Shahin Heidari. 2012. Questionnaire Design to Determine Children's Thermal Sensation, Preference and Acceptability in the Classroom. 7.Google ScholarGoogle Scholar
  26. Kai-Christoph Hamborg, Julia Hulsmann, and Kai Kaspar. 2014. The Interplay between Usability and Aesthetics: More Evidence for the "What Is Usable Is Beautiful" Notion. Advances in HumanComputer Interaction 2014: 1--13. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Rex Hartson and Pardha S. Pyla. 2012. The UX Book: Process and guidelines for ensuring a quality user experience. Elsevier. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Timothy R. Hinkin. 1995. A review of scale development practices in the study of organizations. Journal of Management 21, 5: 967--988.Google ScholarGoogle ScholarCross RefCross Ref
  29. Kasper Hornbæk and Effie Lai-Chong Law. 2007. Meta-analysis of Correlations Among Usability Measures. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI '07), 617--626. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. International Organization for Standardization. 2010. Ergonomics of human-system interaction - Part 210: Humancentred design for interactive systems (Standard No. 9241--210). Retrieved from https://www.iso.org/standard/52075.htmlGoogle ScholarGoogle Scholar
  31. International Organization for Standardization. 2016. Ergonomics of human-system interaction - Part 11: Usability: Definitions and concepts (Standard No. 9241--11.2). Retrieved from https://www.iso.org/standard/63500.htmlGoogle ScholarGoogle Scholar
  32. Alice M. Isen and Johnmarshall Reeve. 2005. The influence of positive affect on intrinsic and extrinsic motivation: Facilitating enjoyment of play, responsible work behavior, and self-control. Motivation and emotion 29, 4: 295--323.Google ScholarGoogle Scholar
  33. Heike Jungst. 2010. Information Comics--Knowledge Transfer in a Popular Format.Google ScholarGoogle Scholar
  34. Philip Kortum and S. Camille Peres. 2014. The relationship between system effectiveness and subjective usability scores using the System Usability Scale. International Journal of HumanComputer Interaction 30, 7: 575--584.Google ScholarGoogle ScholarCross RefCross Ref
  35. Philip T. Kortum and Aaron Bangor. 2013. Usability ratings for everyday products measured with the System Usability Scale. International Journal of Human-Computer Interaction 29, 2: 67-- 76.Google ScholarGoogle ScholarCross RefCross Ref
  36. Theodore Kunin. 1955. The Construction of a New Type of Attitude Measure. Personnel Psychology 8, 1: 65--77.Google ScholarGoogle ScholarCross RefCross Ref
  37. Clayton Lewis and Robert Mack. 1982. Learning to Use a Text Processing System: Evidence from "Thinking Aloud" Protocols. In Proceedings of the 1982 Conference on Human Factors in Computing Systems (CHI '82), 387--392. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. James (Jim) R. Lewis and Jeff Sauro. 2017. Revisiting the Factor Structure of the System Usability Scale. J. Usability Studies 12, 4: 183--192. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. James R. Lewis. 2002. Psychometric Evaluation of the PSSUQ Using Data from Five Years of Usability Studies. International Journal of Human-Computer Interaction 14, 3--4: 463--488.Google ScholarGoogle ScholarCross RefCross Ref
  40. James R. Lewis. 2018. The System Usability Scale: Past, Present, and Future. International Journal of Human--Computer Interaction 34, 7: 577--590.Google ScholarGoogle ScholarCross RefCross Ref
  41. Bernd Marcus, Michael Bosnjak, Steffen Lindner, Stanislav Pilischenko, and Astrid Schutz. 2007. Compensating for Low Topic Interest and Long Surveys: A Field Experiment on Nonresponse in Web Surveys. Social Science Computer Review 25, 3: 372--383. Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. E. McAuley, T. Duncan, and V. V. Tammen. 1989. Psychometric properties of the Intrinsic Motivation Inventory in a competitive sport setting: a confirmatory factor analysis. Research Quarterly for Exercise and Sport 60, 1: 48--58.Google ScholarGoogle ScholarCross RefCross Ref
  43. Samuel Messick. 1979. Test Validity and the Ethics of Assessment. ETS Research Report Series 1979, 1: i--43.Google ScholarGoogle Scholar
  44. Michael Minge and Laura Riedel. 2013. meCUE-Ein modularer Fragebogen zur Erfassung des Nutzungserlebens. In Mensch & Computer, 89--98.Google ScholarGoogle Scholar
  45. Morten Moshagen and Meinald T. Thielsch. 2010. Facets of visual aesthetics. International Journal of Human-Computer Studies 68, 10: 689--709. Google ScholarGoogle ScholarDigital LibraryDigital Library
  46. Clive Nancarrow and Trixie Cartwright. 2007. Online access panels and tracking research: the conditioning issue. International Journal of Market Research 49, 5: 573--594.Google ScholarGoogle ScholarCross RefCross Ref
  47. Jakob Nielsen and Jonathan Levy. 1994. Measuring Usability: Preference vs. Performance. Commun. ACM 37, 4: 66--75. Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. J. C. Nunnally. 1978. Psychometric Theory, Second. New York: McGrawHill.Google ScholarGoogle Scholar
  49. Jum C. Nunnally and I. H. Bernstein. 1994. Psychometric Theory (McGraw-Hill Series in Psychology). McGraw-Hill New York.Google ScholarGoogle Scholar
  50. Sampo V. Paunonen, Michael C. Ashton, and Douglas N. Jackson. 2001. Nonverbal assessment of the Big Five personality factors. European Journal of Personality 15, 1: 3--18.Google ScholarGoogle ScholarCross RefCross Ref
  51. Laura Reynolds-Keefer, Robert Johnson, and S. Carolina. 2011. Is a picture is worth a thousand words? Creating effective questionnaires with pictures. Practical Assessment, Research & Evaluation 16, 8: 1--7.Google ScholarGoogle Scholar
  52. Richard M. Ryan. 1982. Control and information in the intrapersonal sphere: An extension of cognitive evaluation theory. Journal of personality and social psychology 43, 3: 450.Google ScholarGoogle ScholarCross RefCross Ref
  53. Young Sam Ryu and Tonya L. Smith-Jackson. 2006. Reliability and Validity of the Mobile Phone Usability Questionnaire (MPUQ). J. Usability Studies 2, 1: 39--53. Google ScholarGoogle ScholarDigital LibraryDigital Library
  54. Jeff Sauro. 2011. A practical guide to the system usability scale: Background, benchmarks & best practices. Measuring Usability LLC Denver, CO.Google ScholarGoogle Scholar
  55. Jeff Sauro and James R. Lewis. 2009. Correlations among prototypical usability metrics: evidence for the construct of usability. In Proceedings of the SIGCHI conference on human factors in computing systems, 1609--1618. Google ScholarGoogle ScholarDigital LibraryDigital Library
  56. Jeff Sauro and James R. Lewis. 2016. Quantifying the user experience: Practical statistics for user research. Morgan Kaufmann. Google ScholarGoogle ScholarDigital LibraryDigital Library
  57. Andreas Sonderegger, Klaus Heyden, Alain Chavaillaz, and Juergen Sauer. 2016. AniSAM & AniAvatar: Animated Visualizations of Affective States. 4828--4837. Google ScholarGoogle ScholarDigital LibraryDigital Library
  58. R. A. Stern, J. E. Arruda, C. R. Hooper, G. D. Wolfner, and C. E. Morey. 1997. Visual analogue mood scales to measure internal mood state in neurologically impaired patients: Description and initial validity evidence. Aphasiology 11, 1: 59--71.Google ScholarGoogle ScholarCross RefCross Ref
  59. Noam Tractinsky. 2018. The Usability Construct: A Dead End? Human--Computer Interaction 33, 2: 131--177.Google ScholarGoogle ScholarCross RefCross Ref
  60. Bert Weijters, Elke Cabooter, and Niels Schillewaert. 2010. The effect of rating scale format on response styles: The number of response categories and response category labels. International Journal of Research in Marketing 27, 3: 236--247.Google ScholarGoogle ScholarCross RefCross Ref
  61. Matthias Wilde, Katrin Batz, Anastassiya Kovaleva, and Detlef Urhahne. 2009. Uberprufung einer Kurzskala intrinsischer Motivation (KIM). Zeitschrift für Didaktik der Naturwissenschaften 15.Google ScholarGoogle Scholar
  62. Rudiger Zarnekow and Walter Brenner. 2004. Integriertes Informationsmanagement: Vom Plan, Build, Run zum Source, Make, Deliver. Informationsmanagement: Konzepte und Strategien für die Praxis. dpunkt, Heidelberg: 3--24.Google ScholarGoogle Scholar

Index Terms

  1. Pictorial System Usability Scale (P-SUS): Developing an Instrument for Measuring Perceived Usability

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format .

    View HTML Format