research-article

Open Access

Pictorial System Usability Scale (P-SUS): Developing an Instrument for Measuring Perceived Usability

Authors:
Juergen Baumgartner

University of Fribourg & Puzzle ITC, Fribourg, Switzerland

University of Fribourg & Puzzle ITC, Fribourg, Switzerland
View Profile

,
Naomi Frei

University of Fribourg, Fribourg, Switzerland

University of Fribourg, Fribourg, Switzerland
View Profile

,
Mascha Kleinke

University of Fribourg, Fribourg, Switzerland

University of Fribourg, Fribourg, Switzerland
View Profile

,
Juergen Sauer

University of Fribourg, Fribourg, Switzerland

University of Fribourg, Fribourg, Switzerland
View Profile

,
Andreas Sonderegger

EPFL, Lausanne, Switzerland

EPFL, Lausanne, Switzerland
View Profile

CHI '19: Proceedings of the 2019 CHI Conference on Human Factors in Computing SystemsMay 2019Paper No.: 69Pages 1–11https://doi.org/10.1145/3290605.3300299

Published:02 May 2019Publication History

CHI '19: Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems

Pages 1–11

ABSTRACT

We have developed a pictorial multi-item scale, called P-SUS (Pictorial System Usability Scale), which aims to measure the perceived usability of mobile devices. The scale is based on the established verbal usability questionnaire SUS (System Usability Scale). A user-centred design process was employed to develop and refine its 10 pictorial items. The scale was tested in a first validation study (N=60) using student participants. Psychometric properties (convergent validity, criterion-related validity, sensitivity, and reliability), as well as the motivation to fill in the scale were assessed. The results indicated satisfactory convergent validity for about two-thirds of the items. Furthermore, strong correlations were obtained for the sum scores between verbal and pictorial SUS, and the pictorial scale was perceived as more motivating than the verbal questionnaire. The P-SUS represents a first attempt to provide a pictorial usability scale for the evaluation of (mobile) devices.

Supplemental Material

paper069.mp4

mp4

334.7 MB

Download

References

Leonardo Angelini, Francesco Carrino, Stefano Carrino, Maurizio Caon, Omar Abou Khaled, Jurgen Baumgartner, Andreas Sonderegger, Denis Lalanne, and Elena Mugellini. 2014. Gesturing on the Steering Wheel: a User-elicited taxonomy. In Proceedings of the 6th International Conference on Automotive User Interfaces and Interactive Vehicular Applications, 1--8. Google ScholarDigital Library
Aaron Bangor, Philip Kortum, and James Miller. 2009. Determining what individual SUS scores mean: Adding an adjective rating scale. Journal of usability studies 4, 3: 114--123. Google ScholarDigital Library
Aaron Bangor, Philip T. Kortum, and James T. Miller. 2008. An empirical evaluation of the system usability scale. Intl. Journal of Human--Computer Interaction 24, 6: 574--594.Google ScholarCross Ref
Juergen Baumgartner, Andreas Sonderegger, and Juergen Sauer. 2019. No need to read: Developing a pictorial single-item scale for measuring perceived usability. International Journal of Human-Computer Studies 122: 78--89.Google ScholarCross Ref
Alberto Betella and Paul F. M. J. Verschure. 2016. The Affective Slider: A Digital Self-Assessment Scale for the Measurement of Human Emotions. PLOS ONE 11, 2: e0148037.Google ScholarCross Ref
Eli Blevis, Youn-kyung Lim, Erik Stolterman, Tracee Vetting Wolf, and Keichi Sato. 2007. Supporting design studio culture in HCI. In CHI '07 extended abstracts on Human factors in computing systems - CHI '07, 2821. Google ScholarDigital Library
Margaret M. Bradley and Peter J. Lang. 1994. Measuring emotion: the self-assessment manikin and the semantic differential. Journal of behavior therapy and experimental psychiatry 25, 1: 49--59.Google ScholarCross Ref
Margaret M. Bradley and Peter J. Lang. 1994. Measuring emotion: the self-assessment manikin and the semantic differential. Journal of behavior therapy and experimental psychiatry 25, 1: 49--59.Google ScholarCross Ref
Joost Broekens and Willem-Paul Brinkman. 2013. AffectButton: A method for reliable and valid affective self-report. International Journal of Human-Computer Studies 71, 6: 641--667. Google ScholarDigital Library
John Brooke. 1996. SUS-A quick and dirty usability scale. Usability evaluation in industry 189, 194: 4--7.Google Scholar
John Brooke. 2013. SUS: A Retrospective. J. Usability Studies 8, 2: 29--40. Google ScholarDigital Library
Heather Buchanan and N. Niven. 2002. Validation of a Facial Image Scale to assess child dental anxiety. International journal of paediatric dentistry 12, 1: 47--52.Google Scholar
Kevin Capota, Marco van Hout, and Thea van der Geest. 2007. Measuring the emotional impact of websites: a study on combining a dimensional and discrete emotion approach in measuring visual appeal of university websites. In Proceedings of the 2007 conference on Designing pleasurable products and interfaces, 135--147. Retrieved April 27, 2017 from http://dl.acm.org/citation.cfm?id=1314173 Google ScholarDigital Library
Timo Christophersen and Udo Konradt. 2011. Reliability, validity, and sensitivity of a single-item measure of online store usability. International Journal of Human-Computer Studies 69, 4: 269--280. Google ScholarDigital Library
J. Cohen. 1988. The effect size. Statistical power analysis for the behavioral sciences: 77--83.Google Scholar
Pieter Desmet. 2003. Measuring Emotion: Development and Application of an Instrument to Measure Emotional Responses to Products. In Funology, Mark A. Blythe, Kees Overbeeke, Andrew F. Monk and Peter C. Wright (eds.). Springer Netherlands, 111-- 123.Google Scholar
Pieter Desmet, Kees Overbeeke, and Stefan Tax. 2001. Designing products with added emotional value: Development and appllcation of an approach for research through design. The design journal 4, 1: 32--47.Google Scholar
Pieter Desmet, Martijn H. Vastenburg, and Natalia Romero. 2016. Mood measurement with Pick-A-Mood: review of current methods and design of a pictorial self-report scale. Journal of Design Research 14, 3: 241--279.Google ScholarCross Ref
Will Eisner. 1985. Theory of Comics and Sequential Art. F.: Poorhouse press.Google Scholar
Kraig Finstad. 2010. Response interpolation and scale sensitivity: Evidence against 5-point scales. Journal of Usability Studies 5, 3: 104--110. Google ScholarDigital Library
Dawn P. Flanagan, Samuel O. Ortiz, and Vincent C. Alfonso. 2013. Essentials of cross-battery assessment. John Wiley & Sons.Google Scholar
Mirta Galesic and Michael Bosnjak. 2009. Effects of questionnaire length on participation and indicators of response quality in a web survey. Public opinion quarterly 73, 2: 349--360.Google Scholar
Gunther Gediga, Kai-Christoph Hamborg, and Ivo Duntsch. 1999. The IsoMetrics usability inventory: An operationalization of ISO 9241--10 supporting summative and formative evaluation of software systems. Behaviour & Information Technology 18, 3: 151--164.Google ScholarCross Ref
Ramesh Ghiassi, Kevin Murphy, Andrew R. Cummin, and Martyn R. Partridge. 2011. Developing a pictorial Epworth Sleepiness Scale. Thorax 66, 2: 97--100.Google ScholarCross Ref
Shamila Haddad, Steve King, Paul Osmond, and Shahin Heidari. 2012. Questionnaire Design to Determine Children's Thermal Sensation, Preference and Acceptability in the Classroom. 7.Google Scholar
Kai-Christoph Hamborg, Julia Hulsmann, and Kai Kaspar. 2014. The Interplay between Usability and Aesthetics: More Evidence for the "What Is Usable Is Beautiful" Notion. Advances in HumanComputer Interaction 2014: 1--13. Google ScholarDigital Library
Rex Hartson and Pardha S. Pyla. 2012. The UX Book: Process and guidelines for ensuring a quality user experience. Elsevier. Google ScholarDigital Library
Timothy R. Hinkin. 1995. A review of scale development practices in the study of organizations. Journal of Management 21, 5: 967--988.Google ScholarCross Ref
Kasper Hornbæk and Effie Lai-Chong Law. 2007. Meta-analysis of Correlations Among Usability Measures. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI '07), 617--626. Google ScholarDigital Library
International Organization for Standardization. 2010. Ergonomics of human-system interaction - Part 210: Humancentred design for interactive systems (Standard No. 9241--210). Retrieved from https://www.iso.org/standard/52075.htmlGoogle Scholar
International Organization for Standardization. 2016. Ergonomics of human-system interaction - Part 11: Usability: Definitions and concepts (Standard No. 9241--11.2). Retrieved from https://www.iso.org/standard/63500.htmlGoogle Scholar
Alice M. Isen and Johnmarshall Reeve. 2005. The influence of positive affect on intrinsic and extrinsic motivation: Facilitating enjoyment of play, responsible work behavior, and self-control. Motivation and emotion 29, 4: 295--323.Google Scholar
Heike Jungst. 2010. Information Comics--Knowledge Transfer in a Popular Format.Google Scholar
Philip Kortum and S. Camille Peres. 2014. The relationship between system effectiveness and subjective usability scores using the System Usability Scale. International Journal of HumanComputer Interaction 30, 7: 575--584.Google ScholarCross Ref
Philip T. Kortum and Aaron Bangor. 2013. Usability ratings for everyday products measured with the System Usability Scale. International Journal of Human-Computer Interaction 29, 2: 67-- 76.Google ScholarCross Ref
Theodore Kunin. 1955. The Construction of a New Type of Attitude Measure. Personnel Psychology 8, 1: 65--77.Google ScholarCross Ref
Clayton Lewis and Robert Mack. 1982. Learning to Use a Text Processing System: Evidence from "Thinking Aloud" Protocols. In Proceedings of the 1982 Conference on Human Factors in Computing Systems (CHI '82), 387--392. Google ScholarDigital Library
James (Jim) R. Lewis and Jeff Sauro. 2017. Revisiting the Factor Structure of the System Usability Scale. J. Usability Studies 12, 4: 183--192. Google ScholarDigital Library
James R. Lewis. 2002. Psychometric Evaluation of the PSSUQ Using Data from Five Years of Usability Studies. International Journal of Human-Computer Interaction 14, 3--4: 463--488.Google ScholarCross Ref
James R. Lewis. 2018. The System Usability Scale: Past, Present, and Future. International Journal of Human--Computer Interaction 34, 7: 577--590.Google ScholarCross Ref
Bernd Marcus, Michael Bosnjak, Steffen Lindner, Stanislav Pilischenko, and Astrid Schutz. 2007. Compensating for Low Topic Interest and Long Surveys: A Field Experiment on Nonresponse in Web Surveys. Social Science Computer Review 25, 3: 372--383. Google ScholarDigital Library
E. McAuley, T. Duncan, and V. V. Tammen. 1989. Psychometric properties of the Intrinsic Motivation Inventory in a competitive sport setting: a confirmatory factor analysis. Research Quarterly for Exercise and Sport 60, 1: 48--58.Google ScholarCross Ref
Samuel Messick. 1979. Test Validity and the Ethics of Assessment. ETS Research Report Series 1979, 1: i--43.Google Scholar
Michael Minge and Laura Riedel. 2013. meCUE-Ein modularer Fragebogen zur Erfassung des Nutzungserlebens. In Mensch & Computer, 89--98.Google Scholar
Morten Moshagen and Meinald T. Thielsch. 2010. Facets of visual aesthetics. International Journal of Human-Computer Studies 68, 10: 689--709. Google ScholarDigital Library
Clive Nancarrow and Trixie Cartwright. 2007. Online access panels and tracking research: the conditioning issue. International Journal of Market Research 49, 5: 573--594.Google ScholarCross Ref
Jakob Nielsen and Jonathan Levy. 1994. Measuring Usability: Preference vs. Performance. Commun. ACM 37, 4: 66--75. Google ScholarDigital Library
J. C. Nunnally. 1978. Psychometric Theory, Second. New York: McGrawHill.Google Scholar
Jum C. Nunnally and I. H. Bernstein. 1994. Psychometric Theory (McGraw-Hill Series in Psychology). McGraw-Hill New York.Google Scholar
Sampo V. Paunonen, Michael C. Ashton, and Douglas N. Jackson. 2001. Nonverbal assessment of the Big Five personality factors. European Journal of Personality 15, 1: 3--18.Google ScholarCross Ref
Laura Reynolds-Keefer, Robert Johnson, and S. Carolina. 2011. Is a picture is worth a thousand words? Creating effective questionnaires with pictures. Practical Assessment, Research & Evaluation 16, 8: 1--7.Google Scholar
Richard M. Ryan. 1982. Control and information in the intrapersonal sphere: An extension of cognitive evaluation theory. Journal of personality and social psychology 43, 3: 450.Google ScholarCross Ref
Young Sam Ryu and Tonya L. Smith-Jackson. 2006. Reliability and Validity of the Mobile Phone Usability Questionnaire (MPUQ). J. Usability Studies 2, 1: 39--53. Google ScholarDigital Library
Jeff Sauro. 2011. A practical guide to the system usability scale: Background, benchmarks & best practices. Measuring Usability LLC Denver, CO.Google Scholar
Jeff Sauro and James R. Lewis. 2009. Correlations among prototypical usability metrics: evidence for the construct of usability. In Proceedings of the SIGCHI conference on human factors in computing systems, 1609--1618. Google ScholarDigital Library
Jeff Sauro and James R. Lewis. 2016. Quantifying the user experience: Practical statistics for user research. Morgan Kaufmann. Google ScholarDigital Library
Andreas Sonderegger, Klaus Heyden, Alain Chavaillaz, and Juergen Sauer. 2016. AniSAM & AniAvatar: Animated Visualizations of Affective States. 4828--4837. Google ScholarDigital Library
R. A. Stern, J. E. Arruda, C. R. Hooper, G. D. Wolfner, and C. E. Morey. 1997. Visual analogue mood scales to measure internal mood state in neurologically impaired patients: Description and initial validity evidence. Aphasiology 11, 1: 59--71.Google ScholarCross Ref
Noam Tractinsky. 2018. The Usability Construct: A Dead End? Human--Computer Interaction 33, 2: 131--177.Google ScholarCross Ref
Bert Weijters, Elke Cabooter, and Niels Schillewaert. 2010. The effect of rating scale format on response styles: The number of response categories and response category labels. International Journal of Research in Marketing 27, 3: 236--247.Google ScholarCross Ref
Matthias Wilde, Katrin Batz, Anastassiya Kovaleva, and Detlef Urhahne. 2009. Uberprufung einer Kurzskala intrinsischer Motivation (KIM). Zeitschrift für Didaktik der Naturwissenschaften 15.Google Scholar
Rudiger Zarnekow and Walter Brenner. 2004. Integriertes Informationsmanagement: Vom Plan, Build, Run zum Source, Make, Deliver. Informationsmanagement: Konzepte und Strategien für die Praxis. dpunkt, Heidelberg: 3--24.Google Scholar

Index Terms

Pictorial System Usability Scale (P-SUS): Developing an Instrument for Measuring Perceived Usability
1. Human-centered computing
  1. Human computer interaction (HCI)
    1. HCI design and evaluation methods

Recommendations

Questionnaire experience of the pictorial usability inventory (PUI) – a comparison of pictorial and hybrid usability scales
Highlights
- This article compares pictorial, hybrid and verbal usability questionnaires using classical psychometric parameters and respondent-centred measures.
- A recently developed questionnaire measuring ‘questionnaire experience’ was used to ...
Abstract
In recent years, alternative types of usability questionnaires using graphical elements (pictorial scales) or a combination of graphical and verbal elements (hybrid scales) have been introduced. Previous research indicates that these ...
Read More
Pictorial usability inventory (PUI): a pilot study
MuC '20: Proceedings of Mensch und Computer 2020

The Pictorial Usability Inventory (PUI) is an image-based instrument that was designed for the assessment of perceived usability of smartphone apps. The aim was to provide users with an easy and intuitive access to the evaluation of perceived usability. ...
Read More
Revisiting the factor structure of the system usability scale

In 2009, we published a paper in which we showed how three independent sources of data indicated that, rather than being a unidimensional measure of perceived usability, the System Usability Scale apparently had two factors: Usability (all items except ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
CHI '19: Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems
May 2019
9077 pages
ISBN:9781450359702
DOI:10.1145/3290605
General Chairs:
Stephen Brewster
University of Glasgow, Scotland, UK
,
Geraldine Fitzpatrick
TU Wien, Austria
,
Program Chairs:
Anna Cox
University College London, UK
,
Vassilis Kostakos
University of Melbourne, Australia
Copyright © 2019 Owner/Author
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike International 4.0 License.
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 2 May 2019
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Badges
- Honorable Mention
Author Tags
consumer product
mobile device evaluation
perceived usability
pictorial scale
Qualifiers
- research-article
Conference

Acceptance Rates
CHI '19 Paper Acceptance Rate703of2,958submissions,24%Overall Acceptance Rate6,199of26,314submissions,24%
More
Upcoming Conference
CHI PLAY '24

Sponsor:

sigchi

The Annual Symposium on Computer-Human Interaction in Play

October 14 - 17, 2024

Tampere , Finland
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 15
  Total Citations
  View Citations
- 2,463
  Total Downloads
- Downloads (Last 12 months)453
- Downloads (Last 6 weeks)78
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format

Pictorial System Usability Scale (P-SUS): Developing an Instrument for Measuring Perceived Usability

CHI '19: Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems

ABSTRACT

Supplemental Material

References

Cited By

Index Terms

Recommendations

Questionnaire experience of the pictorial usability inventory (PUI) – a comparison of pictorial and hybrid usability scales

Pictorial usability inventory (PUI): a pilot study

Revisiting the factor structure of the system usability scale