Skip to main content

Optimizing Scatterplot-Matrices for Decision-Support:

An Experimental Eye-Tracking Study Assessing Situational Cognitive Load

  • Conference paper
  • First Online:
Information Systems and Neuroscience (NeuroIS 2021)

Part of the book series: Lecture Notes in Information Systems and Organisation ((LNISO,volume 52))

Included in the following conference series:

Abstract

The scatterplot matrix is defined to be a standard method for multivariate data visualization; nonetheless, their use for decision-support in a corporate environment is scarce. Amongst others, longstanding criticism lies in the lack of empirical testing to investigate optimal design specifications as well as areas of application from a business related perspective. Thus, on the basis of an innovative approach to assess a visualization’s fitness for efficient and effective decision-making given a user’s situational cognitive load, this study investigates the usability of a scatterplot matrix while performing typical tasks associated with multidimensional datasets (correlation and distribution assessment). A laboratory experiment recording eye-tracking data investigates the design of the matrix and its influence on the decision-maker’s ability to process the presented information. Especially, the information content presented in the diagonal as well as the size of the matrix are tested and linked to the user’s individual processing capabilities. Results show that the design of the scatterplot as well as the size of the matrix influenced the decision-making greatly.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 119.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 159.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Bačić, D., Fadlalla, A.: Business information visualization intellectual contributions: an integrative framework of visualization capabilities and dimensions of visual intelligence. Decis. Support Syst. 89, 77–86 (2016)

    Article  Google Scholar 

  2. Appelbaum, D., Kogan, A., Vasarhelyi, M., et al.: Impact of business analytics and enterprise systems on managerial accounting. Int. J. Account. Inf. Syst. 25, 29–44 (2017)

    Article  Google Scholar 

  3. Perkhofer, L.M., Hofer, P., Walchshofer, C., et al.: Interactive visualization of big data in the field of accounting. J. Appl. Account. Res. 5, 78 (2019)

    Google Scholar 

  4. Perkhofer, L., Walchshofer, C., Hofer, P.: Does design matter when visualizing big data? An empirical study to investigate the effect of visualization type and interaction use. J. Manag. Control. 31(1–2), 55–95 (2020). https://doi.org/10.1007/s00187-020-00294-0

    Article  Google Scholar 

  5. Bertini, E., Tatu, A., Keim, D.A.: Quality metrics in high-dimensional data visualization: an overview and systematization. IEEE Trans. Vis Comput. Graph. 17, 2203–2212 (2011)

    Article  Google Scholar 

  6. Keim, D.A.: Information visualization and visual data mining. IEEE Trans. Vis. Comput. Graph. 8, 1–8 (2002)

    Article  Google Scholar 

  7. Inselberg, A.: The plane with parallel coordinates. Vis. Comput. 1, 69–91 (1985). https://doi.org/10.1007/BF01898350

    Article  Google Scholar 

  8. Isenberg, T., Isenberg, P., Chen, J., et al.: A systematic review on the practice of evaluating visualization. IEEE Trans. Vis. Comput. Graph. 19, 2818–2827 (2013)

    Article  Google Scholar 

  9. Perkhofer, L., Walchshofer, C., Hofer, P.: Designing visualizations to identify and assess correlations and trends: an experimental study based on price developments. In: Lehner, O. (ed.) Proceedings of the 17th Conference on Finance, Risk and Accounting Perspectives. ACRN Oxford, pp. 294–340 (2019)

    Google Scholar 

  10. Ware, C.: Information Visualization: Perception for Design , 3rd edn. Elsevier Ltd., Oxford (2012)

    Google Scholar 

  11. Tufte, E.R.: The Visual Display of Quantitative Information, 1st edn. Graphics Press, Connecticut (1983)

    Google Scholar 

  12. Perkhofer, L.: A cognitive load-theoretic framework for information visualization. In: Lehner, O. (ed.) Proceedings of the 17th Conference on Finance, Risk and Accounting Perspectives, in Print. ACRN Oxford, pp. 9–25 (2019)

    Google Scholar 

  13. Perkhofer, L., Lehner, O.: Using gaze behavior to measure cognitive load. In: Davis, F.D., Riedl, R., vom Brocke, J., Léger, P.-M., Randolph, A.B. (eds.) Information Systems and Neuroscience. LNISO, vol. 29, pp. 73–83. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-01087-4_9

    Chapter  Google Scholar 

  14. Brücken, R., Plass, J.L., Detlev, L.: Direct measurement of cognitive load in multimedia learning. Educ. Psychol. 38, 53–61 (2003)

    Article  Google Scholar 

  15. Sarikaya, A., Gleicher, M.: Scatterplots: tasks, data, and designs. IEEE Trans. Vis. Comput. Graph. 24, 402–412 (2018)

    Article  Google Scholar 

  16. Li, J., Martens, J.-B., van Wijk, J.J.: Judging correlation from scatterplots and parallel coordinate plots. Inf. Visual. 1, 69 (2008)

    Google Scholar 

  17. Li, J., Martens, J.-B., van Wijk, J.J.: A model of symbol size discrimination in scatterplots. In: Mynatt, E.D., Hudson, S.E., Fitzpatrick, G. (eds.) CHI Conference, pp. 2553–2562. Association for Computing Machinery, New York (2010)

    Google Scholar 

  18. Kanjanabose, R., Abdul-Rahman, A., Chen, M.: A multi-task comparative study on scatter plots and parallel coordinates plots. Comput. Graph. Forum 34, 261–270 (2015)

    Article  Google Scholar 

  19. Netzel, R., Vuong, J., Engelke, U., et al.: Comparative eye-tracking evaluation of scatterplots and parallel coordinates. Vis. Inform. 1, 118–131 (2017)

    Article  Google Scholar 

  20. Nguyen, Q.V., Simoff, S., Qian, Y., et al.: Deep exploration of multidimensional data with linkable scatterplots. In: Zhang, K., Kerren, A. (eds.) Proceedings of the 9th International Symposium on Visual Information Communication and Interaction, pp. 43–50. ACM, New York (2016)

    Google Scholar 

  21. Elmqvist, N., Dragicevic, P., Fekete, J.-D.: Rolling the dice: multidimensional visual exploration using scatterplot matrix navigation. IEEE Trans. Vis. Comput. Graph. 14, 1141–1148 (2008)

    Google Scholar 

  22. Urribarri, D.K., Castro, S.M.: Prediction of data visibility in two-dimensional scatterplots. Inf. Vis. 16, 113–125 (2016)

    Article  Google Scholar 

  23. Elmqvist, N., Moere, V.A., Jetter, H.-C., et al.: Fluid interaction for information visualization. Inf. Vis. 10, 327–340 (2011)

    Article  Google Scholar 

  24. Falschlunger, L., Lehner, O., Treiblmaier, H., et al.: Visual representation of information as an antecedent of perceptive efficiency: the effect of experience. In: Proceedings of the 49th Hawaii International Conference on System Sciences (HICSS), pp. 668–676. IEEE (2016)

    Google Scholar 

  25. Zagermann, J., Pfeil, U., Reiterer, H.: Measuring cognitive load using eye tracking technology in visual computing. In: Sedlmair, M., Isenberg, P., Isenberg, T., et al. (eds.) Proceedings of the Beyond Time and Errors on Novel Evaluation Methods for Visualization - BELIV 2016, pp. 78–85. ACM Press, New York (2016)

    Google Scholar 

  26. Toker, D., Conati, C., Steichen, B., et al.: Individual user characteristics and information visualization: connecting the dots through eye tracking. In: Proceedings of the 31st CHI - Changing Perspectives, pp. 295–304. ACM, New York (2013)

    Google Scholar 

  27. Granholm, E., Asarnow, R.F., Sarkin, A.J., et al.: Pupillary responses index cognitive resources limitations. Psychophysiology 33, 457–461 (1996)

    Article  Google Scholar 

  28. Hossain, G., Yeasin, M.: Understanding effects of cognitive load from pupillary responses using Hilbert analytic phase. In: IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 381–386. IEEE (2014)

    Google Scholar 

  29. Renshaw, J.A., Finlay, J.E., Tyfa, D., et al.: Designing for visual influence: an eye tracking study of the usability of graphical management information. In: Rauterberg, M., Menozzi, M., Wesson. J. (eds.) Proceedings of the INTERACT 2003, pp. 144–151. IOS Press (2003)

    Google Scholar 

  30. Steichen, B., Carenini, G., Conati, C.: User-adaptive information visualization: Using eye gaze data to infer visualization tasks and user cognitive abilities. In: Proceedings of the IUI 2013, pp. 317–328. ACM, New York (2013)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lisa Perkhofer .

Editor information

Editors and Affiliations

Appendix

Appendix

1.1 Appendix A: Pre-study 1 – Scatterplot Design

Study Design:

Participants had to first assess the type of correlation (positive, negative of no correlation) and then the correlation’s magnitude on a scale from 0 to 100 for positive correlations and a scale from -100 to 0 for negative correlations. Participants were recruited on Amazon Mechanical Turk (50 per design). For each design question, a different experiment was created keeping the experiment short and simple, as recommended for online experiments. Each experiment included scatterplots showing two strong positive (r > 0,8 and r > 0,6) as well as two strong negative correlations (r < -0,6, r > −0,8), two weak positive (r > 0,2) and two weak negative correlations (r < −0,2), as well as four scatterplots showing no correlation (−0,2 < r > 0,2).

Procedure:

Participants were asked to perform an online experiment using Lime Survey. The experiment started by explaining how to assess correlations by reading scatterplots. To check for the participant’s attention, they had to answer six questions on the presented content. Experimental tasks were presented in randomized order. Assessment was based on TA (task accuracy) which is measured by using the deviation of the participant’s estimate to the true score of the presented correlation. Lastly, a self-assessment on previous experience and question on the demography of the participants were presented.

Design Question 1 and Stimuli Material:

Should a regression-line be used to indicate the direction and the strength of the correlation coefficient? (Fig. 3).

Fig. 3.
figure 3

Stimuli material design question 1 (left: default; right: trendline)

Design Question 2 and Stimuli Material:

Should a monochrome or a multi-color scheme be used to differentiate identified data clusters? (Fig. 4).

Fig. 4.
figure 4

Stimuli material design question 2 (left: default, right color scheme mono-color)

Results:

Table 6 demonstrates, that assessing positive correlations is easier than assessing negative ones. Further, it is clearly indicated that a regression-line stressing the type (positive or negative) and the magnitude of the correlation increases its assessment. With respect to the color scheme used, no clear evidence in terms of TA could be found, nonetheless, participants preferred the multi-color scheme.

Table 6. Overview results pre-study 1

The results of this study are applied to design the scatterplots used in the scatterplot matrix in pre-study 2 as well as the main experiment presented in this paper.

1.2 Appendix B: Pre-study 2 – Scatterplot Matrix Design

Study Design:

The same five questions and the same data set were used in the pre-study as in the main experiment. Participants (N = 20), recruited at the University of Applied Sciences Upper Austria, educated in business administration were recruited but for this study a different group was used compared to the main experiment. Stimuli material was similar, but targeted towards answering two particular design question: (1) the design of the diagonal (a scatterplot vs. using a histogram) and (2) the design of the bottom left corner (scatterplots vs. the corresponding correlation coefficient).

Procedure:

Participants were asked to perform an online experiment, which was sent out with a link and designed using Lime Survey. The experiment started by explaining how to assess correlations and how to derive information on the distribution of a variable by reading scatterplots and histograms. To check for the participant’s attention, they had to answer questions on the presented content (six concerning a single scatterplot and how to assess correlations; four concerning the distribution of a variable; and four concerning the scatterplot matrix). After informing participants on the content of the study, experimental tasks were presented in randomized order. Assessment is based on TA (task accuracy) and TT (task time). Lastly, a self-assessment on previous experience and question on the demography of the participants were presented.

As experience is of utmost importance when analyzing information in visualized form and self-assessment has proven to be inefficient in the past [24], the experiment was repeated one week after the first trial to check for improvement through learning and repetition. One student did not participate in the second trial and one student did not pass the validity questions at the beginning of the experiment and thus, these two were excluded from analysis.

Design Question 1 and Stimuli Material:

Should the area beneath the diagonal (bottom left) also include scatterplots or should the respective correlation coefficient be presented instead? In addition to the correlation coefficient, also a heatmap representing the strength of the correlation was included as it has proven to increase efficiency and effectiveness in tabular representations (Fig. 5).

Fig. 5.
figure 5

Stimuli material design question 1

Results:

Tasks targeted towards assessing one specific correlation as well as counting the amount of positive or negative correlations presented within the matrix (tasks 1–3) were significantly better supported by using the second design (correlation coefficient in the bottom left corner). TA was significantly higher when correlation coefficients were presented. Additionally, the effect size (eta2) is stronger for the tasks targeted towards counting the amount of positive and negative correlations. Supporting previous research, showing correlation coefficients is more important for counting negative correlations, as those are harder to assess (see task 3).

Further, TA was higher in the second trial indicating participants did increase their effectiveness with repetition. As expected, no effects were visible for the tasks targeted towards assessing the variable’s distribution (tasks 4–5). TT shows significance with respect to the repetition of the trial for all posed tasks, but no direct effect can be detected concerning the design of the area in the bottom left corner. However, when analyzing the significant interaction effects, design 2 increased the participant’s efficiency in the second trial indicating better decision-support after getting used to its design (Table 7).

Table 7. Overview on repeated measures ANOVA on matrix design (bottom left)

Design Question 2 and Stimuli Material:

The second design questions was targeted towards identifying the best information to be presented in the diagonal. Identified options form previous literature as well as published options on visualization platforms highlight three possibilities (Fig. 6):

Fig. 6.
figure 6

Stimuli material design question 2 (left: scatterplot; middle: histogram including normal distribution curve; right: histogram on identified clusters within the dataset)

Results:

For the tasks targeted towards assessing a variable’s distribution, no direct effect on TA could be found. Only when examining interaction effects, differences on the improvement based on repetition can be found: performance using the first histogram design (hole dataset and normal distribution curve) improved, while performance using the second design decreased. Using a correlation coefficient shows the lowest TA. With respect to TT, the histogram representing cluster information significantly increased efficiency for task 4, which asked for an assessment of cluster information (Table 8).

Table 8. Overview on repeated measures ANOVA on design of the diagonal

1.3 Appendix C: Second-Order Formative Construct – Situational CL

For analysis of the second order formative construct we used SmartPLS. We applied the repeated measures approach to analyze the latent variable of situational cognitive load, while we consider balanced indicators for the first order constructs as suggested by Perkhofer and Lehner [13] (Fig. 7).

Fig. 7.
figure 7

Second-order formative construct to measure situational CL

Situational cognitive load is calculated during information processing, using an adaptive baseline to focus not only on the user and his/her capabilities but also on the presented stimulus. The variable is supposed to indicate a mismatch between the presented visualization and the needs of the decision maker, as it indicates increased cognitive resources by focusing on the duration of events [25]. Saccade and blink duration are suspected to be decelerated as the cognitive burden increases, and the size of pupil diameter is said to increase [13, 14, 26,27,28]. Further, by considering measures on blink and saccade count, also the need to search through bad design (not instantly locating the information) is accounted for [26, 29, 30]. Both, information on count measures (highly correlated with time) and information on the duration of events (correlated with the complexity of the material and task accuracy), explain situational cognitive load and thus increase understanding of good or bad design depending on the decision-maker and his/her capabilities.

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Perkhofer, L., Hofer, P. (2021). Optimizing Scatterplot-Matrices for Decision-Support:. In: Davis, F.D., Riedl, R., vom Brocke, J., Léger, PM., Randolph, A.B., Müller-Putz, G. (eds) Information Systems and Neuroscience. NeuroIS 2021. Lecture Notes in Information Systems and Organisation, vol 52. Springer, Cham. https://doi.org/10.1007/978-3-030-88900-5_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-88900-5_8

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-88899-2

  • Online ISBN: 978-3-030-88900-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics