Abstract
Using the technique of multidimensional scaling the paper demonstrates a method of visualizing a configuration of classes as it is perceived by a classifier. The methodology serves to assist the analysis of multi-class classification problems, where the final result of averaged accuracy or averaged error is not sufficient. The approach may be used to control and tune different classifiers applied to the same data set or a single classifier applied to different data sets. The results of such analyses may then be used for identifying the combinations of classes that proved to be worst recognized.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bradley A. P (1997) ‘The use of the area under the ROC curve in the evaluation of machine learning algorithms’. Pattern Recognition, 30 (7), 1145–1159.
Buckland M. K., Gey F. (1994) ‘The relationship between Recall and Precision’. Journal of the American Society for Information Science, 45 (1), 12–19.
Diettrich T.G., Bakiri G. (1995) ‘Solving multiclass learning problems via error-correcting output codes’. Journal of Artificial Intelligence Research, 2, 263–286.
Egan J. P. (1975) Signal Detection Theory and ROC Analysis. Series in Cognitition and Perception. Academic Press, New York.
Friendly M. (1994) ‘Mosaic displays for multi-way contingency tables’. Journal of the American Statistical Association, 89, 190–200.
Gordon M. D., Kochen M. (1989) ‘Recall-precision trade-off: a derivation’. Journal of the American Society for Information Science, 40, 145–151.
Hanley J. A., McNeil B. J. (1982) ‘The meaning and use of the area under a receiver operating characteristic (ROC) curve’. Radiology, 143, 29–36.
Hartigan J.A., Kleiner B. (1981) ‘Mosaics for contingency tables’. Computer Science and Statistics: Proc. of the 13th Symposium on the Interface, 268–273.
Hofmann H. (2000) ‘Exploring categorical data: interactive mosaic plots’. Metrika, 51 (1), 11–26.
Kohonen T. (1995) Self-Organizing Maps. Springer-Heidelberg Verlag.
Kruskal J. B. (1964) ‘Multidimensional scaling by optimizing goodness of fit to a nonmetric hypothesis’. Psychometrika, 29, 1–27.
Lukasik E., Susmaga R. (2003) Phoneme, gender and speaker variability visualization in voiceless stop consonants. Proc. of the IEEE Signal Processing Workshop (Circuits and Systems), Poznan, Poland.
Provost F., Fawcett T. (1997) ‘Analysis and visualization of classifier performance: Comparison under imprecise class and cost distributions’. Proc. of the Third International Conference on Knowledge Discovery and Data Mining, Menlo Park, CA, AAAI Press, 43–48.
Sammon J. W. Jr. (1969) ‘A nonlinear mapping for data structure analysis’. IEEE Transactions on Computers, 18, 401–409.
Schiffman S. (1981) Introduction to Multidimensional Scaling: Theory, Methods, and Applications. San Diego and London Academic Press.
Stone M. (1974) ‘Cross-validatory choice and assessment of statistical predictions’. Journal of the Royal Statistical Society, 36, 111–147.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Susmaga, R. (2004). Confusion Matrix Visualization. In: Kłopotek, M.A., Wierzchoń, S.T., Trojanowski, K. (eds) Intelligent Information Processing and Web Mining. Advances in Soft Computing, vol 25. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39985-8_12
Download citation
DOI: https://doi.org/10.1007/978-3-540-39985-8_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-21331-4
Online ISBN: 978-3-540-39985-8
eBook Packages: Springer Book Archive