Abstract
In this study we address the linear classification of noisy high-dimensional data in a two-class scenario. We assume that the cardinality of the data is much lower than its dimensionality; the presence of noise further intensifies the classification problem in this setting. Eleven linear classifiers were compared on 2150 artificial datasets from four different experimental setups, and on five real-world gene expression profile datasets, in terms of classification accuracy and robustness. We specifically focus on linear classifiers, as the use of more complex concept classes would make over-adaptation even more likely. Classification accuracy is measured by the mean error rate and the mean rank of the error rate. These criteria place two large-margin classifiers, SVM and ALMA, and an online classification algorithm called PA at the top, with PA being statistically different from SVM on the artificial data. Surprisingly, these algorithms also statistically significantly outperformed all investigated classifiers that employ dimensionality reduction.
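As an illustration of the online PA (Passive-Aggressive) algorithm mentioned above, the sketch below implements the basic PA update of Crammer et al. (2006) in NumPy: the learner stays passive when an example is classified with margin at least 1, and otherwise makes the minimal weight correction that achieves that margin. The function name `pa_update` and the toy data are our own; this is a minimal sketch, not the authors' implementation.

```python
import numpy as np

def pa_update(w, x, y):
    """One basic Passive-Aggressive update for a linear classifier.

    w: current weight vector, x: feature vector, y: label in {-1, +1}.
    Returns the (possibly unchanged) weight vector.
    """
    loss = max(0.0, 1.0 - y * np.dot(w, x))  # hinge loss on this example
    if loss > 0.0:
        tau = loss / np.dot(x, x)            # smallest step restoring margin 1
        w = w + tau * y * x                  # aggressive correction
    return w                                  # passive: no change if margin >= 1

# Toy usage: one pass over a small linearly separable stream.
rng = np.random.default_rng(0)
X = rng.normal(size=(20, 5))
y = np.sign(X[:, 0] + 1e-9)                  # label by the first coordinate
w = np.zeros(5)
for xi, yi in zip(X, y):
    w = pa_update(w, xi, yi)
```

After this single pass, `w` aligns with the first coordinate, the only feature that determines the labels in this toy stream.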
Keywords
- Support Vector Machine
- Linear Discriminant Analysis
- Robustness Analysis
- Error Curve
- Fisher Linear Discriminant Analysis
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lausser, L., Kestler, H.A. (2010). Robustness Analysis of Eleven Linear Classifiers in Extremely High–Dimensional Feature Spaces. In: Schwenker, F., El Gayar, N. (eds) Artificial Neural Networks in Pattern Recognition. ANNPR 2010. Lecture Notes in Computer Science(), vol 5998. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12159-3_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12158-6
Online ISBN: 978-3-642-12159-3
eBook Packages: Computer Science (R0)