Parallel Perceptrons, Activation Margins and Imbalanced Training Set Pruning

Cantador, Iván; Dorronsoro, José R.

doi:10.1007/11492542_6

Iván Cantador¹⁹ &
José R. Dorronsoro¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 3523))

Included in the following conference series:

Iberian Conference on Pattern Recognition and Image Analysis

1607 Accesses
6 Citations

Abstract

A natural way to deal with training samples in imbalanced class problems is to prune them removing redundant patterns, easy to classify and probably over represented, and label noisy patterns that belonging to one class are labelled as members of another. This allows classifier construction to focus on borderline patterns, likely to be the most informative ones. To appropriately define the above subsets, in this work we will use as base classifiers the so–called parallel perceptrons, a novel approach to committee machine training that allows, among other things, to naturally define margins for hidden unit activations. We shall use these margins to define the above pattern types and to iteratively perform subsample selections in an initial training set that enhance classification accuracy and allow for a balanced classifier performance even when class sizes are greatly different.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

ASTra: A Novel Algorithm-Level Approach to Imbalanced Classification

Perceptron: An Old Folk Song Sung on a New Stage

Classification of Binary Imbalanced Data Using A Bayesian Ensemble of Bayesian Neural Networks

References

Auer, P., Burgsteiner, H., Maass, W.: Reducing Communication for Distributed Learning in Neural Networks. In: Dorronsoro, J.R. (ed.) ICANN 2002. LNCS, vol. 2415, pp. 123–128. Springer, Heidelberg (2002)
Chapter Google Scholar
Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J.: Classification and Regression Trees. Wadsworth (1983)
Google Scholar
Chawla, N., Bowyer, K., Hall, L., Kegelmeyer, W.: SMOTE: Synthetic Minority Oversampling Technique. Journal of Artificial Intelligence Research 16, 321–357 (2002)
MATH Google Scholar
Dorronsoro, J., Ginel, F., Sánchez, C., Santa Cruz, C.: Neural Fraud Detection in Credit Card Operations. IEEE Transactions on Neural Networks 8, 827–834 (1997)
Article Google Scholar
Fawcett, T., Provost, F.: Adaptive Fraud Detection. Journal of Data Mining and Knowledge Discovery 1, 291–316 (1997)
Article Google Scholar
Freund, Y.: Boosting a weak learning algorithm by majority. Information and Computation 121, 256–285 (1995)
Article MATH MathSciNet Google Scholar
Kubat, M., Matwin, S.: Addressing the Curse of Imbalanced Training Sets: One- Sided Selection. In: Proceedings of the 14th International Conference on Machine Learning, ICML 1997, Nashville, TN, U.S.A., pp. 179–186 (1997)
Google Scholar
Maloof, M.A.: Learning when data sets are imbalanced and when costs are unequal and unknown. In: ICML-2003 Workshop on Learning from Imbalanced Data Sets II (2003)
Google Scholar
Murphy, P., Aha, D.: UCI Repository of Machine Learning Databases, Tech. Report, University of Califonia, Irvine (1994)
Google Scholar
Nilsson, N.: The Mathematical Foundations of Learning Machines. Morgan Kaufmann, San Francisco (1990)
MATH Google Scholar
Swets, J.A.: Measuring the accuracy of diagnostic systems. Science 240, 1285–1293 (1998)
Article MathSciNet Google Scholar
Weiss, G.M., Provost, F.: The effect of class distribution on classifier learning, Technical Report ML-TR 43, Department of Computer Science, Rutgers University (2001)
Google Scholar

Download references

Author information

Authors and Affiliations

Dpto. de Ingeniería Informática and Instituto de Ingeniería del Conocimiento, Universidad Autónoma de Madrid, 28049, Madrid, Spain
Iván Cantador & José R. Dorronsoro

Authors

Iván Cantador
View author publications
You can also search for this author in PubMed Google Scholar
José R. Dorronsoro
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Instituto Superior Técnico & Instituto de Sistemas e Robótica,, 1049-001, Lisboa, Portugal
Jorge S. Marques
ETSI Informática y e Telecomunicación, University of Granada, 18071, Granada, Spain
Nicolás Pérez de la Blanca
Instituto Superior Técnico, CERENA-Centro de Recursos Naturais e Ambiente, Av. Rovisco Pais, 1049-001, Lisboa, Portugal
Pedro Pina

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Cantador, I., Dorronsoro, J.R. (2005). Parallel Perceptrons, Activation Margins and Imbalanced Training Set Pruning. In: Marques, J.S., Pérez de la Blanca, N., Pina, P. (eds) Pattern Recognition and Image Analysis. IbPRIA 2005. Lecture Notes in Computer Science, vol 3523. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11492542_6

Download citation

DOI: https://doi.org/10.1007/11492542_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26154-4
Online ISBN: 978-3-540-32238-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Parallel Perceptrons, Activation Margins and Imbalanced Training Set Pruning

Abstract

Access this chapter

Preview

Similar content being viewed by others

ASTra: A Novel Algorithm-Level Approach to Imbalanced Classification

Perceptron: An Old Folk Song Sung on a New Stage

Classification of Binary Imbalanced Data Using A Bayesian Ensemble of Bayesian Neural Networks

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Parallel Perceptrons, Activation Margins and Imbalanced Training Set Pruning

Abstract

Access this chapter

Preview

Similar content being viewed by others

ASTra: A Novel Algorithm-Level Approach to Imbalanced Classification

Perceptron: An Old Folk Song Sung on a New Stage

Classification of Binary Imbalanced Data Using A Bayesian Ensemble of Bayesian Neural Networks

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation