Abstract
Deep learning is a promising approach to extracting useful nonlinear representations of data. However, it is usually applied to large training sets, which are not always available in practical tasks. In this paper, we consider stacked autoencoders with logistic regression as the classification layer and study their usefulness for image categorization as a function of training set size. Hand-crafted image descriptors are proposed and used, in addition to pixel-level features, for training the autoencoders. A new multi-column architecture for autoencoders is also proposed. Our experiments show that autoencoders (stacked or not) learn useful nonlinear features only on large training sets, although they can still yield gains on small training sets through redundancy reduction. Practically useful results (a 9.1% error rate for 6 classes) were achieved only with hand-crafted features, on a training set of 4800 images.
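The pipeline the abstract describes, unsupervised pretraining of an autoencoder followed by a logistic-regression classification layer on the learned code, can be sketched as follows. This is a minimal illustration on toy data, not the authors' implementation: the layer sizes, learning rates, tied weights, and binary (rather than 6-class) classifier are all assumptions made for brevity.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

class Autoencoder:
    """Single autoencoder layer with tied weights, trained by batch gradient
    descent on squared reconstruction error. Stacking repeats this greedily:
    train a layer, encode the data, train the next layer on the codes."""
    def __init__(self, n_in, n_hidden, seed=0):
        rng = np.random.default_rng(seed)
        self.W = rng.normal(0.0, 0.1, (n_in, n_hidden))  # tied encoder/decoder weights
        self.b = np.zeros(n_hidden)                      # hidden-layer bias
        self.c = np.zeros(n_in)                          # reconstruction bias

    def encode(self, X):
        return sigmoid(X @ self.W + self.b)

    def fit(self, X, lr=0.5, epochs=300):
        n = len(X)
        for _ in range(epochs):
            H = self.encode(X)                        # hidden code
            R = sigmoid(H @ self.W.T + self.c)        # reconstruction of X
            dR = (R - X) * R * (1.0 - R)              # squared-error delta at output
            dH = (dR @ self.W) * H * (1.0 - H)        # delta backpropagated to hidden
            self.W -= lr * (X.T @ dH + dR.T @ H) / n  # gradient for the tied weights
            self.b -= lr * dH.mean(axis=0)
            self.c -= lr * dR.mean(axis=0)
        return self

def train_logreg(H, y, lr=0.5, epochs=500):
    """Logistic-regression classification layer on the learned features (binary case)."""
    w, b = np.zeros(H.shape[1]), 0.0
    for _ in range(epochs):
        p = sigmoid(H @ w + b)
        w -= lr * H.T @ (p - y) / len(y)
        b -= lr * (p - y).mean()
    return w, b

# Toy stand-in for image features: two well-separated classes in [0, 1]^8.
rng = np.random.default_rng(1)
X = np.vstack([rng.uniform(0.0, 0.3, (50, 8)), rng.uniform(0.7, 1.0, (50, 8))])
y = np.r_[np.zeros(50), np.ones(50)]

ae = Autoencoder(n_in=8, n_hidden=4).fit(X)      # unsupervised pretraining
H = ae.encode(X)                                 # compressed nonlinear features
w, b = train_logreg(H, y)                        # supervised classification layer
acc = np.mean((sigmoid(H @ w + b) > 0.5) == y)   # training accuracy
```

On real image data the encoder input would be pixel-level features or the hand-crafted descriptors the paper proposes, and the final layer would be multinomial rather than binary logistic regression.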
Copyright information
© 2014 IFIP International Federation for Information Processing
Cite this paper
Potapov, A., Batishcheva, V., Peterson, M. (2014). Limited Generalization Capabilities of Autoencoders with Logistic Regression on Training Sets of Small Sizes. In: Iliadis, L., Maglogiannis, I., Papadopoulos, H. (eds) Artificial Intelligence Applications and Innovations. AIAI 2014. IFIP Advances in Information and Communication Technology, vol 436. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-44654-6_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-44653-9
Online ISBN: 978-3-662-44654-6