On Generalization Ability of Cascaded Neural Net Architecture

Kamruzzaman, Joarder; Kumagai, Yukio; Hikita, Hiromitsu

doi:10.1007/978-1-4471-2063-6_296

Joarder Kamruzzaman²,
Yukio Kumagai³ &
Hiromitsu Hikita⁴

Included in the following conference series:

International Conference on Artificial Neural Networks

127 Accesses

Abstract

We present construction of a 3-layer feedforward network called cascaded network and make a performance comparison with the conventional Backpropagation network [1] in terms of generalization ability of the trained networks. Cascaded networks are trained to realize category classification employing binary input vectors and locally represented target output vectors. Empirical study in [2] shows that when a neural network is trained to classify binary input vectors into locally represented binary target output vectors, better saturation of hidden outputs in response to the training set yields better robustness of the network, and in conventional Backpropagation network hidden outputs usually do not get saturated in response to the training set. Based on this observation a 3-layer cascaded network is constructed by cascading two 2-layer networks trained independently by delta rule [3] whose intermediate layer can be viewed as hidden layer and is trained to attain preassigned saturated outputs in response to the training set. Each input-output pair is associated with a preassigned code. Each 2-layer network can thus be considered as a separate module; the first module is trained with the training pattern as the binary input signal and the corresponding preassigned code as the teacher signal. The second module is trained with this preassigned code as the input signal and the desired binary output of the corresponding training pattern as the teacher signal. Thus the first module maps the training patterns onto the preassigned codes and the second maps these preassigned codes onto the desired outputs. After cascading, outputs of the first module become inputs to the second module and thus the intermediate layer of the resultant 3-layer cascaded network can be considered as the hidden layer which is trained to attain predefined saturated outputs as the internal representation of the training set. For linearly separable tasks cascaded network can be built straightforwardly as described earlier, and for nonlinearly separable task, cascaded network is constructed by employing high order cross product inputs at the input layer. These high order cross product inputs are implemented as the mutually disjoint cross products of the components of the binary input vectors received from the training patterns. Experimenting with character recognition problems we demonstrate by simulation results that, for both linearly and nonlinearly separable tasks with binary input and locally represented binary target out put vectors, cascaded network yields generalization ability far better than that of Backpropagation network. The better performance of cascaded network is duo to the attainment of saturated hidden outputs in response to the training set.

Download to read the full chapter text

Chapter PDF

Backpropagation for Fully Connected Cascade Networks

Article 25 January 2017

Learning Rule for Linear Multilayer Feedforward ANN by Boosted Decision Stumps

Why Dose Layer-by-Layer Pre-training Improve Deep Neural Networks Learning?

References

D.E. Rumelhalt, J.L. McClelland and the PDP research Group, Parallel Distributed Processing, vol. 1, M.LT. Press, 1986.
Google Scholar
M. Hamamoto, J. Kamruzzaman, Y. Kumagai and H. Hikita, “Generalization ability of feedforward neural network trained by Fahlman and Lebiere’s learning algorithm,” IE1CE Trans. Fundamentals, Japan, vol. E75-A, no. 11, pp. 1597–1601, Nov. 1992.
Google Scholar
B. Widrow and M. E. Hoff, “Adaptive switching circuits,” IRE WESTCON Convention Record, part 4, pp. 96–104, 1960.
Google Scholar

Download references

Author information

Authors and Affiliations

Dept. of Electrical & Electronic Engg., Bangladesh Univ. of Engg. & Tech., Dhaka, 1000, Bangladesh
Joarder Kamruzzaman
Dept. of Comp. Science & Syst. Engg., Muroran Inst, of Tech., 27-1 Mizumoto-cho, Muroran 050, Japan
Yukio Kumagai
Dept. of Mechanical Syst. Engg., Muroran Inst, of Tech., 27-1 Mizumoto-cho, Muroran 050, Japan
Hiromitsu Hikita

Authors

Joarder Kamruzzaman
View author publications
You can also search for this author in PubMed Google Scholar
Yukio Kumagai
View author publications
You can also search for this author in PubMed Google Scholar
Hiromitsu Hikita
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dutch Foundation for Neural Networks, University of Nijmegen, Geert Grooteplein 21, 6525 EZ, Nijmegen, The Netherlands
Stan Gielen & Bert Kappen &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kamruzzaman, J., Kumagai, Y., Hikita, H. (1993). On Generalization Ability of Cascaded Neural Net Architecture. In: Gielen, S., Kappen, B. (eds) ICANN ’93. ICANN 1993. Springer, London. https://doi.org/10.1007/978-1-4471-2063-6_296

Download citation

DOI: https://doi.org/10.1007/978-1-4471-2063-6_296
Published: 10 April 2012
Publisher Name: Springer, London
Print ISBN: 978-3-540-19839-0
Online ISBN: 978-1-4471-2063-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

On Generalization Ability of Cascaded Neural Net Architecture

Abstract

Chapter PDF

Similar content being viewed by others

Backpropagation for Fully Connected Cascade Networks

Learning Rule for Linear Multilayer Feedforward ANN by Boosted Decision Stumps

Why Dose Layer-by-Layer Pre-training Improve Deep Neural Networks Learning?

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

On Generalization Ability of Cascaded Neural Net Architecture

Abstract

Chapter PDF

Similar content being viewed by others

Backpropagation for Fully Connected Cascade Networks

Learning Rule for Linear Multilayer Feedforward ANN by Boosted Decision Stumps

Why Dose Layer-by-Layer Pre-training Improve Deep Neural Networks Learning?

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation