Abstract
We here address the problem of scale and orientation invariant object recognition, making use of a correspondence-based mechanism, in which the identity of an object represented by sensory signals is determined by matching it to a representation stored in memory. The sensory representation is in general affected by various transformations, notably scale and rotation, thus giving rise to the fundamental problem of invariant object recognition. We focus here on a neurally plausible mechanism that deals simultaneously with identification of the object and detection of the transformation, both types of information being important for visual processing. Our mechanism is based on macrocolumnar units. These evaluate identity- and transformation-specific feature similarities, performing competitive computation on the alternatives of their own subtask, and cooperate to make a coherent global decision for the identity, scale and rotation of the object.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Serre, T., Wolf, L., Bileschi, S., Riesenhuber, M., Poggio, T.: Robust Object Recognition with Cortex-Like Mechanisms. IEEE Transactions on Pattern Analysis and Machine Intelligence 29(3), 411–426 (2007)
Rosenblatt, F.: Principles of Neurodynamics: Perceptrons and the Theory of Brain Mechanisms. Spartan Books, Washington (1961)
Gabor, D.: Theory of Communication. J. IEE 93, 429–459 (1946)
Daugmann, J.G.: Uncertainty relation for resolution in space, spatial frequency, and orientation optimized by two-dimensional visual cortical filters. J. Opt. Soc. Am. A 2, 1160–1169 (1985)
Jones, J., Palmer, L.: An evaluation of the two-dimensional Gabor filter model of simple receptive fields in cat striate cortex. J. Neurophysiol. 58, 1233–1258 (1987)
Lowe, D.G.: Distinctive Image Features from Scale-Invariant Keypoints. International Journal of Computer Vision 60(2), 91–110 (2004)
Arathorn, D.W.: Map-Seeking circuits in Visual Cognition — A Computational Mechanism for Biological and Machine Vision. Standford Univ. Press, Stanford (2002)
Serre, T., Kouh, M., Cadieu, C., Knoblich, U., Kreiman, G., Poggio, T.: A Theory of Object Recognition: Computations and Circuits in Feedforwad Path of the Ventral Stream in Primate Visual Cortex. AI Memo 2005-036/CBCL Memo 259. Massachusetts Inst. of Technology, Cambridge (2005)
Sato, Y.D., Wolff, C., Wolfrum, P., von der Malsburg, C.: Dynamic Link Matching between Feature Columns for Different Scale and Orientation. In: Ishikawa, M., Doya, K., Miyamoto, H., Yamakawa, T. (eds.) ICONIP 2007, Part I. LNCS, vol. 4984, pp. 385–394. Springer, Heidelberg (2008)
Jitsev, J., Sato, Y.D., von der Malsburg, C.: A Neural System for Scale and Orientation Invariant Correspondence Finding. In: Proc. COSYNE 2008, Salt Lake City, Utah, USA (2008)
von der Malsburg, C.: Dynamic Link Architecture. In: The handbook of brain theory and neural networks, pp. 329–331. MIT Press, Cambridge (1998)
Wiskott, L., Fellous, J.-M., Krüger, N., von der Malsburg, C.: Face recognition by elastic bunch graph matching. IEEE Transactions on Pattern Analysis and Machine Intelligence 19(7), 775–779 (1997)
Lücke, J., von der Malsburg, C.: Rapid Correspondence Finding in Networks of Cortical Columns. In: Kollias, S.D., Stafylopatis, A., Duch, W., Oja, E. (eds.) ICANN 2006. LNCS, vol. 4131, pp. 668–677. Springer, Heidelberg (2006)
Freedman, D.J., Riesenhuber, M., Poggio, T., Miller, E.K.: Categorical representation of visual stimuli in the primate prefrontal cortex. Science 291, 312–316 (2001)
Freedman, D.J., Riesenhuber, M., Poggio, T., Miller, E.K.: Visual categorization and the primate prefrontal cortex: Neurophysiology and behavior. J. Neurophys. 88, 930–942 (2002)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sato, Y.D., Jitsev, J., von der Malsburg, C. (2008). A Visual Object Recognition System Invariant to Scale and Rotation. In: Kůrková, V., Neruda, R., KoutnÃk, J. (eds) Artificial Neural Networks - ICANN 2008. ICANN 2008. Lecture Notes in Computer Science, vol 5163. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-87536-9_101
Download citation
DOI: https://doi.org/10.1007/978-3-540-87536-9_101
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-87535-2
Online ISBN: 978-3-540-87536-9
eBook Packages: Computer ScienceComputer Science (R0)