Weak Metric Learning for Feature Fusion towards Perception-Inspired Object Recognition

Li, Xiong; Zhao, Xu; Fu, Yun; Liu, Yuncai

doi:10.1007/978-3-642-11301-7_29

Xiong Li²¹,
Xu Zhao²¹,
Yun Fu²² &
…
Yuncai Liu²¹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5916))

Included in the following conference series:

International Conference on Multimedia Modeling

2134 Accesses

Abstract

With extracted local features of a given image, computing its global feature under perceptual framework has shown promising performance in object recognition. However, under some tough applications with large intra-class variance, using only one kind of local feature is inadequate to build a robust classification system. To integrate the discriminability of complementary local features, in this paper, we extend the efficacy of perceptual framework to adapt to heterogeneous features. Given multiple raw global features, we propose a fusion strategy through metric learning, which is called weak metric learning in this work, for fusing high dimensional features. The fusion model is solved with the maximal kernel canonical correlation formulation with the multiple global features as outputs. Experimental results show that our method achieves significant improvements about 5% to 11% than the benchmark perceptual framework system, HMAX, on several difficult categories of object recognition with much less training samples and feature elements.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Fei-Fei, L., Fergus, R., Perona., P.: Learning generative visual models from few training examples: an incremental Bayesian approach tested on 101 object categories. In: IEEE CVPR, Workshop on Generative-Model Based Vision (2004)
Google Scholar
Frome, A., Singer, Y., Malik, J.: Image retrieval and classification using local distance functions. In: NIPS (2007)
Google Scholar
Serre, T., Wolf, L., Bileschi, S., Riesenhuber, M., Poggio, T.: Robust object recognition with cortex-like mechanisms. IEEE TPAMI 29(3), 411–426 (2007)
Google Scholar
Rosch, E.: Natural Categories. Cognitive Psychology 4(3), 328–350 (1973)
Article Google Scholar
Riesenhuber, M., Poggio, T.: Hierarchical models of object recognition in cortex. Nature Neuroscience 2, 1019–1025 (1999)
Article Google Scholar
Schultz, M., Joachims, T.: Learning a distance metric from relative comparisons. In: NIPS (2004)
Google Scholar
Zhang, H., Berg, A., Maire, M., Malik, J.: SVM-KNN: Discriminative nearest neighbor classification for visual category recognition. In: IEEE CVPR (2006)
Google Scholar
Lowe, D.: Object recognition from local scale-invariant features. In: IEEE ICCV (1999)
Google Scholar
Fu, Y., Cao, L., Guo, G., Huang, T.S.: Multiple feature fusion by subspace learning. In: ACM CIVR, pp. 127–134 (2008)
Google Scholar
Lin, Y., Liu, T., Fuh, C.: Dimensionality Reduction for Data in Multiple Feature Representations. In: NIPS (2008)
Google Scholar
Lai, P., Fyfe, C.: Kernel and nonlinear canonical correlation analysis. International Journal of Neural Systems 10(5), 365–378 (2000)
Google Scholar
Hardoon, D., Szedmak, S., Shawe-Taylor, J.: Canonical correlation analysis: an overview with application to learning methods. Neural Computation 16(12), 2639–2664 (2004)
Article MATH Google Scholar
Belongie, S., Malik, J., Puzicha, J.: Shape matching and object recognition using shape contexts. IEEE TPAMI, 509–522 (2002)
Google Scholar
Berg, A., Malik, J.: Geometric blur and template matching. In: IEEE CVPR (2001)
Google Scholar
Lindeberg, T.: Scale-space: A framework for handling image structures at multiple scales. European Organization for Nuclear Research-Reports-CERN, 27–38 (1996)
Google Scholar
Fu, Y., Yan, S., Huang, T.: Correlation metric for generalized feature extraction. IEEE TPAMI, 2229–2235 (2008)
Google Scholar
Kim, T., Kittler, J., Cipolla, R.: Discriminative learning and recognition of image set classes using canonical correlations. IEEE TPAMI 29(6), 1005–1018 (2007)
Google Scholar
Sun, Q., Zeng, S., Liu, Y., Heng, P., Xia, D.: A new method of feature fusion and its application in image recognition. Pattern Recognition 38(12), 2437–2448 (2005)
Article Google Scholar
Matas, J., Chum, O., Urban, M., Pajdla, T.: Robust wide-baseline stereo from maximally stable extremal regions. Image and Vision Computing 22(10), 761–767 (2004)
Article Google Scholar
Mikolajczyk, K., Schmid, C.: An affine invariant interest point detector. In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002. LNCS, vol. 2350, pp. 128–142. Springer, Heidelberg (2002)
Chapter Google Scholar
Mikolajczyk, K., Tuytelaars, T., Schmid, C., Zisserman, A., Matas, J., Schaffalitzky, F., Kadir, T., Gool, L.: A comparison of affine region detectors. International Journal of Computer Vision 65(1), 43–72 (2005)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Image Processing & Pattern Recognition, Shanghai Jiao Tong University, Shanghai, 200240, China
Xiong Li, Xu Zhao & Yuncai Liu
Department of CSE, University at Buffalo (SUNY), NY, 14260, USA
Yun Fu

Authors

Xiong Li
View author publications
You can also search for this author in PubMed Google Scholar
Xu Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Yun Fu
View author publications
You can also search for this author in PubMed Google Scholar
Yuncai Liu
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

University of Oldenburg, Germany
Susanne Boll
University of Texas at San Antonio,, TX, San Antonio, USA
Qi Tian
Microsoft Research Asia, Beijing, P.R. China
Lei Zhang
Southwest University, Beibei, Chongqing, China
Zili Zhang
School of Engineering and Information Technology, Deakin University, 221 Burwood Highway, Vic, 3125, Australia
Yi-Ping Phoebe Chen

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Li, X., Zhao, X., Fu, Y., Liu, Y. (2010). Weak Metric Learning for Feature Fusion towards Perception-Inspired Object Recognition. In: Boll, S., Tian, Q., Zhang, L., Zhang, Z., Chen, YP.P. (eds) Advances in Multimedia Modeling. MMM 2010. Lecture Notes in Computer Science, vol 5916. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-11301-7_29

Download citation

DOI: https://doi.org/10.1007/978-3-642-11301-7_29
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-11300-0
Online ISBN: 978-3-642-11301-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics