Convolutional Neural Networks Features: Principal Pyramidal Convolution

Guo, Yanming; Lao, Songyang; Liu, Yu; Bai, Liang; Liu, Shi; Lew, Michael S.

doi:10.1007/978-3-319-24075-6_24

Yanming Guo^18,19,
Songyang Lao¹⁹,
Yu Liu¹⁸,
Liang Bai¹⁹,
Shi Liu²⁰ &
…
Michael S. Lew¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9314))

Included in the following conference series:

Pacific Rim Conference on Multimedia

1920 Accesses
1 Citations

Abstract

The features extracted from convolutional neural networks (CNNs) are able to capture the discriminative part of an image and have shown superior performance in visual recognition. Furthermore, it has been verified that the CNN activations trained from large and diverse datasets can act as generic features and be transferred to other visual recognition tasks. In this paper, we aim to learn more from an image and present an effective method called Principal Pyramidal Convolution (PPC). The scheme first partitions the image into two levels, and extracts CNN activations for each sub-region along with the whole image, and then aggregates them together. The concatenated feature is later reduced to the standard dimension using Principal Component Analysis (PCA) algorithm, generating the refined CNN feature. When applied in image classification and retrieval tasks, the PPC feature consistently outperforms the conventional CNN feature, regardless of the network type where they derive from. Specifically, PPC achieves state-of-the-art result on the MIT Indoor67 dataset, utilizing the activations from Places-CNN.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Zeiler, M.D., Fergus, R.: Visualizing and understanding convolutional networks. In: ECCV (2014)
Google Scholar
Liu, Y., Guo, Y., Wu, S., Lew, M.S.: DeepIndex for accurate and efficient image retrieval. In: ICMR (2015)
Google Scholar
Girshick, R., Donahue, J., Darrell, T., et al.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: CVPR (2014)
Google Scholar
Thomee, B., Lew, M.S.: Interactive search in image retrieval: a survey. Int. J. Multimedia Inf. Retrieval 1(2), 71–86 (2012)
Article Google Scholar
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vision 60(2), 91–110 (2004)
Article Google Scholar
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: CVPR (2005)
Google Scholar
Deng, J., Dong, W., Socher, R., et al.: Imagenet: a large-scale hierarchical image database. In: CVPR (2009)
Google Scholar
Zhou, B., Lapedriza, A., Xiao, J., et al.: Learning deep features for scene recognition using places database. In: NIPS (2014)
Google Scholar
Seber, G.A.F.: Multivariate observations. Wiley, New York (2009)
MATH Google Scholar
Oquab, M., Bottou, L., Laptev, I., et al.: Learning and transferring mid-level image representations using convolutional neural networks. In: CVPR (2014)
Google Scholar
Philbin, J., Chum, O., Isard, M., et al.: Object retrieval with large vocabularies and fast spatial matching. In: CVPR (2007)
Google Scholar
Yosinski, J., Clune, J., Bengio, Y., et al.: How transferable are features in deep neural networks? In: NIPS (2014)
Google Scholar
Gong, Y., Wang, L., Guo, R., et al.: Multi-scale orderless pooling of deep convolutional activation features. In: ECCV (2014)
Google Scholar
Koskela, M., Laaksonen, J.: Convolutional network features for scene recognition. In: ACM Multimedia (2014)
Google Scholar
Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: CVPR (2006)
Google Scholar
Yang, J., Yu, K., Gong, Y., et al.: Linear spatial pyramid matching using sparse coding for image classification. In: CVPR (2009)
Google Scholar
Wang, J., Yang, J., Yu, K., et al.: Locality-constrained linear coding for image classification. In: CVPR (2010)
Google Scholar
Jolliffe, I.: Principal Component Analysis. Wiley Online Library, New York (2005)
Book MATH Google Scholar
Jia, Y., Shelhamer, E., Donahue, J., et al.: Caffe: convolutional architecture for fast feature embedding. In: ACM Multimedia (2014)
Google Scholar
Fei-Fei, L., Fergus, R., Perona, P.: Learning generative visual models from few training examples: an incremental bayesian approach tested on 101 object categories. In: CVIU (2007)
Google Scholar
Quattoni, A., Torralba, A.: Recognizing indoor scenes. In: CVPR (2009)
Google Scholar
Jegou, H., Douze, M., Schmid, C.: Hamming embedding and weak geometric consistency for large scale image search. In: ECCV (2008)
Google Scholar
Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines. In: ACM TIST (2011)
Google Scholar

Download references

Author information

Authors and Affiliations

LIACS Media Lab, Leiden University, Niels Bohrweg, 1, Leiden, The Netherlands
Yanming Guo, Yu Liu & Michael S. Lew
Science and Technology on Information Systems Engineering Laboratory, National University of Defense Technology, Changsha, China
Yanming Guo, Songyang Lao & Liang Bai
School of Arts and Media, Beijing Normal University, Beijing, China
Shi Liu

Authors

Yanming Guo
View author publications
You can also search for this author in PubMed Google Scholar
Songyang Lao
View author publications
You can also search for this author in PubMed Google Scholar
Yu Liu
View author publications
You can also search for this author in PubMed Google Scholar
Liang Bai
View author publications
You can also search for this author in PubMed Google Scholar
Shi Liu
View author publications
You can also search for this author in PubMed Google Scholar
Michael S. Lew
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yanming Guo .

Editor information

Editors and Affiliations

Gwangju Institute of Science and Technology, Gwangju, Korea (Republic of)
Yo-Sung Ho
Chinese Academy of Sciences, Institute of Automation, Beijing, China
Jitao Sang
ICU, IVY Lab, KAIST, Daejeon, Korea (Republic of)
Yong Man Ro
KAIST, Daejeon, Korea (Republic of)
Junmo Kim
College of Computer Science, Zhejiang University, Hangzhou, China
Fei Wu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Guo, Y., Lao, S., Liu, Y., Bai, L., Liu, S., Lew, M.S. (2015). Convolutional Neural Networks Features: Principal Pyramidal Convolution. In: Ho, YS., Sang, J., Ro, Y., Kim, J., Wu, F. (eds) Advances in Multimedia Information Processing -- PCM 2015. PCM 2015. Lecture Notes in Computer Science(), vol 9314. Springer, Cham. https://doi.org/10.1007/978-3-319-24075-6_24

Download citation

DOI: https://doi.org/10.1007/978-3-319-24075-6_24
Published: 22 November 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-24074-9
Online ISBN: 978-3-319-24075-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics