Abstract
Recent successes in the use of sparse coding for many computer vision applications have triggered the attention towards the problem of how an over-complete dictionary should be learned from data. This is because the quality of a dictionary greatly affects performance in many respects, including computational. While so far the focus has been on learning compact, reconstructive, and discriminative dictionaries, in this work we propose to retain the previous qualities, and further enhance them by learning a dictionary that is able to predict the contextual information surrounding a sparsely coded signal. The proposed framework leverages the K-SVD for learning, fully inheriting its benefits of simplicity and efficiency. A model of structured prediction is designed around this approach, which leverages contextual information to improve the combined recognition and localization of multiple objects from multiple classes within one image. Results on the PASCAL VOC 2007 dataset are in line with the state-of-the-art, and clearly indicate that this is a viable approach for learning a context aware dictionary for sparse representation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Elad, M., Aharon, M.: Image denoising via sparse and redundant representations over learned dictionaries. IEEE TIP 15, 3736–3745 (2006)
Mairal, J., Elad, M., Sapiro, G.: Sparse representation for color image restoration. IEEE TIP 17, 53–69 (2008)
Wright, J., Yang, A., Ganesh, A., Sastry, S., Ma, Y.: Robust face recognition via sparse representation. IEEE TPAMI 31, 210–227 (2009)
Yang, J., Yu, K., Gong, Y., Huang, T.: Linear spatial pyramid matching using sparse coding for image classification. In: IEEE CVPR, pp. 1794–1801 (2009)
Zhang, Q., Li, B.: Discriminative k-svd for dictionary learning in face recognition. In: IEEE CVPR, pp. 2691–2698 (2010)
Mairal, J., Bach, F., Ponce, J., Sapiro, G., Zisserman, A.: Discriminative learned dictionaries for local image analysis. In: IEEE CVPR, pp. 1–8 (2008)
Pham, D.S., Venkatesh, S.: Joint learning and dictionary construction for pattern recognition. In: IEEE CVPR, pp. 1–8 (2008)
Engan, K., Aase, S., Husoy, J.: Frame based signal compression using method of optimal directions (mod). In: IEEE ISCAS, vol. 4, pp. 1–4 (1999)
Aharon, M., Elad, M., Bruckstein, A.: K-svd: An algorithm for designing overcomplete dictionaries for sparse representation. IEEE TSP 54, 4311–4322 (2006)
Jiang, Z., Lin, Z., Davis, L.S.: Learning a discriminative dictionary for sparse coding via label consistent k-svd. In: IEEE CVPR, pp. 1697–1704 (2011)
Huang, H., Aviyiente, S.: Sparse representation for signal classification. In: NIPS (2007)
Boureau, Y.L., Bach, F., LeCun, Y., Ponce, J.: Learning mid-level features for recognition. In: IEEE CVPR, pp. 2559–2566 (2010)
Desai, C., Ramanan, D., Fowlkes, C.: Discriminative models for multi-class object layout. IJCV 95, 1–12 (2011)
Divvala, S., Hoiem, D., Hays, J., Efros, A., Hebert, M.: An empirical study of context in object detection. In: IEEE CVPR, pp. 1271–1278 (2009)
Felzenszwalb, P., Girshick, R., McAllester, D., Ramanan, D.: Object detection with discriminatively trained part-based models. IEEE TPAMI 32, 1627–1645 (2010)
Galleguillos, C., McFee, B., Belongie, S., Lanckriet, G.: Multi-class object localization by combining local contextual interactions. In: IEEE CVPR, pp. 113–120 (2010)
Rabinovich, A., Vedaldi, A., Galleguillos, C., Wiewiora, E., Belongie, S.: Objects in context. In: IEEE ICCV, pp. 1–8 (2007)
Torralba, A.: Contextual priming for object detection. IJCV 53, 169–191 (2003), 10.1023/A:1023052124951
Yao, B., Fei-Fei, L.: Modeling mutual context of object and human pose in human-object interaction activities. In: IEEE CVPR, pp. 17–24 (2010)
Sadeghi, M., Farhadi, A.: Recognition using visual phrases. In: IEEE CVPR, pp. 1745–1752 (2011)
Choi, M.J., Lim, J., Torralba, A., Willsky, A.: Exploiting hierarchical context on a large database of object categories. In: IEEE CVPR, pp. 129–136 (2010)
Park, D., Ramanan, D., Fowlkes, C.: Multiresolution Models for Object Detection. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part IV. LNCS, vol. 6314, pp. 241–254. Springer, Heidelberg (2010)
Mairal, J., Bach, F., Ponce, J., Sapiro, G., Zisserman, A.: Supervised dictionary learning. In: NIPS, CORR abs/0809.3083 (2008)
Yang, J., Yu, K., Huang, T.: Supervised translation-invariant sparse coding. In: IEEE CVPR, pp. 3517–3524 (2010)
Everingham, M., Gool, L., Williams, C.K., Winn, J., Zisserman, A.: The pascal visual object classes (voc) challenge. IJCV 88, 303–338 (2010)
Chen, S., Cowan, C., Grant, P.: Orthogonal least squares learning algorithm for radial basis function networks. IEEE TNN 2, 302–309 (1991)
Golub, G.H., Hansen, P.C., O’Leary, D.P.: Tikhonov regularization and total least squares. SIAM J. Matrix Anal. Appl. 21, 185–194 (1999)
Levy, A., Lindenbaum, M.: Sequential Karhunen-Loeve basis extraction and its application to images. IEEE TIP 2, 456–460 (1998)
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: IEEE CVPR, vol. 1, pp. 886–893 (2005)
van de Sande, K.E.A., Uijlings, J.R.R., Gevers, T., Smeulders, A.W.M.: Segmentation as selective search for object recognition. In: IEEE CVPR, pp. 1879–1886 (2011)
Everingham, M., Van Gool, L., Williams, C.K.I., Winn, J., Zisserman, A.: The PASCAL Visual Object Classes Challenge, VOC 2007 (2007), Results, http://www.pascal-network.org/challenges/VOC/voc2007/workshop/index.html
Felzenszwalb, P., McAllester, D., Ramanan, D.: A discriminatively trained, multiscale, deformable part model. In: IEEE CVPR, pp. 1–8 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Siyahjani, F., Doretto, G. (2013). Learning a Context Aware Dictionary for Sparse Representation. In: Lee, K.M., Matsushita, Y., Rehg, J.M., Hu, Z. (eds) Computer Vision – ACCV 2012. ACCV 2012. Lecture Notes in Computer Science, vol 7725. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37444-9_18
Download citation
DOI: https://doi.org/10.1007/978-3-642-37444-9_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37443-2
Online ISBN: 978-3-642-37444-9
eBook Packages: Computer ScienceComputer Science (R0)