Learning a Context Aware Dictionary for Sparse Representation

  • Conference paper
Computer Vision – ACCV 2012 (ACCV 2012)

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 7725)

Abstract

Recent successes in applying sparse coding to many computer vision problems have drawn attention to the question of how an over-complete dictionary should be learned from data, because the quality of a dictionary greatly affects performance in many respects, including computational cost. While the focus so far has been on learning compact, reconstructive, and discriminative dictionaries, in this work we propose to retain these qualities and further enhance them by learning a dictionary that can predict the contextual information surrounding a sparsely coded signal. The proposed framework leverages K-SVD for learning, fully inheriting its simplicity and efficiency. A structured prediction model is designed around this approach; it uses contextual information to improve the combined recognition and localization of multiple objects from multiple classes within one image. Results on the PASCAL VOC 2007 dataset are in line with the state of the art and clearly indicate that this is a viable approach for learning a context-aware dictionary for sparse representation.
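For readers unfamiliar with the K-SVD step the abstract builds on, the following is a minimal sketch of plain, unsupervised K-SVD with a greedy orthogonal matching pursuit coder. It is not the context-aware formulation proposed in the paper, whose objective is not given on this page. The function names `omp` and `ksvd`, the random initialization, and all parameter values are assumptions made for illustration only.

```python
import numpy as np


def omp(D, y, k):
    """Greedy orthogonal matching pursuit: approximate y with at most k atoms of D."""
    residual = y.copy()
    support = []
    coef = np.zeros(0)
    for _ in range(k):
        # Pick the atom most correlated with the current residual.
        j = int(np.argmax(np.abs(D.T @ residual)))
        if j in support:
            break  # no new atom helps; stop early
        support.append(j)
        # Re-fit all selected atoms jointly by least squares.
        coef, *_ = np.linalg.lstsq(D[:, support], y, rcond=None)
        residual = y - D[:, support] @ coef
    x = np.zeros(D.shape[1])
    x[support] = coef
    return x


def ksvd(Y, n_atoms, k, n_iter=10, seed=0):
    """Plain K-SVD sketch: alternate sparse coding and per-atom rank-1 updates.

    Y has one signal per column. Returns a dictionary D with unit-norm
    columns and a code matrix X with at most k nonzeros per column.
    """
    rng = np.random.default_rng(seed)
    # Assumption: random Gaussian initialization, normalized per atom.
    D = rng.standard_normal((Y.shape[0], n_atoms))
    D /= np.linalg.norm(D, axis=0)
    X = np.zeros((n_atoms, Y.shape[1]))
    for _ in range(n_iter):
        # Sparse coding stage: code every signal with the current dictionary.
        X = np.column_stack([omp(D, Y[:, i], k) for i in range(Y.shape[1])])
        # Dictionary update stage: refit one atom at a time.
        for j in range(n_atoms):
            users = np.flatnonzero(X[j])  # signals that use atom j
            if users.size == 0:
                continue  # unused atom is left unchanged in this sketch
            X[j, users] = 0.0
            # Error on those signals without the contribution of atom j.
            E = Y[:, users] - D @ X[:, users]
            U, s, Vt = np.linalg.svd(E, full_matrices=False)
            D[:, j] = U[:, 0]            # best rank-1 atom
            X[j, users] = s[0] * Vt[0]   # matching coefficients
    return D, X
```

A call such as `D, X = ksvd(Y, n_atoms=12, k=3)` on a matrix `Y` with one signal per column returns the learned atoms and their sparse codes. Supervised or context-aware variants typically add further terms to the matrix being factorized, but how this paper does so cannot be inferred from the abstract alone.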



Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Siyahjani, F., Doretto, G. (2013). Learning a Context Aware Dictionary for Sparse Representation. In: Lee, K.M., Matsushita, Y., Rehg, J.M., Hu, Z. (eds) Computer Vision – ACCV 2012. ACCV 2012. Lecture Notes in Computer Science, vol 7725. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37444-9_18

  • DOI: https://doi.org/10.1007/978-3-642-37444-9_18

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-37443-2

  • Online ISBN: 978-3-642-37444-9

  • eBook Packages: Computer Science, Computer Science (R0)
