Skip to main content

Advertisement

Log in

Saliency computation via whitened frequency band selection

  • Research Article
  • Published:
Cognitive Neurodynamics Aims and scope Submit manuscript

Abstract

Many saliency computational models have been proposed to simulate bottom-up visual attention mechanism of human visual system. However, most of them only deal with certain kinds of images or aim at specific applications. In fact, human beings have the ability to correctly select attentive focuses of objects with arbitrary sizes within any scenes. This paper proposes a new bottom-up computational model from the perspective of frequency domain based on the biological discovery of non-Classical Receptive Field (nCRF) in the retina. A saliency map can be obtained according to the idea of Extended Classical Receptive Field. The model is composed of three major steps: firstly decompose the input image into several feature maps representing different frequency bands that cover the whole frequency domain by utilizing Gabor wavelet. Secondly, whiten the feature maps to highlight the embedded saliency information. Thirdly, select some optimal maps, simulating the response of receptive field especially nCRF, to generate the saliency map. Experimental results show that the proposed algorithm is able to work with stable effect and outstanding performance in a variety of situations as human beings do and is adaptive to both psychological patterns and natural images. Beyond that, biological plausibility of nCRF and Gabor wavelet transform make this approach reliable.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  • Abutaleb AS (1989) Automatic thresholding of gray-level pictures using two-dimensional entropy. Comput Vis Graph Image Process 47(1):22–32

    Article  Google Scholar 

  • Achanta R, Hemami S, Estrada F, Susstrunk S (2009) Frequency-tuned salient region detection. In: IEEE conference on computer vision and pattern recognition (CVPR), 2009. IEEE, pp 1597–1604

  • Bian P, Zhang L (2010) Visual saliency: a biologically plausible contourlet-like frequency domain approach. Cogn Neurodyn 4(3):189–198

    Article  PubMed  PubMed Central  Google Scholar 

  • Bruce N, Tsotsos J (2005) Saliency based on information maximization. In: The proceedings of the neural information processing systems conference (NIPS 2005), Vancouver, British Columbia, Canada, pp 155–162

  • Cheng MM, Zhang GX, Mitra NJ, Huang X, Hu SM (2011) Global contrast based salient region detection. In: 2011 IEEE conference on computer vision and pattern recognition (CVPR). IEEE, pp 409–416

  • Garcia-Diaz A, Fdez-Vidal XR, Pardo XM, Dosil R (2012) Saliency from hierarchical adaptation through decorrelation and variance normalization. Image Vis Comput 30(1):51–64

    Article  Google Scholar 

  • Ghosh K, Sarkar S, Bhaumik K (2006) A possible explanation of the low-level brightness–contrast illusions in the light of an extended classical receptive field model of retinal ganglion cells. Biol Cybern 94(2):89–96

    Article  PubMed  Google Scholar 

  • Gu Y, Liljenström H (2007) A neural network model of attention-modulated neurodynamics. Cogn Neurodyn 1(4):275–285

    Article  PubMed  PubMed Central  Google Scholar 

  • Guo C, Ma Q, Zhang L (2008) Spatio-temporal saliency detection using phase spectrum of quaternion Fourier transform. In: IEEE conference on computer vision and pattern recognition (CVPR), 2008. IEEE, pp 1–8

  • Harel J, Koch C, Perona P (2007) Graph-based visual saliency. In: Advances in neural information processing systems, vol 19. MIT Press, Cambridge, pp 545–552

  • Hou X, Zhang L (2007) Saliency detection: a spectral residual approach. In: IEEE conference on computer vision and pattern recognition (CVPR), 2007. IEEE, pp 1–8

  • Itti L (2004) Automatic foveation for video compression using a neurobiological model of visual attention. IEEE Trans Image Process 13(10):1304–1318

    Article  PubMed  Google Scholar 

  • Itti L, Koch C, Niebur E (1998) A model of saliency-based visual attention for rapid scene analysis. IEEE Trans Pattern Anal Mach Intell 20(11):1254–1259

    Article  Google Scholar 

  • Judd T, Ehinger K, Durand F, Torralba A (2009) Learning to predict where humans look. In: 2009 IEEE 12th international conference on computer vision. IEEE, pp 2106–2113

  • Koch C, Ullman S (1987) Shifts in selective visual attention: towards the underlying neural circuitry. In: Vaina LM (ed) Matters of intelligence. Springer, Netherlands, pp 115–141

  • Kootstra G, Nederveen A, De Boer B (2008) Paying attention to symmetry. In: Proceedings of the British machine vision conference (BMVC2008). The British Machine Vision Association and Society for Pattern Recognition, pp 1115–1125

  • Le Meur O, Le Callet P, Barba D (2007) Predicting visual fixations on video based on low-level visual features. Vis Res 47(19):2483–2498

    Article  PubMed  Google Scholar 

  • Li C, Zhou Y, Pei X, Qiu F, Tang C, Xu X (1992) Extensive disinhibitory region beyond the classical receptive field of cat retinal ganglion cells. Vis Res 32(2):219–228

    Article  CAS  PubMed  Google Scholar 

  • Li J, Levine MD, An X, Xu X, He H (2013) Visual saliency based on scale-space analysis in the frequency domain. IEEE Trans Pattern Anal Mach Intell 35(4):996–1010

    Article  PubMed  Google Scholar 

  • Liu T, Yuan Z, Sun J, Wang J, Zheng N, Tang X, Shum HY (2011) Learning to detect a salient object. IEEE Trans Pattern Anal Mach Intell 33(2):353–367

    Article  PubMed  Google Scholar 

  • Mahadevan V, Vasconcelos N (2009) Saliency-based discriminant tracking. In: IEEE conference on computer vision and pattern recognition, 2009. CVPR 2009. IEEE, pp 1007–1013

  • Mishra A, Aloimonos Y, Fah CL (2009) Active segmentation with fixation. In: 2009 IEEE 12th international conference on computer vision. IEEE, pp. 468–475

  • Oliva A, Torralba A (2001) Modeling the shape of the scene: a holistic representation of the spatial envelope. Int J Comput Vis 42(3):145–175

    Article  Google Scholar 

  • Perazzi F, Krahenbuhl P, Pritch Y, Hornung A (2012) Saliency filters: contrast based filtering for salient region detection. In: 2012 IEEE conference on computer vision and pattern recognition (CVPR), IEEE, pp 733–740

  • Rodieck RW, Stone J (1965) Analysis of receptive fields of cat retinal ganglion cells. J Neurophysiol 28(5):833–849

    Google Scholar 

  • Shi X, Bruce ND, Tsotsos JK (2011) Fast, recurrent, attentional modulation improves saliency representation and scene recognition. In: 2011 IEEE computer society conference on computer vision and pattern recognition workshops (CVPRW). IEEE, pp 1–8

  • Tatler BW, Baddeley RJ, Gilchrist ID (2005) Visual correlates of fixation selection: effects of scale and time. Vis Res 45(5):643–659

    Article  PubMed  Google Scholar 

  • Treisman AM, Gelade G (1980) A feature-integration theory of attention. Cogn Psychol 12(1):97–136

    Article  CAS  PubMed  Google Scholar 

  • Walther D, Koch C (2006) Modeling attention to salient proto-objects. Neural Networks 19(9):1395–1407

    Article  PubMed  Google Scholar 

  • Wright MJ (1972) Functional organization of the periphery effect in retinal ganglion cells. Vis Res 12(11):1857-IN8

    Google Scholar 

  • Yang CW, Chung PC, Chang CI (1996) Hierarchical fast two-dimensional entropic thresholding algorithm using a histogram pyramid. Opt Eng 35(11):3227–3241

    Article  Google Scholar 

  • Yu Y, Wang B, Zhang L (2011) Bottom-up attention: pulsed PCA transform and pulsed cosine transform. Cogn Neurodyn 5(4):321–332

    Article  PubMed  PubMed Central  Google Scholar 

  • Zhang L, Tong MH, Marks TK, Shan H, Cottrell GW (2008) SUN: a Bayesian framework for saliency using natural statistics. J Vis 8(7):32

    Article  PubMed  Google Scholar 

  • Zou W, Kpalma K, Liu Z, Ronsin J (2013) Segmentation driven low-rank matrix recovery for saliency detection. In: 24th British machine vision conference (BMVC), pp 1–13

Download references

Acknowledgments

This work was supported in part by the National Natural Science Foundation of China (Grant No. 61572133) and the Project from State Key Laboratory of Earth Surface Processes and Resource Ecology under Grant 2015-KF-01.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Bin Wang.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Lv, Q., Wang, B. & Zhang, L. Saliency computation via whitened frequency band selection. Cogn Neurodyn 10, 255–267 (2016). https://doi.org/10.1007/s11571-015-9372-y

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11571-015-9372-y

Keywords

Navigation