Skip to main content
Log in

A class-independent flexible algorithm to generate region proposals

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Generating a sufficient number of regions with high accuracy is an important objective in the region proposal generation techniques. This paper presents a new, robust, and effective approach, which is based on the bottom-up segmentation, to produce a pool of well-quality regions. After image segmentation, the segmented candidates are expanded into the surrounding regions. The suggested algorithm produces some enlarged regions, which better cover objects and stuff. The proposed process can be applied in three different modes, namely fixed_mode, all_mode, and efficient_mode. The fixed_mode extends each region into parts of all the adjacent regions using an extension controller, which considers adjacent sequential pixels for each point on the region boundary. In all_mode, the current region is merged with all the adjacent regions to generate a larger region. The efficient_mode is then implemented using the accumulation of the results from both the fixed_mode and all_mode. Besides, the algorithm can be repeated in the fixed_mode and all_mode by considering a variety of values for the extension controller factor. No features are required to be extracted in the proposed algorithm, except for the image segmentation stage. In this study, four challenging datasets known as MSRC, VOC2007, VOC2012, and COCO 2017 are used to compare the proposed algorithm with other segmentation and region proposal algorithms. As a significant advantage compared to well-known region proposal algorithms, our approach achieves a greater Recall with the desirable number of regions. Furthermore, the algorithm shows a good improvementin extraction of small, medium, and large objects.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  1. Achanta R, Shaji A, Smith K, Lucchi A, Fua P, Süsstrunk S (2012) Slic superpixels compared to state-of-the-art superpixel methods. IEEE transactions on pattern analysis and machine intelligence 34(11):2274–2282

    Article  Google Scholar 

  2. Alexe B, Deselaers T, Ferrari V (2010) What is an object?. In: 2010 IEEE Computer society conference on computer vision and pattern recognition, pp 73–80. IEEE

  3. Buluç A, Meyerhenke H, Safro I, Sanders P, Schulz C (2016) Recent advances in graph partitioning. In: Algorithm engineering, Springer, pp 117–158

  4. Carreira J, Sminchisescu C (2012) Cpmc: Automatic object segmentation using constrained parametric min-cuts. IEEE transactions on pattern analysis and machine intelligence 34(7):1312–1328

    Article  Google Scholar 

  5. Chen J, Luo B, Wu Q, Chen J, Peng X (2020) Overlap sampler for region-based object detection. In: The IEEE winter conference on applications of computer vision, pp 767–775

  6. Cheng M-M, Zhang Z, Lin W-Y, Torr P (2014) Bing: Binarized normed gradients for objectness estimation at 300fps. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3286–3293

  7. Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 886–893

  8. Endres I, Hoiem D (2014) Category-independent object proposals with diverse ranking. IEEE transactions on pattern analysis and machine intelligence 36(2):222–234

    Article  Google Scholar 

  9. Everingham M, Eslami SM Ali, Van Gool L, Williams Christopher KI, Winn J, Zisserman A (2015) The pascal visual object classes challenge: A retrospective. International journal of computer vision 111(1):98–136

    Article  Google Scholar 

  10. Felzenszwalb P F, Girshick R B, McAllester D, Ramanan D (2010) Object detection with discriminatively trained part-based models. IEEE transactions on pattern analysis and machine intelligence 32(9):1627–1645

    Article  Google Scholar 

  11. Felzenszwalb P F, Huttenlocher D P (2004) Efficient graph-based image segmentation. Int J Comp Vis 59(2):167–181

    Article  Google Scholar 

  12. Ghodrati A, Diba A, Pedersoli M, Tuytelaars T, Van Gool L (2015) Deepproposal: Hunting objects by cascading deep convolutional layers. In: Proceedings of the IEEE international conference on computer vision, pp 2578–2586

  13. Gidaris S, Komodakis N (2016) Attend refine repeat: Active box proposal generation via in-out localization. In: BMVC

  14. Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 580–587

  15. Gu C, Lim JJ, Arbeláez P, Malik J (2009) Recognition using regions. In: 2009 IEEE Conference on computer vision and pattern recognition, pp 1030–1037. IEEE

  16. He K, Gkioxari G, Dollár P, Girshick R (2017) Mask r-cnn. In: Proceedings of the IEEE international conference on computer vision, pp 2961–2969

  17. Hosang J, Benenson R, Dollár P, Schiele B (2016) What makes for effective detection proposals?. IEEE transactions on pattern analysis and machine intelligence 38(4):814–830

    Article  Google Scholar 

  18. Humayun A, Li F, Rehg JM (2014) Rigor: Reusing inference in graph cuts for generating object regions. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 336– 343

  19. Jian M, Lam K-M, Dong J, Shen L (2014) Visual-patch-attention-aware saliency detection. IEEE Trans Cyber 45(8):1575–1586

    Article  Google Scholar 

  20. Jian M, Zhao R, Sun X, Luo H, Zhang W, Zhang H, Dong J, Yin Y, Lam K-M (2018) Saliency detection based on background seeds by object proposals and extended random walk. J Vis Commun Image Represent 57:202–211

    Article  Google Scholar 

  21. Kaya EC, Alatan AA (2018) Improving proposal-based object detection using convolutional context features. In: 2018 25th IEEE International conference on image processing (ICIP), pp 1308–1312. IEEE

  22. Ke W, Zhang T, Chen J, Wan F, Ye Q, Han Z (2016) Texture complexity based redundant regions ranking for object proposal. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp 10–18

  23. Kim J, Grauman K (2012) Shape sharing for object segmentation. In: European conference on computer vision, pp 444–458. Springer

  24. Krähenbühl P, Koltun V (2014) Geodesic object proposals. In: European conference on computer vision, pp 725–739. Springer

  25. Kuo W, Hariharan B, Malik J (2015) Deepbox: Learning objectness with convolutional networks. In: Proceedings of the IEEE international conference on computer vision, pp 2479–2487

  26. Lampert CH, Blaschko MB, Hofmann T (2009) Efficient subwindow search: A branch and bound framework for object localization. IEEE transactions on pattern analysis and machine intelligence 31(12):2129–2142

    Article  Google Scholar 

  27. Li H, Liu Y, Ouyang W, Wang X (2019) Zoom out-and-in network with map attention decision for region proposal and object detection. Int J Comput Vis 127(3):225–238

    Article  Google Scholar 

  28. Li J, Liang X, Shen S, Xu T, Feng J, Yan S (2017) Scale-aware fast r-cnn for pedestrian detection. IEEE transactions on Multimedia 20 (4):985–996

    Google Scholar 

  29. Li S, Zhang H, Zhang J, Ren Y, Kuo C-C J (2017) Box refinement: Object proposal enhancement and pruning. In: 2017 IEEE winter conference on applications of computer vision (WACV), pp 979–988. IEEE

  30. Liao M, Shi B, Bai X, Wang X, Liu W (2017) Textboxes: A fast text detector with a single deep neural network. In: Thirty-first AAAI conference on artificial intelligence, pp 4161–4167

  31. Lin T-Y, Dollár P, Girshick R, He K, Hariharan B, Belongie S (2017) Feature pyramid networks for object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2117–2125

  32. Lin T-Y, Maire M, Belongie S, Hays J, Perona P, Ramanan D, Dollár P, Zitnick CL (2014) Microsoft coco: Common objects in context. In: European conference on computer vision, pp 740–755. Springer

  33. Manen S, Guillaumin M, Van Gool L (2013) Prime object proposals with randomized prim’s algorithm. In: Proceedings of the IEEE international conference on computer vision, pp 2536–2543

  34. Mordan T, Thome N, Henaff G, Cord M (2019) End-to-end learning of latent deformable part-based representations for object detection. Int J Comput Vis 127(11-12):1659–1679

    Article  Google Scholar 

  35. Najibi M, Singh B, Davis LS (2019) Fa-rpn: Floating region proposals for face detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7723–7732

  36. Pinheiro PO, Collobert R, Dollár P (2015) Learning to segment object candidates. Advances in neural information processing systems 28:1990–1998

    Google Scholar 

  37. Pinheiro PO, Lin T-Y, Collobert R, Dollár P (2016) Learning to refine object segments. In: European conference on computer vision, pp 75–91. Springer

  38. Pont-Tuset J, Arbelaez P, Barron JT, Marques F, Malik J (2017) Multiscale combinatorial grouping for image segmentation and object proposal generation. IEEE transactions on pattern analysis and machine intelligence 39(1):128–140

    Article  Google Scholar 

  39. Rahtu E, Kannala J, Blaschko M (2011) Learning a category independent object detection cascade. In: 2011 International conference on computer vision, pp 1052–1059. IEEE

  40. Rantalankila P, Kannala J, Rahtu E (2014) Generating object segmentation proposals using global and local search. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2417–2424

  41. Ren S, He K, Girshick R, Sun J (2015) Faster r-cnn: Towards real-time object detection with region proposal networks. In: Advances in neural information processing systems, pp 91–99

  42. Shotton J, Winn J, Rother C, Criminisi A (2006) Textonboost: Joint appearance, shape and context modeling for multi-class object recognition and segmentation. In: European conference on computer vision, pp 1–15. Springer

  43. Sorkhi AG, Hassanpour H, Fateh M (2020) A comprehensive system for image scene classification. Multi Tools App 79(25):1–26

    Google Scholar 

  44. Taghizadeh M, Chalechale A (2020) Region proposal generation: A hierarchical merging similarity-based algorithm. In: 2020 International conference on machine vision and image processing (MVIP), pp 1–6. IEEE

  45. Tang Y, Wang X, Dellandréa E, Chen L (2017) Weakly supervised learning of deformable part-based models for object detection via region proposals. IEEE Trans Multi 19(2):393–407

    Article  Google Scholar 

  46. Uijlings Jasper RR, Van De Sande Koen EA, Gevers T, Smeulders Arnold WM (2013) Selective search for object recognition. Int J Comp Vis 104 (2):154–171

    Article  Google Scholar 

  47. Vedaldi A, Soatto S (2008) Quick shift and kernel methods for mode seeking. In: European conference on computer vision, pp 705–718. Springer

  48. Vu T, Jang H, Pham TX, Yoo C (2019) Cascade rpn: Delving into high-quality region proposal network with adaptive convolution. In: Advances in neural information processing systems, pp 1430–1440

  49. Wang J, Chen K, Yang S, Loy CC, Lin D (2019) Region proposal by guided anchoring. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2965–2974

  50. Zhang Z, Liu Y, Chen X, Zhu Y, Cheng M-M, Saligrama V, Torr Philip HS (2018) Sequential optimization for efficient high-quality object proposal generation. IEEE transactions on pattern analysis and machine intelligence 40(5):1209–1223

    Article  Google Scholar 

  51. Zitnick CL, Dollár P (2014) Edge boxes: Locating object proposals from edges. In: European conference on computer vision, pp 391–405. Springer

Download references

Acknowledgements

The authors would like to grateful thank Dr. Ali Jannesari who is with the Department of Computer Science at Iowa State University, USA, for his valuable help in this work.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Abdolah Chalechale.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Taghizadeh, M., Chalechale, A. A class-independent flexible algorithm to generate region proposals. Multimed Tools Appl 80, 24697–24717 (2021). https://doi.org/10.1007/s11042-021-10826-y

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-021-10826-y

Keywords

Navigation