Abstract
Generating a sufficient number of regions with high accuracy is an important objective in the region proposal generation techniques. This paper presents a new, robust, and effective approach, which is based on the bottom-up segmentation, to produce a pool of well-quality regions. After image segmentation, the segmented candidates are expanded into the surrounding regions. The suggested algorithm produces some enlarged regions, which better cover objects and stuff. The proposed process can be applied in three different modes, namely fixed_mode, all_mode, and efficient_mode. The fixed_mode extends each region into parts of all the adjacent regions using an extension controller, which considers adjacent sequential pixels for each point on the region boundary. In all_mode, the current region is merged with all the adjacent regions to generate a larger region. The efficient_mode is then implemented using the accumulation of the results from both the fixed_mode and all_mode. Besides, the algorithm can be repeated in the fixed_mode and all_mode by considering a variety of values for the extension controller factor. No features are required to be extracted in the proposed algorithm, except for the image segmentation stage. In this study, four challenging datasets known as MSRC, VOC2007, VOC2012, and COCO 2017 are used to compare the proposed algorithm with other segmentation and region proposal algorithms. As a significant advantage compared to well-known region proposal algorithms, our approach achieves a greater Recall with the desirable number of regions. Furthermore, the algorithm shows a good improvementin extraction of small, medium, and large objects.
Similar content being viewed by others
References
Achanta R, Shaji A, Smith K, Lucchi A, Fua P, Süsstrunk S (2012) Slic superpixels compared to state-of-the-art superpixel methods. IEEE transactions on pattern analysis and machine intelligence 34(11):2274–2282
Alexe B, Deselaers T, Ferrari V (2010) What is an object?. In: 2010 IEEE Computer society conference on computer vision and pattern recognition, pp 73–80. IEEE
Buluç A, Meyerhenke H, Safro I, Sanders P, Schulz C (2016) Recent advances in graph partitioning. In: Algorithm engineering, Springer, pp 117–158
Carreira J, Sminchisescu C (2012) Cpmc: Automatic object segmentation using constrained parametric min-cuts. IEEE transactions on pattern analysis and machine intelligence 34(7):1312–1328
Chen J, Luo B, Wu Q, Chen J, Peng X (2020) Overlap sampler for region-based object detection. In: The IEEE winter conference on applications of computer vision, pp 767–775
Cheng M-M, Zhang Z, Lin W-Y, Torr P (2014) Bing: Binarized normed gradients for objectness estimation at 300fps. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3286–3293
Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 886–893
Endres I, Hoiem D (2014) Category-independent object proposals with diverse ranking. IEEE transactions on pattern analysis and machine intelligence 36(2):222–234
Everingham M, Eslami SM Ali, Van Gool L, Williams Christopher KI, Winn J, Zisserman A (2015) The pascal visual object classes challenge: A retrospective. International journal of computer vision 111(1):98–136
Felzenszwalb P F, Girshick R B, McAllester D, Ramanan D (2010) Object detection with discriminatively trained part-based models. IEEE transactions on pattern analysis and machine intelligence 32(9):1627–1645
Felzenszwalb P F, Huttenlocher D P (2004) Efficient graph-based image segmentation. Int J Comp Vis 59(2):167–181
Ghodrati A, Diba A, Pedersoli M, Tuytelaars T, Van Gool L (2015) Deepproposal: Hunting objects by cascading deep convolutional layers. In: Proceedings of the IEEE international conference on computer vision, pp 2578–2586
Gidaris S, Komodakis N (2016) Attend refine repeat: Active box proposal generation via in-out localization. In: BMVC
Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 580–587
Gu C, Lim JJ, Arbeláez P, Malik J (2009) Recognition using regions. In: 2009 IEEE Conference on computer vision and pattern recognition, pp 1030–1037. IEEE
He K, Gkioxari G, Dollár P, Girshick R (2017) Mask r-cnn. In: Proceedings of the IEEE international conference on computer vision, pp 2961–2969
Hosang J, Benenson R, Dollár P, Schiele B (2016) What makes for effective detection proposals?. IEEE transactions on pattern analysis and machine intelligence 38(4):814–830
Humayun A, Li F, Rehg JM (2014) Rigor: Reusing inference in graph cuts for generating object regions. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 336– 343
Jian M, Lam K-M, Dong J, Shen L (2014) Visual-patch-attention-aware saliency detection. IEEE Trans Cyber 45(8):1575–1586
Jian M, Zhao R, Sun X, Luo H, Zhang W, Zhang H, Dong J, Yin Y, Lam K-M (2018) Saliency detection based on background seeds by object proposals and extended random walk. J Vis Commun Image Represent 57:202–211
Kaya EC, Alatan AA (2018) Improving proposal-based object detection using convolutional context features. In: 2018 25th IEEE International conference on image processing (ICIP), pp 1308–1312. IEEE
Ke W, Zhang T, Chen J, Wan F, Ye Q, Han Z (2016) Texture complexity based redundant regions ranking for object proposal. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp 10–18
Kim J, Grauman K (2012) Shape sharing for object segmentation. In: European conference on computer vision, pp 444–458. Springer
Krähenbühl P, Koltun V (2014) Geodesic object proposals. In: European conference on computer vision, pp 725–739. Springer
Kuo W, Hariharan B, Malik J (2015) Deepbox: Learning objectness with convolutional networks. In: Proceedings of the IEEE international conference on computer vision, pp 2479–2487
Lampert CH, Blaschko MB, Hofmann T (2009) Efficient subwindow search: A branch and bound framework for object localization. IEEE transactions on pattern analysis and machine intelligence 31(12):2129–2142
Li H, Liu Y, Ouyang W, Wang X (2019) Zoom out-and-in network with map attention decision for region proposal and object detection. Int J Comput Vis 127(3):225–238
Li J, Liang X, Shen S, Xu T, Feng J, Yan S (2017) Scale-aware fast r-cnn for pedestrian detection. IEEE transactions on Multimedia 20 (4):985–996
Li S, Zhang H, Zhang J, Ren Y, Kuo C-C J (2017) Box refinement: Object proposal enhancement and pruning. In: 2017 IEEE winter conference on applications of computer vision (WACV), pp 979–988. IEEE
Liao M, Shi B, Bai X, Wang X, Liu W (2017) Textboxes: A fast text detector with a single deep neural network. In: Thirty-first AAAI conference on artificial intelligence, pp 4161–4167
Lin T-Y, Dollár P, Girshick R, He K, Hariharan B, Belongie S (2017) Feature pyramid networks for object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2117–2125
Lin T-Y, Maire M, Belongie S, Hays J, Perona P, Ramanan D, Dollár P, Zitnick CL (2014) Microsoft coco: Common objects in context. In: European conference on computer vision, pp 740–755. Springer
Manen S, Guillaumin M, Van Gool L (2013) Prime object proposals with randomized prim’s algorithm. In: Proceedings of the IEEE international conference on computer vision, pp 2536–2543
Mordan T, Thome N, Henaff G, Cord M (2019) End-to-end learning of latent deformable part-based representations for object detection. Int J Comput Vis 127(11-12):1659–1679
Najibi M, Singh B, Davis LS (2019) Fa-rpn: Floating region proposals for face detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7723–7732
Pinheiro PO, Collobert R, Dollár P (2015) Learning to segment object candidates. Advances in neural information processing systems 28:1990–1998
Pinheiro PO, Lin T-Y, Collobert R, Dollár P (2016) Learning to refine object segments. In: European conference on computer vision, pp 75–91. Springer
Pont-Tuset J, Arbelaez P, Barron JT, Marques F, Malik J (2017) Multiscale combinatorial grouping for image segmentation and object proposal generation. IEEE transactions on pattern analysis and machine intelligence 39(1):128–140
Rahtu E, Kannala J, Blaschko M (2011) Learning a category independent object detection cascade. In: 2011 International conference on computer vision, pp 1052–1059. IEEE
Rantalankila P, Kannala J, Rahtu E (2014) Generating object segmentation proposals using global and local search. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2417–2424
Ren S, He K, Girshick R, Sun J (2015) Faster r-cnn: Towards real-time object detection with region proposal networks. In: Advances in neural information processing systems, pp 91–99
Shotton J, Winn J, Rother C, Criminisi A (2006) Textonboost: Joint appearance, shape and context modeling for multi-class object recognition and segmentation. In: European conference on computer vision, pp 1–15. Springer
Sorkhi AG, Hassanpour H, Fateh M (2020) A comprehensive system for image scene classification. Multi Tools App 79(25):1–26
Taghizadeh M, Chalechale A (2020) Region proposal generation: A hierarchical merging similarity-based algorithm. In: 2020 International conference on machine vision and image processing (MVIP), pp 1–6. IEEE
Tang Y, Wang X, Dellandréa E, Chen L (2017) Weakly supervised learning of deformable part-based models for object detection via region proposals. IEEE Trans Multi 19(2):393–407
Uijlings Jasper RR, Van De Sande Koen EA, Gevers T, Smeulders Arnold WM (2013) Selective search for object recognition. Int J Comp Vis 104 (2):154–171
Vedaldi A, Soatto S (2008) Quick shift and kernel methods for mode seeking. In: European conference on computer vision, pp 705–718. Springer
Vu T, Jang H, Pham TX, Yoo C (2019) Cascade rpn: Delving into high-quality region proposal network with adaptive convolution. In: Advances in neural information processing systems, pp 1430–1440
Wang J, Chen K, Yang S, Loy CC, Lin D (2019) Region proposal by guided anchoring. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2965–2974
Zhang Z, Liu Y, Chen X, Zhu Y, Cheng M-M, Saligrama V, Torr Philip HS (2018) Sequential optimization for efficient high-quality object proposal generation. IEEE transactions on pattern analysis and machine intelligence 40(5):1209–1223
Zitnick CL, Dollár P (2014) Edge boxes: Locating object proposals from edges. In: European conference on computer vision, pp 391–405. Springer
Acknowledgements
The authors would like to grateful thank Dr. Ali Jannesari who is with the Department of Computer Science at Iowa State University, USA, for his valuable help in this work.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Taghizadeh, M., Chalechale, A. A class-independent flexible algorithm to generate region proposals. Multimed Tools Appl 80, 24697–24717 (2021). https://doi.org/10.1007/s11042-021-10826-y
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-021-10826-y