Interval type-2 fuzzy set and human vision based multi-scale geometric analysis for text-graphics segmentation

Dhar, Soumyadip; Kundu, Malay K.

doi:10.1007/s11042-019-7649-6

Interval type-2 fuzzy set and human vision based multi-scale geometric analysis for text-graphics segmentation

Published: 27 April 2019

Volume 78, pages 22939–22957, (2019)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

155 Accesses
5 Citations
Explore all metrics

Abstract

This paper presents a novel method for texture-based text-graphic segmentation in a text embedded image. In the method, features are computed applying Multi-scale Geometric Analysis(MGA). The MGA of the image is done by Nonsubsampled contourlet transform(NSCT). The NSCT sub-bands help to generate the features which represent textures of the text portions and graphics portions of the image. In a segmentation process, the uncertainties arise mainly for two reasons: one is the ambiguity in gray level and other is the spatial ambiguity. Here the uncertainties are managed by interval type2 fuzzy set (IT2FS). The human vision model called human psychovisual phenomenon (HVS) is incorporated in the process for generating the interval type-2 fuzzy membership functions (IT2FMF). The efficiency of the proposed scheme is measured on the benchmark dataset. The robustness and performance bound of the proposed technique under noise corruption are measured statistically using modified Cramer-Rao bound. We found that effectiveness of the features by NSCT in combination with the IT2FS are quite promising in comparison to the state-of-the-arts methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Rough-Fuzzy Clustering and M-Band Wavelet Packet for Text-Graphics Segmentation

Comparative Analysis of Multi-scale Wavelet Decomposition and k-Means Clustering Based Text Extraction

Article 18 May 2019

A New Threshold Selection Method Based on Fuzzy Expert Systems for Separating Text from the Background of Document Images

Article 29 October 2018

References

Acharyya M, Kundu MK (2001) Adaptive basis selection for multitexture segmentation by m-band wavelet packet frames. Proc Int Conf Image Process 2:622–625
Acharyya M, Kundu MK (2002) Document image segmentation using wavelet scale-space features. IEEE Trans Circ Syst Video Technol 12(12):1117–1127
Article Google Scholar
Bai B, Yin F, Liu CL (2014) A seed-based segmentation method for scene text extraction. IAPR International Workshop on Document Analysis Systems, pp 262–266
Bustince H, Barrenechea E, Pagola M (2006) Restricted equivalence functions. Fuzzy Sets Syst 157:2333–2346
Article MathSciNet MATH Google Scholar
Candes EJ, Donoho DL (2003) New tight frames of curvelets and optimal representations of objects with singularities. Commun Pur Appl Math 57:219–266
Article MathSciNet MATH Google Scholar
Chan W, Coghill G (2001) Text analysis using local energy. Pattern Recogn 34:2523–2532
Article MATH Google Scholar
Chen D, Odobez J-M, Bourlard H (2004) Text detection and recognition in images and video frames. Pattern Recogn 37:595–608
Article Google Scholar
Cho H, Sung M, Jun B (2016) Canny text detector: Fast and robust scene text localization algorithm. In: International conference on computer vision and pattern recognition (CVPR), pp 3566–3573
Da Cunha AL, Zhou J, Do MN (2006) The nonsubsampled contourlet transform: theory, design and applications. IEEE Trans Image Process 15(10):3089–3101
Article Google Scholar
Dhar S, Kundu MK (2017) Accurate segmentation of complex document image using digital shearlet transform with neutrosophic set as uncertainty handling tool. Appl Soft Comput 61:412–426
Article Google Scholar
Do MN, Vetterli M (2005) The contourlet transform: an efficient directional multiresolution image representation. Proc Br Mach Vis Conf 14(12):2091–2106
Google Scholar
Everingham M, Gool LV, Williams CKI, Winn J, Zisserman A (2010) The pascal visual object classes (voc) challenge. Int J Comput Vis 88:303–338
Article Google Scholar
Grana C, Serra G, Manfredi M, Coppi D, Cucchiara R (2016) Layout analysis and content enrichment of digitized books. Multimed Tools Appl 75 (7):3879–3900
Article Google Scholar
Gomez L, Karatzas D (2013) Multi-script text extraction from natural scenes. In: Proceedings of international conference on document analysis and recognition, pp 467–471
Hartnett K (2019) Foundations built for a general theory of neural networks. Quanta magazine
He T, Huang W, Qiao Y, Yao J (2016) Text-attentional convolutional neural network for scene text detection. IEEE Trans Image Process 25:2529–2541
Article MathSciNet MATH Google Scholar
He P, Huang W, He T, Zhu Q, Qiao Y, Li X (2017) Single shot text detector with regional attention. In: Proceedings of international conference on computer vision (ICCV)
Hosseini S, Lee SH, Cho NI (2018) Feeding hand-crafted features for enhancing the performance of convolutional neural networks. arXiv:1801.07848
Huang W, Lin Z, Yang J, Wang J (2013) Text localization in natural images using stroke feature transform and text covariance descriptors. In: Proceedings of the IEEE international conference on computer vision, pp 1241–1248
Huang W, Yu Q, Tang X (2014) Robust scene text detection with convolution neural network induced MSER trees. Proc Eur Conf Comput Vis, LNCS 8692:497–511
Google Scholar
ICDAR2015 dataset. http://rrc.cvc.uab.es/ (2015)
Kaist scene text database. www.iapr-tc11.org/mediawiki/index.php/KAIST_Scene_Text_Database (2011)
Karnik NN, Mendel JM (1989) Introduction to type-2 fuzzy logic systems. In: Proceeding of International Conference on Fuzzy Systems, pp 915–920
Kim SH, An KJ, Jang SW, Kim GY (2016) Texture feature-based text region segmentation in social multimedia data. Multimed Tools Appl 75(20):12815–12829
Article Google Scholar
Kobchaisawat T, Chalidabhongse TH (2014) Thai text localization in natural scene images using convolutional neural network. Signal and Information Processing Association Annual Summit and conference (APSIPA)
Kundu MK, Pal S (1986) Thresholding for edge detection using human psychovisual phenomena. Pattern Recogn Lett 4(6):433–441
Article Google Scholar
Kundu MK, Acharyya M (2003) M-band wavelets:application to texture segmentation for real life image analysis. Int J Wavelets, Multiresolution Inf Process 1 (1):115–119
Article MATH Google Scholar
Kundu MK, Dhar S, Banerjee M (2012) A new approch for segmentation of image and text in natural and commercial text documents. In: Proceedings of international conference on communications, devices and intelligent system, pp 86–88
Lan Z, Yu S-I, Yao D, Lin M, Raj B, Hauptmann A (2016) The best of both worlds: Combining data-independent and data-driven approaches for action recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp 123–132
Li Y, Lu H (2012) Scene text detection via stroke width. In: Proceedings of international conference on pattern recognition, pp 681–684
Liang CW, Chen PY (2004) Dwt based text International localization. J Appl Sci Eng 2:105–116
Google Scholar
Liu Z, Li Y, Qi X, Yang Y, Nian M, Zhang H, Xiamixiding R (2017) Method for unconstrained text detection in natural scene image. IET Computer Vision
Long S, He X, Ya C (2018) Scene text detection and recognition: The deep learning era. arXiv:1811.04256
Lucas SM (2005) ICDAR2005 text locating competition results. Proc Int Conf Doc Anal Recogn 1:80–84
Google Scholar
López F, Valiente J, Baldrich R, Vanrell M (2005) Fast surface grading using color statistics in the cie lab space. In: Iberian conference on pattern recognition and image analysis, pp 666–673
Maji P, Roy S (2015) Rough -fuzzy clustering and multiresolution image analysis for text-graphics segmentation. Appl Soft Comput 30:705–721
Article Google Scholar
Mitra P, Murty CA, Pal S (2002) Unsupervised feature selection using feature similarity. IEEE Trans Pattern Anal Mach Intell 24(3):301–312
Article Google Scholar
Murthy CA, Pal S (1992) Histogram thresholding by minimizing gray level fuzziness. Inf Sci 60:107–135
Article MATH Google Scholar
Nagg G, Seth S, Viswanathan M (1992) A protoptype document image analysis system for technical journals. Computer 25:10–22
Article Google Scholar
Nguyen DT, Pham TD, Na RB, Park KR (2018) Combining deep and handcrafted image features for presentation attack detection in face recognition systems using visible-light camera sensors. Sensors 18(3):699
Article Google Scholar
Park J, Lee G, Kim E, Lim J, Kim S, Yang H, Lee M, Hwang S (2010) Automatic detection and recognition of korean text in outdoor signboard images. Pattern Recogn Lett 31(12):1728–1739
Article Google Scholar
Pennec EL, Mallat S (2000) Image compression with geometrical wavelets. In: 2000 Proceedings. 2000 International Conference on Image Processing, vol 1, pp 661–664
Peng R, Varshney PK (2015) On performance limit of image segmentation algorithms. Comput Vis Image Underst 132:24–38
Article Google Scholar
Rosenfield A (1998) Fuzzy geometry: an updated overview. Inf Sci 110:127–133
Article MathSciNet Google Scholar
Roy S, Kundu MK, Granlund GH (1996) Uncertainty relations and time-frequency distributions for unsharp observables. Inf Sci 89:193–209
Article MATH Google Scholar
Shi C, Wang C, Xiao B, Zhang Y (2013) Scene text detection using graph model built upon maximally stable extremal regions. Pattren Recogn Letter 34:107–116
Article Google Scholar
Szmidt E, Kacprzyk J (2001) Entropy for intuitionistic fuzzy sets. Fuzzy Sets Syst 118(3):467–477
Article MathSciNet MATH Google Scholar
Tian Z, Huang W, He T, He P, Qiao Y (2016) Detecting text in natural image with connectionist text proposal network. In: Proceedings of European conference on computer vision (ECCV)
Wei Y, Zhang Z, Shen W, Zeng D, Fang M, Zhou S (2017) Text detection in scene images based on exhaustive segmentation. Signal Process Image Commun 50:1–8
Article Google Scholar
Yang H, Quehl B, Sack H (2014) A framework for improved video text detection and recognition. Multimed Tools Appl 69(1):217–245
Article Google Scholar
Yao C, Bai X, Liu W, Ma Y, Tu Z (2012) Detecting texts of arbitrary orientations in natural images. In: 2012 IEEE conference on computer vision and pattern recognition. IEEE, pp 1083–1090
Yi C, Tian YL (2011) Text string detection from natural scenes by structure-based partition and grouping. IEEE Trans Image Process 20(9):2594–2605
Article MathSciNet MATH Google Scholar
Yin X, Yin X, Hao HW, Hung K (2014) Robust text detection in natural scene images. IEEE Trans Pattern Anal Mach Intell 36:970–983
Article Google Scholar
Zhu A, Wang G, Dong Y (2015) Detecting natural scenes text via auto image partition, two-stage grouping and two-layer classification. Pattern Recogn Lett 67:153–162
Article Google Scholar
Zhu Y, Yao C, Bai X (2016) Scene text detection and recognition:recent advances and future trends. Front Comput Sci 10:19–36
Article Google Scholar
Zhu W, Chen Q, Wei C, Li Z (2017) A segmentation algorithm based on image projection for complex text layout. In: AIP Conference Proceedings. AIP Publishing, vol 1890, pp 030011

Download references

Author information

Authors and Affiliations

RCCIIT, Beliaghata, Kolkata, 700010, India
Soumyadip Dhar
ISI, B.T. Road, Kolkata, 700108, India
Malay K. Kundu

Authors

Soumyadip Dhar
View author publications
You can also search for this author in PubMed Google Scholar
Malay K. Kundu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Soumyadip Dhar.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Dhar, S., Kundu, M.K. Interval type-2 fuzzy set and human vision based multi-scale geometric analysis for text-graphics segmentation. Multimed Tools Appl 78, 22939–22957 (2019). https://doi.org/10.1007/s11042-019-7649-6

Download citation

Received: 01 May 2018
Revised: 14 February 2019
Accepted: 15 April 2019
Published: 27 April 2019
Issue Date: 30 August 2019
DOI: https://doi.org/10.1007/s11042-019-7649-6

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Interval type-2 fuzzy set and human vision based multi-scale geometric analysis for text-graphics segmentation

Abstract

Access this article

Similar content being viewed by others

Rough-Fuzzy Clustering and M-Band Wavelet Packet for Text-Graphics Segmentation

Comparative Analysis of Multi-scale Wavelet Decomposition and k-Means Clustering Based Text Extraction

A New Threshold Selection Method Based on Fuzzy Expert Systems for Separating Text from the Background of Document Images

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Interval type-2 fuzzy set and human vision based multi-scale geometric analysis for text-graphics segmentation

Abstract

Access this article

Similar content being viewed by others

Rough-Fuzzy Clustering and M-Band Wavelet Packet for Text-Graphics Segmentation

Comparative Analysis of Multi-scale Wavelet Decomposition and k-Means Clustering Based Text Extraction

A New Threshold Selection Method Based on Fuzzy Expert Systems for Separating Text from the Background of Document Images

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation