Abstract
A bottom-up approach to segmentation of a scanned document into background, text, and image regions is considered. The image is partitioned into blocks at the first step. A series of texture features is computed for each block. The block type is determined on the basis of these features. Different variants of block arrangement and size, 26 texture variables, and four block type classification algorithms have been considered. The block type is corrected on the basis of adjacent region analysis at the second step. The error matrix and ICDAR 2007 criterion are used for result estimation.
Similar content being viewed by others
References
Zh. Lu, I. Bazzi, A. Kornai, J. Makhoul, P. Natarajan, R. Schwartz, and A. Robust, “Language-Independent OCR System,” Electron. Imaging: Proc. SPIE 3584 (1999).
R. L. de Queiroz, R. Buckley, and M. Xu, “Mixed Raster Content (MRC) Model for Compound Image Compression,” Proc. Int. Conf. Image Processing 3653, 1106–1117 (1999).
J. J. Sauvola and M. Pietikäinen, “Page Segmentation and Classification Using Fast Feature Extraction and Connectivity Analysis,” in Proc. Int. Conf. on Document Analysis and Recognition (Montreal, 1995), pp. 1127–1131.
F. Wahl, K. Wong, and R. Casey, “Block Segmentation and Text Extraction in Mixed Text/Image Document,” Comp. Graphics Image Processing 20, 375–390 (1982).
H. S. Baird, M. A. Moll, Chang An, and M. R. Casey, “Document Image Content Inventories,” in Proc. SPIE/IS&T Document Recognition & Retrieval XIV Conf. (San Jose, CA, 2007).
L. G. Shapiro and G. C. Stockman, Computer Vision (BINOM. Knowledge Laboratory, Moscow, 2006) [in Russian].
F. Cesarini, S. Marinai, G. Soda, and M. Gori, “Structural Document Segmentation and Representation by the Modified X-Y Tree,” in Proc. Int. Conf. on Document Analysis and Recognition (Bangalore, 1999), p. 563.
http://graphics.cs.msu.ru/ru/science/research/machinelearning/adaboosttollbox
A. Antonacopoulos, B. Gatos, and D. Bridson, “ICDAR2007 Page Segmentation Competition,” in Proc. ICDAR2007 (Curitiba, 2007), pp. 1279–1283.
B. A. Yanikoglu and L. Vincent, “Pink Parameter: A Complete Environment for Ground-Truthing and Benchmarking Document Page Segmentation,” Pattern Recogn. 31(9), 1191–1204 (1994).
I. Phillips and A. Chhabra, “Empirical Performance Evaluation of Graphics Recognition Systems,” IEEE Trans. Pattern Anal. Mach. Intell. 21(9), 849–870 (1999).
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Vil’kin, A.M., Safonov, I.V. & Egorova, M.A. Bottom-up document segmentation method based on textural features. Pattern Recognit. Image Anal. 21, 565–568 (2011). https://doi.org/10.1134/S1054661811021124
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1134/S1054661811021124