Skip to main content
Log in

Bottom-up document segmentation method based on textural features

  • Applied Problems
  • Published:
Pattern Recognition and Image Analysis Aims and scope Submit manuscript

Abstract

A bottom-up approach to segmentation of a scanned document into background, text, and image regions is considered. The image is partitioned into blocks at the first step. A series of texture features is computed for each block. The block type is determined on the basis of these features. Different variants of block arrangement and size, 26 texture variables, and four block type classification algorithms have been considered. The block type is corrected on the basis of adjacent region analysis at the second step. The error matrix and ICDAR 2007 criterion are used for result estimation.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Zh. Lu, I. Bazzi, A. Kornai, J. Makhoul, P. Natarajan, R. Schwartz, and A. Robust, “Language-Independent OCR System,” Electron. Imaging: Proc. SPIE 3584 (1999).

  2. R. L. de Queiroz, R. Buckley, and M. Xu, “Mixed Raster Content (MRC) Model for Compound Image Compression,” Proc. Int. Conf. Image Processing 3653, 1106–1117 (1999).

    Google Scholar 

  3. J. J. Sauvola and M. Pietikäinen, “Page Segmentation and Classification Using Fast Feature Extraction and Connectivity Analysis,” in Proc. Int. Conf. on Document Analysis and Recognition (Montreal, 1995), pp. 1127–1131.

  4. F. Wahl, K. Wong, and R. Casey, “Block Segmentation and Text Extraction in Mixed Text/Image Document,” Comp. Graphics Image Processing 20, 375–390 (1982).

    Article  Google Scholar 

  5. H. S. Baird, M. A. Moll, Chang An, and M. R. Casey, “Document Image Content Inventories,” in Proc. SPIE/IS&T Document Recognition & Retrieval XIV Conf. (San Jose, CA, 2007).

  6. L. G. Shapiro and G. C. Stockman, Computer Vision (BINOM. Knowledge Laboratory, Moscow, 2006) [in Russian].

    Google Scholar 

  7. F. Cesarini, S. Marinai, G. Soda, and M. Gori, “Structural Document Segmentation and Representation by the Modified X-Y Tree,” in Proc. Int. Conf. on Document Analysis and Recognition (Bangalore, 1999), p. 563.

  8. http://graphics.cs.msu.ru/ru/science/research/machinelearning/adaboosttollbox

  9. http://www.csie.ntu.edu.tw/~cjln/libswm

  10. http://www.cs.umd.edu/~mount/ANN

  11. http://leenissen.dk/fann

  12. http://en.wikipedia.org/wiki/Confusion-matrix

  13. A. Antonacopoulos, B. Gatos, and D. Bridson, “ICDAR2007 Page Segmentation Competition,” in Proc. ICDAR2007 (Curitiba, 2007), pp. 1279–1283.

  14. B. A. Yanikoglu and L. Vincent, “Pink Parameter: A Complete Environment for Ground-Truthing and Benchmarking Document Page Segmentation,” Pattern Recogn. 31(9), 1191–1204 (1994).

    Article  Google Scholar 

  15. I. Phillips and A. Chhabra, “Empirical Performance Evaluation of Graphics Recognition Systems,” IEEE Trans. Pattern Anal. Mach. Intell. 21(9), 849–870 (1999).

    Article  Google Scholar 

  16. http://en.wikipedia.org/wiki/Precision_and_recall

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Vil’kin, A.M., Safonov, I.V. & Egorova, M.A. Bottom-up document segmentation method based on textural features. Pattern Recognit. Image Anal. 21, 565–568 (2011). https://doi.org/10.1134/S1054661811021124

Download citation

  • Received:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1134/S1054661811021124

Keywords

Navigation