Bottom-up document segmentation method based on textural features

Vil’kin, A. M.; Safonov, I. V.; Egorova, M. A.

doi:10.1134/S1054661811021124

Bottom-up document segmentation method based on textural features

Applied Problems
Published: 13 September 2011

Volume 21, pages 565–568, (2011)
Cite this article

Pattern Recognition and Image Analysis Aims and scope Submit manuscript

A. M. Vil’kin¹,
I. V. Safonov¹ &
M. A. Egorova¹

94 Accesses
6 Citations
3 Altmetric
Explore all metrics

Abstract

A bottom-up approach to segmentation of a scanned document into background, text, and image regions is considered. The image is partitioned into blocks at the first step. A series of texture features is computed for each block. The block type is determined on the basis of these features. Different variants of block arrangement and size, 26 texture variables, and four block type classification algorithms have been considered. The block type is corrected on the basis of adjacent region analysis at the second step. The error matrix and ICDAR 2007 criterion are used for result estimation.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Text Segmentation for Document Recognition

Page Segmentation Techniques in Document Analysis

Consensus-based clustering for document image segmentation

Article 21 September 2016

References

Zh. Lu, I. Bazzi, A. Kornai, J. Makhoul, P. Natarajan, R. Schwartz, and A. Robust, “Language-Independent OCR System,” Electron. Imaging: Proc. SPIE 3584 (1999).
R. L. de Queiroz, R. Buckley, and M. Xu, “Mixed Raster Content (MRC) Model for Compound Image Compression,” Proc. Int. Conf. Image Processing 3653, 1106–1117 (1999).
Google Scholar
J. J. Sauvola and M. Pietikäinen, “Page Segmentation and Classification Using Fast Feature Extraction and Connectivity Analysis,” in Proc. Int. Conf. on Document Analysis and Recognition (Montreal, 1995), pp. 1127–1131.
F. Wahl, K. Wong, and R. Casey, “Block Segmentation and Text Extraction in Mixed Text/Image Document,” Comp. Graphics Image Processing 20, 375–390 (1982).
Article Google Scholar
H. S. Baird, M. A. Moll, Chang An, and M. R. Casey, “Document Image Content Inventories,” in Proc. SPIE/IS&T Document Recognition & Retrieval XIV Conf. (San Jose, CA, 2007).
L. G. Shapiro and G. C. Stockman, Computer Vision (BINOM. Knowledge Laboratory, Moscow, 2006) [in Russian].
Google Scholar
F. Cesarini, S. Marinai, G. Soda, and M. Gori, “Structural Document Segmentation and Representation by the Modified X-Y Tree,” in Proc. Int. Conf. on Document Analysis and Recognition (Bangalore, 1999), p. 563.
http://graphics.cs.msu.ru/ru/science/research/machinelearning/adaboosttollbox
http://www.csie.ntu.edu.tw/~cjln/libswm
http://www.cs.umd.edu/~mount/ANN
http://leenissen.dk/fann
http://en.wikipedia.org/wiki/Confusion-matrix
A. Antonacopoulos, B. Gatos, and D. Bridson, “ICDAR2007 Page Segmentation Competition,” in Proc. ICDAR2007 (Curitiba, 2007), pp. 1279–1283.
B. A. Yanikoglu and L. Vincent, “Pink Parameter: A Complete Environment for Ground-Truthing and Benchmarking Document Page Segmentation,” Pattern Recogn. 31(9), 1191–1204 (1994).
Article Google Scholar
I. Phillips and A. Chhabra, “Empirical Performance Evaluation of Graphics Recognition Systems,” IEEE Trans. Pattern Anal. Mach. Intell. 21(9), 849–870 (1999).
Article Google Scholar
http://en.wikipedia.org/wiki/Precision_and_recall

Download references

Author information

Authors and Affiliations

National Research Nuclear University, MEPhI, Kashirskoe sh. 31, Moscow, 115409, Russia
A. M. Vil’kin, I. V. Safonov & M. A. Egorova

Authors

A. M. Vil’kin
View author publications
You can also search for this author in PubMed Google Scholar
I. V. Safonov
View author publications
You can also search for this author in PubMed Google Scholar
M. A. Egorova
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Vil’kin, A.M., Safonov, I.V. & Egorova, M.A. Bottom-up document segmentation method based on textural features. Pattern Recognit. Image Anal. 21, 565–568 (2011). https://doi.org/10.1134/S1054661811021124

Download citation

Received: 29 December 2010
Published: 13 September 2011
Issue Date: September 2011
DOI: https://doi.org/10.1134/S1054661811021124

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Bottom-up document segmentation method based on textural features

Abstract

Access this article

Similar content being viewed by others

Text Segmentation for Document Recognition

Page Segmentation Techniques in Document Analysis

Consensus-based clustering for document image segmentation

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Bottom-up document segmentation method based on textural features

Abstract

Access this article

Similar content being viewed by others

Text Segmentation for Document Recognition

Page Segmentation Techniques in Document Analysis

Consensus-based clustering for document image segmentation

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation