Historical document image segmentation using background light intensity normalization
17 January 2005
Proceedings Volume 5676, Document Recognition and Retrieval XII; (2005) https://doi.org/10.1117/12.585545
Event: Electronic Imaging 2005, 2005, San Jose, California, United States
Abstract
This paper presents a new binarization algorithm for camera images of historical documents, in particular those held by the Library of Congress of the United States. The algorithm applies background light intensity normalization to enhance an image before a local adaptive binarization algorithm is applied. The normalization step uses an adaptive linear or non-linear function to approximate the uneven background of the image, which arises from the uneven surface of the document paper, color changes due to aging, or uneven lighting during camera capture. The algorithm adaptively captures the background of a document image with a "best fit" approximation. The document image is then normalized with respect to that approximation before a thresholding algorithm is applied. The technique works for both grayscale and color images of historical handwritten documents and significantly improves readability for both human readers and OCR.
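The general idea of fitting a background and normalizing against it before thresholding can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no algorithmic details, so the sketch assumes the simplest linear case (a least-squares plane), a crude median-based guess at which pixels are background, and a fixed global threshold standing in for the local adaptive binarization. The function names `fit_background_plane` and `normalize_and_binarize` and the `threshold` parameter are hypothetical.

```python
import numpy as np


def fit_background_plane(gray, bg_mask):
    """Least-squares plane a*x + b*y + c through the pixels flagged in bg_mask."""
    ys, xs = np.nonzero(bg_mask)
    design = np.column_stack([xs, ys, np.ones_like(xs)]).astype(float)
    coeffs, *_ = np.linalg.lstsq(design, gray[ys, xs].astype(float), rcond=None)
    yy, xx = np.mgrid[0:gray.shape[0], 0:gray.shape[1]]
    return coeffs[0] * xx + coeffs[1] * yy + coeffs[2]


def normalize_and_binarize(gray, threshold=0.75):
    """Divide the image by a fitted background plane, then threshold.

    Returns a boolean array in which True marks pixels classified as ink.
    """
    gray = np.asarray(gray, dtype=float)
    # Assumption: pixels at or above the global median are mostly background.
    bg_mask = gray >= np.median(gray)
    background = fit_background_plane(gray, bg_mask)
    # After normalization the background sits near 1.0 and ink falls well below it.
    normalized = gray / np.maximum(background, 1e-6)
    # Assumption: a fixed global threshold replaces the local adaptive step.
    return normalized < threshold
```

On an image whose illumination rises steadily from one side to the other, a single global threshold on the raw intensities tends to mark the darker side as ink, whereas thresholding the normalized image does not. A non-linear background, as mentioned in the abstract, would need a higher-order fit in place of the plane.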
© (2005) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Zhixin Shi and Venu Govindaraju "Historical document image segmentation using background light intensity normalization", Proc. SPIE 5676, Document Recognition and Retrieval XII, (17 January 2005); https://doi.org/10.1117/12.585545
KEYWORDS
Image enhancement
Image segmentation
Detection and tracking algorithms
Image processing
Image quality
Image processing algorithms and systems
Optical character recognition