Text Localization and Recognition in Complex Scenes Using Local Features

Zheng, Qi; Chen, Kai; Zhou, Yi; Gu, Congcong; Guan, Haibing

doi:10.1007/978-3-642-19318-7_10


Part of the book series: Lecture Notes in Computer Science (LNIP, volume 6494)

Included in the following conference series:

Asian Conference on Computer Vision


Abstract

We describe an approach that uses local features to address text localization and recognition in complex scenes. Low image quality, complex backgrounds, and variations in text make these problems challenging. Our approach includes the following stages: (1) template images are generated automatically; (2) SIFT features are extracted and matched to the template images; (3) multiple single-character areas are located using a segmentation algorithm based on multiple-size sliding sub-windows; (4) a voting and geometric verification algorithm is used to identify the final results. The framework is thus simple, skipping many steps, such as normalization, binarization, and OCR, that previous methods require. Moreover, the framework is robust, as only SIFT features are used. We evaluated our method on more than 200,000 images in three scripts (Chinese, Japanese, and Korean). We obtained an average single-character success rate of 77.3% (highest 94.1%) and an average multiple-character success rate of 63.9% (highest 89.6%).
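As an illustration of stages (3) and (4), the following minimal Python sketch shows one plausible way to combine multiple-size sliding sub-windows with voting. It is not the authors' implementation: the `Match` and `Detection` types, the function name `sliding_window_vote`, the 50% window overlap, and the `min_votes` threshold are all assumptions made for this example. The sketch also assumes that feature matching (stage 2) has already produced keypoints labelled with a template character, and it omits geometric verification.

```python
from collections import Counter
from typing import List, NamedTuple, Tuple


class Match(NamedTuple):
    """A scene keypoint matched to a template character (hypothetical type)."""
    x: float
    y: float
    label: str  # character of the template this keypoint matched


class Detection(NamedTuple):
    """A sub-window whose keypoints agree on one character (hypothetical type)."""
    x: int
    y: int
    size: int
    label: str
    votes: int


def sliding_window_vote(
    matches: List[Match],
    image_size: Tuple[int, int],
    window_sizes: Tuple[int, ...] = (32, 64),
    min_votes: int = 3,
) -> List[Detection]:
    """Slide square windows of several sizes over the image.

    Inside each window, every matched keypoint votes for its character
    label; a window is kept when the winning label has at least
    `min_votes` votes (an assumed threshold).
    """
    width, height = image_size
    detections = []
    for size in window_sizes:
        step = size // 2  # assumed 50% overlap between neighbouring windows
        for top in range(0, max(height - size, 0) + 1, step):
            for left in range(0, max(width - size, 0) + 1, step):
                votes = Counter(
                    m.label
                    for m in matches
                    if left <= m.x < left + size and top <= m.y < top + size
                )
                if not votes:
                    continue
                label, count = votes.most_common(1)[0]
                if count >= min_votes:
                    detections.append(Detection(left, top, size, label, count))
    # Windows with the strongest evidence come first.
    detections.sort(key=lambda d: d.votes, reverse=True)
    return detections
```

In a fuller pipeline, each surviving window would still need a geometric consistency check between scene and template keypoints before it is accepted, and overlapping windows that report the same character would need to be merged.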



Author information

Authors and Affiliations

School of Information Security Engineering, Shanghai Jiao Tong University, China
Qi Zheng, Kai Chen, Yi Zhou & Congcong Gu
Department of Computer Science and Engineering, Shanghai Jiao Tong University, China
Haibing Guan


Editor information

Editors and Affiliations

Technion – Israel Institute of Technology, Department of Computer Science, 32000, Haifa, Israel
Ron Kimmel
The University of Auckland, 37 Kohimarama Road, Mission Bay, 1071, Auckland, New Zealand
Reinhard Klette
National Institute of Informatics, Chiyoda, 1018430, Tokyo, Japan
Akihiro Sugimoto


About this paper

Cite this paper

Zheng, Q., Chen, K., Zhou, Y., Gu, C., Guan, H. (2011). Text Localization and Recognition in Complex Scenes Using Local Features. In: Kimmel, R., Klette, R., Sugimoto, A. (eds) Computer Vision – ACCV 2010. ACCV 2010. Lecture Notes in Computer Science, vol 6494. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19318-7_10


DOI: https://doi.org/10.1007/978-3-642-19318-7_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19317-0
Online ISBN: 978-3-642-19318-7
eBook Packages: Computer Science, Computer Science (R0)
