Engineering Drawing Text Detection via Better Feature Fusion

Wang, Hainan; Shan, Hua; Song, Yu; Meng, Yue; Wu, Mei

doi:10.1007/978-3-031-36819-6_23

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13925))

Included in the following conference series:

International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems

325 Accesses
1 Citations

Abstract

In recent years, text detection technology has advanced significantly. However, research on text detection of engineering drawings is lacking. The challenges faced by engineering drawing text detection are the degradation of partial occlusion and adhesion within texts, as well as the complex background noise. To address this problem, we propose an end-to-end text detection framework for degraded drawings based on multiscale feature fusion and instance segmentation, which adopts pluggable and stackable multiscale feature fusion modules to enhance the accuracy of the degraded text. We conduct experiments on several benchmarks to demonstrate the effectiveness of the proposed method on degraded drawing text and natural scene text.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 64.99; Price excludes VAT (USA)

Softcover Book: USD 84.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Deng, D., Liu, H., Li, X., Cai, D.: Pixellink: detecting scene text via instance segmentation. In: AAAI Conference on Artificial Intelligence (2018). https://doi.org/10.1609/aaai.v32i1.12269
Lin, T.Y., Dollár, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2117–2125 (2017)
Google Scholar
Long, S., Ruan, J., Zhang, W., He, X., Wu, W., Yao, C.: Textsnake: a flexible representation for detecting text of arbitrary shapes. In: Proceedings of the European Conference on Computer Vision (ECCV), pp. 20–36 (2018). https://doi.org/10.1007/978-3-030-01216-8_2
Lyu, P., Yao, C., Wu, W., Yan, S., Bai, X.: Multi-oriented scene text detection via corner localization and region segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7553–7563 (2018). https://doi.org/10.1109/CVPR.2018.00788
Nayef, N., et al.: ICDAR 2017 robust reading challenge on multi-lingual scene text detection and script identification-RRC-MLT. In: 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), vol. 1, pp. 1454–1459. IEEE (2017). https://doi.org/10.1109/ICDAR.2017.237
Shi, B., Bai, X., Belongie, S.: Detecting oriented text in natural images by linking segments. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2550–2558 (2017). https://doi.org/10.1109/CVPR.2017.371
Tian, Z., Huang, W., He, T., He, P., Qiao, Yu.: Detecting text in natural image with connectionist text proposal network. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9912, pp. 56–72. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46484-8_4
Chapter Google Scholar
Woo, S., Park, J., Lee, J.-Y., Kweon, I.S.: CBAM: convolutional block attention module. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11211, pp. 3–19. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01234-2_1
Chapter Google Scholar
Yao, C., Bai, X., Sang, N., Zhou, X., Zhou, S., Cao, Z.: Scene text detection via holistic, multi-channel prediction. arXiv preprint arXiv:1606.09002 (2016)
Yuliang, L., Lianwen, J., Shuaitao, Z., Sheng, Z.: Detecting curve text in the wild: new dataset and new solution. arXiv preprint arXiv:1712.02170 (2017)
Zhu, Y., Du, J.: Sliding line point regression for shape robust scene text detection. In: 2018 24th International Conference on Pattern Recognition (ICPR), pp. 3735–3740. IEEE (2018). https://doi.org/10.1109/ICPR.2018.8545067

Download references

Author information

Authors and Affiliations

Jiangsu Frontier Electric Technology Co., Ltd., Nanjing, Jiangsu, China
Hainan Wang, Hua Shan, Yu Song, Yue Meng & Mei Wu

Authors

Hainan Wang
View author publications
You can also search for this author in PubMed Google Scholar
Hua Shan
View author publications
You can also search for this author in PubMed Google Scholar
Yu Song
View author publications
You can also search for this author in PubMed Google Scholar
Yue Meng
View author publications
You can also search for this author in PubMed Google Scholar
Mei Wu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hainan Wang .

Editor information

Editors and Affiliations

Universiti Teknologi Malaysia, Kuala Lumpur, Malaysia
Hamido Fujita
Shanghai University of Finance and Economics, Shanghai, China
Yinglin Wang
Fudan University, Shanghai, China
Yanghua Xiao
Texas State University, San Marcos, TX, USA
Ali Moonis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wang, H., Shan, H., Song, Y., Meng, Y., Wu, M. (2023). Engineering Drawing Text Detection via Better Feature Fusion. In: Fujita, H., Wang, Y., Xiao, Y., Moonis, A. (eds) Advances and Trends in Artificial Intelligence. Theory and Applications. IEA/AIE 2023. Lecture Notes in Computer Science(), vol 13925. Springer, Cham. https://doi.org/10.1007/978-3-031-36819-6_23

Download citation

DOI: https://doi.org/10.1007/978-3-031-36819-6_23
Published: 19 July 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-36818-9
Online ISBN: 978-3-031-36819-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Engineering Drawing Text Detection via Better Feature Fusion