Abstract
With the invention of deep learning, optical CR (OCR) has achieved state-of-the-art performance for Latin, Arabic and Chinese scripts. However for regional languages, lack of large and high-quality training data needed to train the deep neural networks is a major barrier for the usage of deep learning. To produce a diverse synthetic data similar to realistic data, a class of deep networks called Generative Adversarial Networks (GANs) were proposed. GANs have performed efficiently for non Indic scripts and in various other fields like object detection, image translation, etc. Gurumukhi script is one of the regional Indic scripts widely used in North India and some other parts of the world like Canada, USA, etc. In this work, we have evaluated the power of GANs for generating the character dataset of regional language of India. Experimental work has been done using variants of GANs to generate the synthetic data for Gurumukhi script. Deep convolution GAN (DCGAN), Wasserstein GAN (WGAN), Least square GAN (LSGAN) and Conditional GAN (CGAN) have been evaluated for the Gurumukhi handwritten characters dataset. GANs with different network structures, training methods, loss functions and optimizers are used and compared for Gurumukhi handwritten character dataset to find the optimal parameters and results.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Kong H, Tang D, Meng X, Lu T (2019) GARN: a novel generative adversarial recognition network for end-to-end scene character recognition. In: 2019 International conference on document analysis and recognition (ICDAR). IEEE, pp 689–694
Kaur S, Bawa S, Kumar R (2019) A survey of mono-and multi-lingual character recognition using deep and shallow architectures: indic and non-indic scripts. Artif Intell Rev 1–60
Cai J, Peng L, Tang Y, Liu C, Li P (2019) TH-GAN: generative adversarial network based transfer learning for historical Chinese character recognition. In: 2019 International conference on document analysis and recognition (ICDAR). IEEE, pp 178–183
Sinha A, Jenckel M, Bukhari SS, Dengel A (2019) Unsupervised OCR model evaluation using GAN. In: 2019 International conference on document analysis and recognition (ICDAR). IEEE, pp 1256–1261
Qian Z, Huang K, Wang Q-F, Xiao J, Zhang R (2020) Generative adversarial classifier for handwriting characters super-resolution. Pattern Recogn 107:107453
Goodfellow I, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, Courville A, Bengio Y (2014) Generative adversarial nets. In: Advances in neural information processing systems, vol 27
Cheng K, Tahir R, Eric LK, Li M (2020) An analysis of generative adversarial networks and variants for image synthesis on MNIST dataset. Multimedia Tools Appl 79(19):13725–13752
Kumar M, Sharma RK, Jindal MK, Jindal SR, Singh H (2018) Benchmark datasets for offline handwritten Gurmukhi script recognition. In: Workshop on document analysis and recognition. Springer, Singapore, pp 143–151
Kumar M, Jindal MK, Sharma RK, Jindal SR (2020) Performance evaluation of classifiers for the recognition of offline handwritten Gurmukhi characters and numerals: a study. Artif Intell Rev 53(3):2075–2097
Kaur S, Verma K (2020) Handwritten Devanagari character generation using deep convolutional generative adversarial network. In: Soft computing: theories and applications. Springer, Singapore, pp 1243–1253
Rusticus D, Goldmann L, Reisser M, Villegas M (2019) Document domain adaptation with generative adversarial networks. In: 2019 International conference on document analysis and recognition (ICDAR). IEEE, pp 1432–1437
Wu S, Zhai W, Cao Y (2019) PixTextGAN: structure aware text image synthesis for license plate recognition. IET Image Process 13(14):2744–2752
Hu T, Long C, Xiao C (2021) A novel visual representation on text using diverse conditional GAN for visual recognition. IEEE Trans Image Process 30:3499–3512. https://doi.org/10.1109/TIP.2021.3061927
Dong Y, Zhang Y, Ma L, Wang Z, Luo J (2021) Unsupervised text-to-image synthesis. Pattern Recogn 110:107573
Zhao J, Wang Y, Xiao B, Shi C, Jia F, Wang C (2020) DetectGAN: GAN-based text detector for camera-captured document images. Int J Doc Anal Recogn (IJDAR) 23(4):267–277
Shao L, Liang C, Wang K, Cao W, Zhang W, Gui G, Sari H (2019) Attention GAN-based method for designing intelligent making system. IEEE Access 7:163097–163104
Alonso E, Moysset B, Messina R (2019) Adversarial generation of handwritten text images conditioned on sequences. In: 2019 International conference on document analysis and recognition (ICDAR). IEEE
Radford A, Metz L, Chintala S (2015) Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv preprint arXiv:1511.06434
Hussain S, Anees A, Das A, Nguyen BP, Marzuki M, Lin S, Wright G, Singhal A (2020) High-content image generation for drug discovery using generative adversarial networks. Neural Networks 132:353–363
Wu Q, Chen Y, Meng J (2020) DCGAN-based data augmentation for tomato leaf disease identification. IEEE Access 8:98716–98728
Arjovsky M, Chintala S, Bottou L (2017) Wasserstein GAN. arXiv:1701.07875
Mao X, Li Q, Xie H, Lau RYK, Wang Z, Smolley SP (2017) Least squares generative adversarial networks. In: Proceedings of the IEEE international conference on computer vision, pp 2794–2802
Mirza M, Osindero S (2014) Conditional generative adversarial nets. arXiv preprint arXiv:1411.1784
Acknowledgements
This research was supported by Council of Scientific and Industrial Research (CSIR) funded by the Ministry of Science and Technology (09/677(0031)/2018/EMR-I) as well as the Government of India.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Kaur, S., Bawa, S., Kumar, R. (2023). Evaluating Generative Adversarial Networks for Gurumukhi Handwritten Character Recognition (CR). In: Sisodia, D.S., Garg, L., Pachori, R.B., Tanveer, M. (eds) Machine Intelligence Techniques for Data Analysis and Signal Processing. Lecture Notes in Electrical Engineering, vol 997. Springer, Singapore. https://doi.org/10.1007/978-981-99-0085-5_41
Download citation
DOI: https://doi.org/10.1007/978-981-99-0085-5_41
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-0084-8
Online ISBN: 978-981-99-0085-5
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)