Comic Character Recognition (CCR): Extraction of Speech Balloon Context and Character of Interest in Comics

Baskaran, R.; Karthika Devi, M. S.

doi:10.1007/978-981-99-1588-0_34

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1447))

Included in the following conference series:

International Conference on Intelligent Computing and Communication

139 Accesses

Abstract

Document analysis is an active field of research which can attain a complete understanding of semantics of a given document. Further, digitalization of physical forms of various data has become vital in today’s data extraction. Comic digitalization is becoming widespread as it is one of the easily understandable graphic content and is attention seeking from early education readers to middle aged groups. Extracting the frames, panels, and speech balloons from digital comic is crucial for techniques that felicitate comic reading. However, Automatic Panel Extraction for Digital Comic is challenging, largely because of its layout design, visual symbols, speech balloons attached to almost all the panels, throughout the page. In this proposed work, it is proposed to automatically extract panels from digital pages using contour analysis and watershed canny operator. The first method identifies the difference between frames associated with the panels, whereas the second method identifies the difference in the color between the panel and the gutters. Speech balloons are segmented by methods of K-means clustering and contour analysis. K-means clustering is used to identify the closest related components, and contour analysis is used to differentiate the speech balloons from the other components in the comic panel. Text Area Recognition is a subtle approach that is implemented by Optical Character Recognition (OCR). And finally, comic character extraction has three steps of annotating the object, training the model, and detection of the comic character. The annotation is performed by bounding box algorithm, followed by training of the custom model using a pre-trained YOLOV3 algorithm. Once the custom model is trained, it is provided with comic strips as input to detect the dominant characters with their probability of confidence, which is the correctness of comic character. The implementation considerably serves better in performance when compared to the previous traditional methods of comic component analysis and extraction. Evaluation on the same has also been performed for the trained as well as input dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 189.00; Price excludes VAT (USA)

Softcover Book: USD 249.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Rigaud C, Tsopze N, Burie JC, Ogier JC (2011) Robust frame and text extraction from comic books. In: Proceedings of international workshop on graphics recognition, pp 129–138
Google Scholar
Li L, Wang Y, Tang Z, Lu X, Gao L (2013) Unsupervised speech text localization in comic images. In: Proceedings of the IEEE international conference on document analysis and recognition, pp 1190–1194
Google Scholar
Devi MK, Fathima S, Baskaran R (2020) CBCS-comic book cover synopsis: generating synopsis of a comic book with unsupervised abstractive dialogue. Procedia Comput Sci 172:701–708
Article Google Scholar
Takahashi A, Oie T, Hirano K, Higuchi M, Kawasaki S, Koike A, Murukami H (2011) Fast frame decomposition and sorting by contour tracing for mobile phone comic images. Int J Syst Appl Eng Develop 5:216–223
Google Scholar
Li L, Wang L, Tang Z, Gao L (2014) Automatic comic page segmentation based on polygon detection. Multimedia Tools Appl 69(1):171–197
Google Scholar
Han E, Kim K, Yang H, Jung K (2007) Frame segmentation used MLP based X-Y recursive for mobile cartoon content. In: Proceedings of the international conference on human-computer interaction, pp 872–881
Google Scholar
Arai K, Tolle H (2011) Method for real time text extraction of digital manga comic. Int J Image Process, pp 669–676
Google Scholar
Augereau O, Iwata M, Kise K (2018) A survey of comics research in computer science. arXiv preprint arXiv: 1804.05490
Google Scholar
Ponsard C, Ramdoyal R, Dziamski D (2012) An OCR-enabled digital comic books viewer. In: proceedings of international conference on computers for handicapped persons, pp 471–478
Google Scholar
Pang X, Cao Y, Lau WR, Antoni B (2014) A robust panel extraction method for manga. In: Proceedings of the ACM international conference on multimedia, pp 1125–1128
Google Scholar
Wang Y, Zhou Y, Tang Z (2015) Comic frame extraction via line segments combination. In: Proceedings of IEEE international conference on document analysis and recognition, pp 856–860
Google Scholar
Smith R (2007) An Overview of the tesseract OCR engine. In: Proceedings of IEEE international conference on document analysis and recognition, vol 2, pp 629–633
Google Scholar
Karthika Devi MS, Umaa Mahesswari G, Ramachandran B (2022) Dialogue extraction and translation from stories on thirukural using verb cue quote content source identifiers. In: Senjyu T, Mahalle PN, Perumal T, Joshi A (eds) ICT with intelligent applications. Smart innovation, systems and technologies, vol 248
Google Scholar
Rigaud C, Guerin C, Karatzas D, Burie JC, Ogier JM (2015) Knowledge-driven understanding of images in comic books. Int J Doc Anal Recogn (3):199–221
Google Scholar
Nguyen NV, Rigaud C, Burie JC (2018) Digital comics image indexing based on deep learning. Proc J Imaging (7)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, College of Engineering Guindy, Anna University, Chennai, Tamil Nadu, India
R. Baskaran & M. S. Karthika Devi

Authors

R. Baskaran
View author publications
You can also search for this author in PubMed Google Scholar
M. S. Karthika Devi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to M. S. Karthika Devi .

Editor information

Editors and Affiliations

Department of Computer Science, G. Narayanamma Institute of Technology and Science (Autonomous), Hyderabad, India
M. Seetha
Department of Computer Science, Indian Institute of Technology Roorkee, Roorkee, India
Sateesh K. Peddoju
Faculty of Data Science, San Francisco Bay Area, California, CA, USA
Vishnu Pendyala
Department of Electronics and Communication Engineering, Raghu Institute of Technology, Visakhapatnam, Andhra Pradesh, India
Vedula V. S. S. S. Chakravarthy

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Baskaran, R., Karthika Devi, M.S. (2023). Comic Character Recognition (CCR): Extraction of Speech Balloon Context and Character of Interest in Comics. In: Seetha, M., Peddoju, S.K., Pendyala, V., Chakravarthy, V.V.S.S.S. (eds) Intelligent Computing and Communication. ICICC 2022. Advances in Intelligent Systems and Computing, vol 1447. Springer, Singapore. https://doi.org/10.1007/978-981-99-1588-0_34

Download citation

DOI: https://doi.org/10.1007/978-981-99-1588-0_34
Published: 20 September 2023
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-1587-3
Online ISBN: 978-981-99-1588-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics