Abstract
Document analysis is an active field of research which can attain a complete understanding of semantics of a given document. Further, digitalization of physical forms of various data has become vital in today’s data extraction. Comic digitalization is becoming widespread as it is one of the easily understandable graphic content and is attention seeking from early education readers to middle aged groups. Extracting the frames, panels, and speech balloons from digital comic is crucial for techniques that felicitate comic reading. However, Automatic Panel Extraction for Digital Comic is challenging, largely because of its layout design, visual symbols, speech balloons attached to almost all the panels, throughout the page. In this proposed work, it is proposed to automatically extract panels from digital pages using contour analysis and watershed canny operator. The first method identifies the difference between frames associated with the panels, whereas the second method identifies the difference in the color between the panel and the gutters. Speech balloons are segmented by methods of K-means clustering and contour analysis. K-means clustering is used to identify the closest related components, and contour analysis is used to differentiate the speech balloons from the other components in the comic panel. Text Area Recognition is a subtle approach that is implemented by Optical Character Recognition (OCR). And finally, comic character extraction has three steps of annotating the object, training the model, and detection of the comic character. The annotation is performed by bounding box algorithm, followed by training of the custom model using a pre-trained YOLOV3 algorithm. Once the custom model is trained, it is provided with comic strips as input to detect the dominant characters with their probability of confidence, which is the correctness of comic character. The implementation considerably serves better in performance when compared to the previous traditional methods of comic component analysis and extraction. Evaluation on the same has also been performed for the trained as well as input dataset.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Rigaud C, Tsopze N, Burie JC, Ogier JC (2011) Robust frame and text extraction from comic books. In: Proceedings of international workshop on graphics recognition, pp 129–138
Li L, Wang Y, Tang Z, Lu X, Gao L (2013) Unsupervised speech text localization in comic images. In: Proceedings of the IEEE international conference on document analysis and recognition, pp 1190–1194
Devi MK, Fathima S, Baskaran R (2020) CBCS-comic book cover synopsis: generating synopsis of a comic book with unsupervised abstractive dialogue. Procedia Comput Sci 172:701–708
Takahashi A, Oie T, Hirano K, Higuchi M, Kawasaki S, Koike A, Murukami H (2011) Fast frame decomposition and sorting by contour tracing for mobile phone comic images. Int J Syst Appl Eng Develop 5:216–223
Li L, Wang L, Tang Z, Gao L (2014) Automatic comic page segmentation based on polygon detection. Multimedia Tools Appl 69(1):171–197
Han E, Kim K, Yang H, Jung K (2007) Frame segmentation used MLP based X-Y recursive for mobile cartoon content. In: Proceedings of the international conference on human-computer interaction, pp 872–881
Arai K, Tolle H (2011) Method for real time text extraction of digital manga comic. Int J Image Process, pp 669–676
Augereau O, Iwata M, Kise K (2018) A survey of comics research in computer science. arXiv preprint arXiv: 1804.05490
Ponsard C, Ramdoyal R, Dziamski D (2012) An OCR-enabled digital comic books viewer. In: proceedings of international conference on computers for handicapped persons, pp 471–478
Pang X, Cao Y, Lau WR, Antoni B (2014) A robust panel extraction method for manga. In: Proceedings of the ACM international conference on multimedia, pp 1125–1128
Wang Y, Zhou Y, Tang Z (2015) Comic frame extraction via line segments combination. In: Proceedings of IEEE international conference on document analysis and recognition, pp 856–860
Smith R (2007) An Overview of the tesseract OCR engine. In: Proceedings of IEEE international conference on document analysis and recognition, vol 2, pp 629–633
Karthika Devi MS, Umaa Mahesswari G, Ramachandran B (2022) Dialogue extraction and translation from stories on thirukural using verb cue quote content source identifiers. In: Senjyu T, Mahalle PN, Perumal T, Joshi A (eds) ICT with intelligent applications. Smart innovation, systems and technologies, vol 248
Rigaud C, Guerin C, Karatzas D, Burie JC, Ogier JM (2015) Knowledge-driven understanding of images in comic books. Int J Doc Anal Recogn (3):199–221
Nguyen NV, Rigaud C, Burie JC (2018) Digital comics image indexing based on deep learning. Proc J Imaging (7)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Baskaran, R., Karthika Devi, M.S. (2023). Comic Character Recognition (CCR): Extraction of Speech Balloon Context and Character of Interest in Comics. In: Seetha, M., Peddoju, S.K., Pendyala, V., Chakravarthy, V.V.S.S.S. (eds) Intelligent Computing and Communication. ICICC 2022. Advances in Intelligent Systems and Computing, vol 1447. Springer, Singapore. https://doi.org/10.1007/978-981-99-1588-0_34
Download citation
DOI: https://doi.org/10.1007/978-981-99-1588-0_34
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-1587-3
Online ISBN: 978-981-99-1588-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)