Combining OCR Models for Reading Early Modern Printed Books

Seuret, Mathias; van der Loop, Janne; Weichselbaumer, Nikolaus; Mayr, Martin; Molnar, Janina; Hass, Tatjana; Kordon, Florian; Nicolau, Anguelos; Christlein, Vincent

doi:10.1007/978-3-031-41734-4_21

Computer Science > Computer Vision and Pattern Recognition

arXiv:2305.07131 (cs)

[Submitted on 11 May 2023]

Title:Combining OCR Models for Reading Early Modern Printed Books

Authors:Mathias Seuret, Janne van der Loop, Nikolaus Weichselbaumer, Martin Mayr, Janina Molnar, Tatjana Hass, Florian Kordon, Anguelos Nicolau, Vincent Christlein

View PDF

Abstract:In this paper, we investigate the usage of fine-grained font recognition on OCR for books printed from the 15th to the 18th century. We used a newly created dataset for OCR of early printed books for which fonts are labeled with bounding boxes. We know not only the font group used for each character, but the locations of font changes as well. In books of this period, we frequently find font group changes mid-line or even mid-word that indicate changes in language. We consider 8 different font groups present in our corpus and investigate 13 different subsets: the whole dataset and text lines with a single font, multiple fonts, Roman fonts, Gothic fonts, and each of the considered fonts, respectively. We show that OCR performance is strongly impacted by font style and that selecting fine-tuned models with font group recognition has a very positive impact on the results. Moreover, we developed a system using local font group recognition in order to combine the output of multiple font recognition models, and show that while slower, this approach performs better not only on text lines composed of multiple fonts but on the ones containing a single font only as well.

Comments:	Accepted to ICDAR23
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2305.07131 [cs.CV]
	(or arXiv:2305.07131v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2305.07131
Journal reference:	Document Analysis and Recognition - ICDAR 2023. ICDAR 2023. Lecture Notes in Computer Science, vol 14191. Springer, Cham
Related DOI:	https://doi.org/10.1007/978-3-031-41734-4_21

Submission history

From: Vincent Christlein [view email]
[v1] Thu, 11 May 2023 20:43:50 UTC (867 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Combining OCR Models for Reading Early Modern Printed Books

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Combining OCR Models for Reading Early Modern Printed Books

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators