Published September 25, 2021 | Version v1
Dataset Open

Script Classification and Writer Identification: Two Tasks for a Common Understanding of Cultural Heritage - Dataset

  • 1. Institut de Recherche et d'Histoire des Textes (CNRS)
  • 2. Brigham Young University
  • 3. Friedrich-Alexander-Universität Erlangen-Nürnberg

Description

Writer identification and script classification are usually treated as two separate and very different tasks, in palaeography as well as in computer science. Following the ICDAR competition on script classification and dating on the CLAMM corpus, this dataset provides the output produced by running two infrastructures built for large-scale script classification (medieval scripts) on a more homogeneous dataset with a focus on writer identification.

If you use this repository, its data, or its figures, please consult and cite:

Stutzmann, Dominique, Christopher Tensmeyer, and Vincent Christlein. « Writer Identification and Script Classification: Two Tasks for a Common Understanding of Cultural Heritage ». manuscript cultures, 15 (2020): 11-24. https://www.csmc.uni-hamburg.de/publications/mc/files/articles/mc15-02-stutzmann.pdf

Files

Script-Classification-Writer-Identification.zip

Files (67.0 MB)

md5:438bcae73037cfad22625bc065735edc
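
The MD5 checksum listed for the archive can be used to confirm that a download is intact. The following is a minimal sketch, assuming the archive was saved under its original file name in the current working directory; the helper names `md5_of_file` and `verify_archive` are hypothetical and not part of the dataset.

```python
import hashlib
from pathlib import Path

# File name and checksum copied from the file listing of this record.
ARCHIVE_NAME = "Script-Classification-Writer-Identification.zip"
EXPECTED_MD5 = "438bcae73037cfad22625bc065735edc"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_archive(path, expected=EXPECTED_MD5):
    """Return True if the file at `path` matches the expected MD5 checksum."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    archive = Path(ARCHIVE_NAME)  # assumed download location
    if not archive.exists():
        print(f"{archive} not found; download it first.")
    elif verify_archive(archive):
        print("Checksum OK")
    else:
        print("Checksum mismatch; the download may be corrupted.")
```

A mismatch usually points to an interrupted or corrupted download, in which case fetching the archive again is the simplest remedy.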

Additional details

Related works

Is cited by
Journal article: https://hal.archives-ouvertes.fr/hal-03320104 (URL)
References
Dataset: 10.5281/zenodo.5527690 (DOI)

References