Published June 5, 2019 | Version v1
Dataset Open

DBNL OCR Data set

Creators

  • 1. KB National Library of the Netherlands

Description

A set of 220 books digitised by the Dutch DBNL (https://dbnl.org/). The set contains the original OCR output in .txt and the corrected version in TEI.

Files

TEI.zip

Files (298.4 MB)

Name Size Download all
md5:838c6ff1153fca86074062f7238b4048
19.8 kB Download
md5:cfa4f356cfa424af8e2704c0356ed82b
146.0 MB Preview Download
md5:5f2c44cdb00e0a06ca10e00441a2dcb3
152.3 MB Preview Download