Dataset Open Access

DBNL OCR Data set

DBNL

A set of 220 books digitised by the DBNL (Digitale Bibliotheek voor de Nederlandse Letteren, https://dbnl.org/). The set contains the original OCR output as plain text (.txt) and the corrected version in TEI.
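To compare the OCR output with its corrected counterpart, the TEI markup first has to be reduced to plain text. The following is a minimal sketch of that step, not the DBNL's own tooling. It assumes each TEI file is well-formed XML with a `text` element holding the book content; the internal layout of the archives is not documented on this page, and the function name `tei_to_text` is hypothetical. Files that rely on a DTD or custom entities may need a different parser.

```python
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip an optional '{namespace}' prefix from an element tag."""
    return tag.rsplit("}", 1)[-1]


def tei_to_text(tei_xml):
    """Return the whitespace-normalised text content of a TEI document.

    Assumption: the book content lives under an element whose local name
    is 'text' (namespaced or not). If no such element exists, the whole
    document is used instead.
    """
    root = ET.fromstring(tei_xml)
    body = next(
        (el for el in root.iter() if local_name(el.tag) == "text"),
        root,
    )
    # itertext() yields the text of the element and all its descendants
    # in document order; split/join collapses runs of whitespace.
    return " ".join("".join(body.itertext()).split())


if __name__ == "__main__":
    sample = (
        "<TEI><teiHeader><title>Voorbeeld</title></teiHeader>"
        "<text><body><p>Een <hi>klein</hi> boek.</p></body></text></TEI>"
    )
    print(tei_to_text(sample))
```

The resulting string can then be compared with the matching .txt file, for example with `difflib` from the standard library.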

Files (298.4 MB)

Name                        Size       Checksum
Metadata_DBNL_OCR_v1.xlsx   19.8 kB    md5:838c6ff1153fca86074062f7238b4048
TEI.zip                     146.0 MB   md5:cfa4f356cfa424af8e2704c0356ed82b
TXT.zip                     152.3 MB   md5:5f2c44cdb00e0a06ca10e00441a2dcb3
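The md5 checksums listed for the files can be used to confirm that a download completed without corruption. The sketch below shows one way to do that with Python's standard library. It assumes the three files were saved under their listed names in a single directory; the helper names `md5_of_file` and `verify_downloads` are illustrative and not part of any DBNL or repository tooling.

```python
import hashlib
from pathlib import Path

# Checksums as listed on the dataset page.
EXPECTED_MD5 = {
    "Metadata_DBNL_OCR_v1.xlsx": "838c6ff1153fca86074062f7238b4048",
    "TEI.zip": "cfa4f356cfa424af8e2704c0356ed82b",
    "TXT.zip": "5f2c44cdb00e0a06ca10e00441a2dcb3",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the md5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(directory="."):
    """Map each expected file name to True (match), False (mismatch),
    or None (file not found in the directory)."""
    results = {}
    for name, expected in EXPECTED_MD5.items():
        path = Path(directory) / name
        results[name] = md5_of_file(path) == expected if path.is_file() else None
    return results


if __name__ == "__main__":
    labels = {True: "OK", False: "MISMATCH", None: "missing"}
    for name, ok in verify_downloads().items():
        print(f"{name}: {labels[ok]}")
```

Reading in chunks keeps memory use flat for the two archives, which are each around 150 MB.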
                   All versions   This version
Views              362            362
Downloads          195            195
Data volume        22.1 GB        22.1 GB
Unique views       327            327
Unique downloads   119            119
