Dataset Open Access

DBNL OCR Data set

DBNL

A set of 220 books digitised by the DBNL (Digitale Bibliotheek voor de Nederlandse Letteren, https://dbnl.org/). The set contains the original OCR output as plain text (.txt) and the corrected version in TEI.

Files (298.4 MB)

Name                         Size        MD5
Metadata_DBNL_OCR_v1.xlsx    19.8 kB     838c6ff1153fca86074062f7238b4048
TEI.zip                      146.0 MB    cfa4f356cfa424af8e2704c0356ed82b
TXT.zip                      152.3 MB    5f2c44cdb00e0a06ca10e00441a2dcb3
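
The MD5 checksums listed for the files above can be used to confirm that a download completed intact. The following is a minimal sketch, assuming the three files were saved under their listed names in a single directory; the function names `md5_of` and `verify_downloads` are illustrative and not part of the dataset.

```python
import hashlib
from pathlib import Path

# Checksums copied from the file listing of this record.
EXPECTED_MD5 = {
    "Metadata_DBNL_OCR_v1.xlsx": "838c6ff1153fca86074062f7238b4048",
    "TEI.zip": "cfa4f356cfa424af8e2704c0356ed82b",
    "TXT.zip": "5f2c44cdb00e0a06ca10e00441a2dcb3",
}


def md5_of(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(directory="."):
    """Map each expected file name to True if it exists and its MD5 matches."""
    results = {}
    for name, expected in EXPECTED_MD5.items():
        path = Path(directory) / name
        results[name] = path.is_file() and md5_of(path) == expected
    return results


if __name__ == "__main__":
    # Assumption: the downloads are in the current working directory.
    for name, ok in verify_downloads().items():
        print(f"{name}: {'OK' if ok else 'missing or checksum mismatch'}")
```

A file reported as "missing or checksum mismatch" is either absent from the directory or differs from the published copy, in which case downloading it again is the usual remedy.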
                    All versions    This version
Views               118             118
Downloads           59              59
Data volume         6.7 GB          6.7 GB
Unique views        109             109
Unique downloads    37              37
