Published May 15, 2021 | Version 1.0.0
Dataset Open

Single-Writer Strikethrough Dataset

  • 1. Uppsala University

Description

This dataset contains registered pairs of clean and struck-through handwritten words which can be used for the task of (unpaired) strikethrough removal. The text, a passage from Bram Stoker's Dracula, was written by a single writer, using a blue ballpoint pen on regular white paper. The 756 word images have been systematically struck through, using one of the following stroke types: single horizontal line, double horizontal lines, diagonal, cross, wave, zigzag, scratch.

The dataset has been split into three subsets, train, validation and test, each balanced with regard to the number of samples per stroke type. Each split contains csv-files that indicate the stroke type that has been applied to a particular image.

Files

README.md

Files (19.8 MB)

Name Size Download all
md5:12cedd2198470f645d1a14462b8c6241
20.1 kB Download
md5:8e72051cb78d19153e2ad2091b137ca9
2.2 kB Preview Download
md5:bbd70d28eaeabe9d4c5f496c86eea613
52.7 kB Preview Download
md5:c2ec95cc1a35fe33042a5185b83ca6ff
10.8 MB Preview Download
md5:50591b8cadfe5f46d71f34bab3534b52
5.1 MB Preview Download
md5:00ca24d37122d80b24ff87a8865d646e
3.8 MB Preview Download