Mask Dataset for: Unconstrained Text Detection in Manga: a New Dataset and Baseline

doi:10.5281/zenodo.4511796

Published February 6, 2021 | Version 1.0

Dataset Open

Mask Dataset for: Unconstrained Text Detection in Manga: a New Dataset and Baseline

1. Universidad de Buenos Aires
2. Universidad Nacional de Luján

This is the dataset used in out paper "Unconstrained Text Detection in Manga: a New Dataset and Baseline". It contains 450 images with the text segmentation of images from Manga109 dataset (need to request access to this dataset in order to view original manga image).

Pre-processed version of the images is how they were saved straight out of GIMP. These were later processed before using for training.

Post-processed version of the images is after automatically removing small connected components and filling small holes. They are also slightly bigger in width/height in order to be multiples of 8.

The text is split in 2 colors: black and pink. Text in black represents text we consider easy to recognize, which is mostly when inside a speech bubble. Text in pink represents text we consider harder to detect, such as text in covers, sound effects or text outside speech bubbles.

Further details can be found in our paper.

Files

post-processed.zip

Files (19.2 MB)

Name	Size	Download all
post-processed.zip md5:6555af6b2aa6d80d87fe7a4ce0b886a7	11.3 MB	Preview Download
pre-processed.zip md5:8d878735631bfd5fa98398ae5203a53d	7.9 MB	Preview Download

Additional details

Is referenced by: Conference paper: 10.1007/978-3-030-67070-2_38 (DOI)

Views

358

Downloads

Show more details

	All versions	This version
Views	1,410	1,404
Downloads	358	356
Data volume	5.1 GB	5.1 GB

More info on how stats are collected....

DOI

Resource type

Dataset

Publisher

Zenodo

Published in

ECCV 2020: Computer Vision – ECCV 2020 Workshops. Lecture Notes in Computer Science. Springer., 12537, 629-646, 2021.

Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: February 6, 2021
Modified: February 7, 2021

Mask Dataset for: Unconstrained Text Detection in Manga: a New Dataset and Baseline

Creators

Description

Files

post-processed.zip

Files (19.2 MB)

Additional details

Related works