Published December 4, 2023 | Version 2.0
Dataset Open

PARESv2 : PArish REgistry Survey − Historical Census Table Dataset (19th, 20th centuries) − France

  • 1. ROR icon University of La Rochelle

Contributors

Data collector:

  • 1. TEKLIA
  • 2. INED

Description

PARES Dataset v2

PARES (PArish REcord Survey) contains 535 images of handwritten census tables for years ranging from around 1650 A.D. until 1850 A.D..They come from two French cities, Vic-sur-Seille (French department of Moselle) and Echevronne (French department of Côte d'Or). While they mention very ancient times, the documents are handwritten transcriptions of even older documents and are quite recent, copied from original documents during the 1950's and 1960's for demographic studies led by the INED in France (Institut National des études démographiques − National Institute for Demographic Studies). These copies were made by only a few different writers.

The documents are damaged and exhibit different types of degradations. We identified seven different document categories we call C1 to C7. C1 and C3 are generally high-quality documents, without serious damage, consisting of about 90% of the dataset. Other categories include highly damaged documents or documents with specificities.

A notable aspect of this dataset is that the records are written using only two different physical paper templates. Categories n°1, 2, 3, 6 and 7 have 25 recordings while the categories 4 and 5 are higher and can record up to 35 recordings. C4 and C5 are the larger templates and differ from the rest of the documents.

We published a paper, Text Line Detection in Historical Index Tables: Evaluations on a New French PArish REcord Survey Dataset (PARES), in which we better describe the dataset and the tasks it's possible to run on it.

Files

Files (10.1 GB)

Name Size Download all
md5:8a09246fd01e41693ed4388401baea9d
10.0 GB Download
md5:104cfc4722caf0c011083ddd2aef18a3
44.8 MB Download
md5:3303dac94ed898f7b0695b55e838d522
4.9 MB Download
md5:4a27f2f6437c2c3261ba938d56d2e558
1.9 MB Download

Additional details

Related works

Cites
Dataset: 10.5281/zenodo.8089229 (DOI)
Is cited by
Conference paper: https://hal.science/hal-04207205 (URL)
Is supplement to
Dataset: 10.5281/zenodo.8337504 (DOI)

Dates

Submitted
2023-10-26