Dataset Open Access

dhlab-epfl/LinkedBooksReferenceParsing: 1.1

Colavizza, Giovanni; Romanello, Matteo

Data curator(s)
Babetto, Martina; Ferronato, Silvia

A dataset of annotated references (in both reference lists and footnotes) from journal issues and monographs on the history of Venice, created in the context of the LinkedBooks project (http://dhlab.epfl.ch/page-127959-en.html). The dataset contains annotated reference lists of monographs and annotated references from the footnotes of journal issues from the following journals (mostly, but not exclusively in Italian): Ateneo Veneto, Archvio Veneto, Studi Veneziani. This dataset was digitized, OCRed (using ABBYY FineReader) and annotated (using Brat ADD) from 2014 to 2016. Along the dataset of annotations, a framework to train your own parsers is provided, based on Conditional Random Fields. Feel free to use it to build your own parser, and if you improve on our results, please let us know!

Files (18.9 MB)
Name Size
dhlab-epfl/LinkedBooksReferenceParsing-1.1.zip
md5:169911236679f38441c6c96371ebae70
18.9 MB Download
13
1
views
downloads
All versions This version
Views 1312
Downloads 11
Data volume 18.9 MB18.9 MB
Unique views 1312
Unique downloads 11

Share

Cite as