There is a newer version of the record available.

Published March 7, 2024 | Version v1
Dataset Open

Wiki-TabNER dataset

Contributors

Contact person:

Data curator:

Description

This is the dataset described in the paper Wiki-TabNER:Advancing Table Interpretation Through Named Entity Recognition

It Is a dataset containing tables extracted from Wikipedia pages and annotated with Dbpedia entity types. It can be used for solving NER within tables and for the entity linking task. The file dataset_entities_labeled.csv contains all the linked entities that are mentioned in the tables and their corresponding Wikipedia IDs. This file is for the evaluation of the entity linking task.

Files

dataset_entities_labeled.csv

Files (2.5 GB)

Name Size Download all
md5:d5a0ac2319dcc222f798b0f2a36b6112
20.2 MB Preview Download
md5:41ee99347707053b9c93ede00d7da75d
278.2 MB Preview Download
md5:868df206c1d667e6a3c630f2535134e3
13.3 MB Preview Download
md5:22d47d1d805657e3df9b234085f45245
2.2 GB Preview Download

Additional details

Related works

Is published in
Publication: arXiv:2403.04577 (arXiv)