There is a newer version of the record available.

Published May 22, 2020 | Version 0.1-pre
Dataset Open

Tough Tables: Carefully Benchmarking Semantic Table Annotators

  • 1. University of Milano - Bicocca
  • 2. Bocconi University
  • 3. City, University of London

Description

Tough Tables (2T) is a dataset designed to evaluate table annotation approaches on the Cell-Entity Annotation (CEA) task.
The dataset is compliant with the data format used in SemTab2019, and it can be used as an additional dataset without any modification. Annotations are based on DBpedia 2016-10.

Note on License: This dataset includes data from the following sources. Refer to each source for license details:
- Wikipedia https://www.wikipedia.org/
- DBpedia http://dbpedia.org/
- SemTab2019 https://doi.org/10.5281/zenodo.3518539
- GeoDatos https://www.geodatos.net
- The Pudding https://pudding.cool/
- Offices.net https://offices.net
- DATA.GOV https://www.data.gov/

THIS DATA IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

Notes

Draft submission. This version (v0.1-pre) contains tables only, without the ground truth (GT). A new version containing the GT will be published at the end of SemTab2020 (https://www.cs.ox.ac.uk/isg/challenges/sem-tab/2020/index.html).

Files

2T.zip (1.8 MB)
md5:c6246b90379963df7b40f4e509ded921
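
The MD5 checksum listed for 2T.zip can be used to confirm that a download is intact. The following is a minimal sketch, assuming the archive has been saved as 2T.zip in the current working directory; the helper names md5_of_file and verify_archive are illustrative and not part of the dataset.

```python
import hashlib
from pathlib import Path

# Checksum published on this record for 2T.zip.
EXPECTED_MD5 = "c6246b90379963df7b40f4e509ded921"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_archive(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Assumed local file name; adjust to wherever the archive was saved.
    archive = Path("2T.zip")
    if archive.exists():
        print("checksum OK" if verify_archive(archive) else "checksum mismatch")
    else:
        print(f"{archive} not found; download it first")
```

A mismatch usually points to an incomplete download, or to an archive from a different version of the record.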