Tough Tables: Carefully Benchmarking Semantic Table Annotators
- 1. University of Milano - Bicocca
- 2. Bocconi University
- 3. City, University of London
Description
Tough Tables (2T) is a dataset designed to evaluate table annotation approaches on the CEA task.
The dataset is compliant with the data format used in SemTab2019, and it can be used as an additional dataset without any modification. Annotations are based on DBpedia 2016-10.
Note on License: This dataset includes data from the following sources. Refer to each source for license details:
- Wikipedia https://www.wikipedia.org/
- DBpedia http://dbpedia.org/
- SemTab2019 https://doi.org/10.5281/zenodo.3518539
- GeoDatos https://www.geodatos.net
- The Pudding https://pudding.cool/
- Offices.net https://offices.net
- DATA.GOV https://www.data.gov/
THIS DATA IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
Notes
Files
2T.zip
Files
(1.8 MB)
Name | Size | Download all |
---|---|---|
md5:c6246b90379963df7b40f4e509ded921
|
1.8 MB | Preview Download |