Dataset Open Access

Annotated (Pseudo-)Transitive Relations of the LOD Cloud

Wang, Shuai; Raad, Joe; Bloem, Peter; van Harmelen, Frank

This deposit consists of the manually annotated gold standard, graph files, and evaluation results for our study of (pseudo-)transitive relations.

The gold-standard.zip file provides the files consisting of manually evaluated triples. 
The files were exported from ANNit with the column:
- LEFT* for the source URI of the triple.
- RIGHT* for the target URI of the triple.
- UserChioce for the choice of user when manually evaluated
- Decision* for the actual decision made by rater/annotator. It can only be unknown, remove, remain.
- Comment, if any.

The corresponding gold-standard.pdf file provides all the details about the creation of the gold standard. 

The folder graph_file includes the unweighted graphs, as well as the two sets of weighted graphs: the graphs with counted weights and the graphs with inferred weights (in the subdirectory of counted_weights and inferred_weights subdirectory respectively). 
The files are compressed in the format of *.gz. Each file consists of two columns of integers as the source and the target. The integers correspond to the URIs. The corresponding mapping files are in the directory mapping.

The corresponding files (of unweighted graphs) in WebGraph format are provided. These files were used when evaluating our algorithm against the exiting web-scale feedback-arc-set algorithm.

 

The source code for refinement is at https://github.com/shuaiwangvu/Refining-Transitive-Relations

Some raw data for table 2 and table 3 in our corresponding paper and their analysis are also provided.  There were two static settings of the parameter for Table 2 and we chose the first setting for the final presentation in the paper.

  • XL (static): b1 = 15,000 and b2 = 3,000
  • L (static): b1 = 1,000 and b2 = 200

Should there be any problem with these datasets, please feel free to report to us at the following email address: shuai.wang@vu.nl.

A link to the full paper will be published when the paper is accepted. 

Files (265.6 MB)
Name Size
Gold standard description.pdf
md5:9be4f011a06c499ab4372c64f8902ceb
107.8 kB Download
gold-standard.zip
md5:40058a655c95748f5541fac88eb7e630
50.2 kB Download
graph_files.zip
md5:eec21528db5dabb580d1836cf82d8dc4
206.9 MB Download
other-files-for-gold-standard.zip
md5:efdad97b05c4279157c5aa507b6cebfc
8.7 kB Download
raw-evaluation-results.pdf
md5:ef241a66d4e972a6ae832c3d0b4897f0
38.7 kB Download
raw_data_removed_edges.zip
md5:91023bedd1717bc7d27e14057e0fd8b6
20.9 MB Download
webGraphformat.zip
md5:7f4bffd89ba5f303a2f255e1cb6c19b4
37.6 MB Download
222
93
views
downloads
All versions This version
Views 222222
Downloads 9393
Data volume 1.7 GB1.7 GB
Unique views 204204
Unique downloads 8282

Share

Cite as