Dataset Open Access

MetaLink - Closure and Error Degree of 556M owl:sameAs statements

Beek, Wouter; Raad, Joe; Acar, Erman; Van Harmelen, Frank


MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nmm##2200000uu#4500</leader>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Linked Open Data</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Identity</subfield>
  </datafield>
  <controlfield tag="005">20200124192304.0</controlfield>
  <controlfield tag="001">3227976</controlfield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Vrije Universiteit Amsterdam</subfield>
    <subfield code="0">(orcid)0000-0002-7891-7738</subfield>
    <subfield code="a">Raad, Joe</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Vrije Universiteit Amsterdam</subfield>
    <subfield code="a">Acar, Erman</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Vrije Universiteit Amsterdam</subfield>
    <subfield code="0">(orcid)0000-0002-7913-0048</subfield>
    <subfield code="a">Van Harmelen, Frank</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">35980254447</subfield>
    <subfield code="z">md5:1546c60c84d57abd448478c0cc89bb86</subfield>
    <subfield code="u">https://zenodo.org/record/3227976/files/data.hdt</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">36997120717</subfield>
    <subfield code="z">md5:39121f59c72b4f3ffcc63649bc5eb3fc</subfield>
    <subfield code="u">https://zenodo.org/record/3227976/files/data.hdt.index.v1-1</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2019-04-10</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">openaire_data</subfield>
    <subfield code="p">user-linkeddata</subfield>
    <subfield code="p">user-semantic-web</subfield>
    <subfield code="o">oai:zenodo.org:3227976</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="u">Vrije Universiteit Amsterdam</subfield>
    <subfield code="0">(orcid)0000-0003-0250-9655</subfield>
    <subfield code="a">Beek, Wouter</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">MetaLink - Closure and Error Degree of 556M owl:sameAs statements</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">user-linkeddata</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">user-semantic-web</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u">https://creativecommons.org/licenses/by-sa/3.0/legalcode</subfield>
    <subfield code="a">Creative Commons Attribution Share Alike 3.0 Unported</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2">opendefinition.org</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;MetaLink is a dataset that contains metadata for a very large set of owl:sameAs links that are crawled from the LOD Cloud.&amp;nbsp;MetaLink encodes a previously published error metric for each of these links &lt;a href="https://www.cs.vu.nl/~frankh/postscript/ISWC2018.pdf"&gt;[Raad et al., 2018]&lt;/a&gt;. This error degree ranges&amp;nbsp;from 0.0 (most likely correct) till 1.0 (most likely incorrect). The idea is that the more an owl:sameAs link is isolated in the network (of all owl:sameAs links), the higher&amp;nbsp;error degree this link&amp;nbsp;will have.&amp;nbsp;Experiments shows that discarding the 1M owl:sameAs links with an error degree &amp;gt;0.99 can significantly increase the quality of the transitive closure. Also by keeping only the 400M owl:sameAs links with error degree &amp;lt;= 0.4, the resulting closure is 100% precise in several manually evaluated cases. The resulted equivalence classes from these different closures are &lt;a href="https://zenodo.org/record/3345674"&gt;publicly available online.&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;MetaLink is published in combination with LOD-a-lot, a dataset that is based on a very large crawl of a subset of the LOD Cloud. By combining MetaLink and LOD-a-lot, applications are able to make informed decisions about whether or not to follow specific links on the LOD Cloud. This dataset contains 4,352,602,452 unique triples, and is available in HDT (Header Dictionary Triples) format. It can be navigated online using the TriplyDB Linked Data hosting platform:&amp;nbsp;&lt;a href="https://krr.triply.cc/krr/metalink"&gt;https://krr.triply.cc/krr/metalink&lt;/a&gt;.&amp;nbsp;&amp;nbsp;&lt;/p&gt;

&lt;p&gt;A figure describing&amp;nbsp;the vocabulary of the MetaLink&amp;nbsp;dataset can be found &lt;a href="https://krr.triply.cc/krr/metalink/assets/5ced8dbfaea75b02bc98784f"&gt;here&lt;/a&gt;.&amp;nbsp;Classes are displayed by circles and properties are displayed by arcs. The MetaLink-specific classes and properties are displayed in red, the blue classes and properties are reused from existing vocabularies.&lt;/p&gt;</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isVersionOf</subfield>
    <subfield code="a">10.5281/zenodo.3227975</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.5281/zenodo.3227976</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">dataset</subfield>
  </datafield>
</record>
192
42
views
downloads
All versions This version
Views 192193
Downloads 4242
Data volume 1.5 TB1.5 TB
Unique views 158159
Unique downloads 1515

Share

Cite as