Conference paper Open Access

Effect of heuristic post-processing on knowledge graph profile patterns: cross-domain study

Gollam Rabby; Farhana Keya; Vojtēc Svátek; Renzo Arturo Alva Principe


MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nam##2200000uu#4500</leader>
  <datafield tag="041" ind1=" " ind2=" ">
    <subfield code="a">eng</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Profiling · knowledge graph · ABSTAT · pattern · linguistics · COVID-19</subfield>
  </datafield>
  <controlfield tag="005">20220719152429.0</controlfield>
  <controlfield tag="001">6827777</controlfield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Prague University of Economics and Business, Prague, Czech Republic</subfield>
    <subfield code="a">Farhana Keya</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Prague University of Economics and Business, Prague, Czech Republic</subfield>
    <subfield code="a">Vojtēc Svátek</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">University of Milano - Bicocca</subfield>
    <subfield code="a">Renzo Arturo Alva Principe</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">179762</subfield>
    <subfield code="z">md5:f8a8151d5dac52b697fa32f66edacaec</subfield>
    <subfield code="u">https://zenodo.org/record/6827777/files/Effect of heuristic post-processing on knowledge graph profile patterns cross-domain study.pdf</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2022-07-13</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">openaire</subfield>
    <subfield code="p">user-nexuslinguarum</subfield>
    <subfield code="o">oai:zenodo.org:6827777</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="u">Prague University of Economics and Business, Prague, Czech Republic</subfield>
    <subfield code="a">Gollam Rabby</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">Effect of heuristic post-processing on knowledge graph profile patterns: cross-domain study</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">user-nexuslinguarum</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u">https://creativecommons.org/licenses/by/4.0/legalcode</subfield>
    <subfield code="a">Creative Commons Attribution 4.0 International</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2">opendefinition.org</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;&lt;em&gt;Sets of frequent schema-level patterns characterizing a given knowledge graph (KG) represent a central output of profiling tools such as ABSTAT, as they could provide a quick overview of the coverage of the KG and its adequacy for various tasks. However, the number of patterns may be huge, and the most frequent ones might not be the most useful ones for semantically characterizing the KG, since they might feature generic (OWL, SKOS, etc.) classes and even XML data types. We hypothesize that the pattern profile suitability for a &amp;lsquo;rapid skimming&amp;rsquo; scenario might be improved by applying a stop-list of namespaces or individual schema IRIs by which the original pattern set is pruned. We experimented with post-processing the patterns returned by ABSTAT with regard to reducing the quantity of patterns and re-ranking the patterns appearing in the first positions of the frequency-ordered results. We processed the sets of KGs from two different domains &amp;ndash; COVID-19 and linguistics/lexicography.&lt;/em&gt;&lt;/p&gt;</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isVersionOf</subfield>
    <subfield code="a">10.5281/zenodo.6827776</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.5281/zenodo.6827777</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">publication</subfield>
    <subfield code="b">conferencepaper</subfield>
  </datafield>
</record>
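
The abstract in the record above describes pruning a frequency-ordered pattern set with a stop-list of namespaces or individual schema IRIs. A minimal Python sketch of that idea follows. It is not the authors' implementation and does not use ABSTAT's actual output format: the `Pattern` tuple, the `STOP_NAMESPACES` contents, the `example.org` IRIs, and the rule "drop a pattern if any of its three IRIs is stop-listed" are all assumptions made for illustration.

```python
from typing import Iterable, List, NamedTuple


class Pattern(NamedTuple):
    """Hypothetical schema-level pattern: (subject type, predicate, object type)."""
    subject: str
    predicate: str
    obj: str
    frequency: int


# Assumed stop-list of generic namespaces (OWL, SKOS, XML Schema datatypes).
STOP_NAMESPACES = (
    "http://www.w3.org/2002/07/owl#",
    "http://www.w3.org/2004/02/skos/core#",
    "http://www.w3.org/2001/XMLSchema#",
)


def is_stopped(iri: str,
               stop_namespaces: Iterable[str] = STOP_NAMESPACES,
               stop_iris: Iterable[str] = ()) -> bool:
    """Return True if the IRI is stop-listed directly or via its namespace."""
    return iri in set(stop_iris) or iri.startswith(tuple(stop_namespaces))


def prune_patterns(patterns: Iterable[Pattern],
                   stop_namespaces: Iterable[str] = STOP_NAMESPACES,
                   stop_iris: Iterable[str] = ()) -> List[Pattern]:
    """Drop patterns containing any stop-listed IRI, then re-rank by frequency."""
    namespaces = tuple(stop_namespaces)
    iris = frozenset(stop_iris)
    kept = [
        p for p in patterns
        if not any(is_stopped(i, namespaces, iris)
                   for i in (p.subject, p.predicate, p.obj))
    ]
    # Highest frequency first, mirroring a frequency-ordered profile.
    return sorted(kept, key=lambda p: p.frequency, reverse=True)


# Illustrative data only; these IRIs and counts are invented.
sample = [
    Pattern("http://www.w3.org/2002/07/owl#Thing",
            "http://www.w3.org/2000/01/rdf-schema#label",
            "http://www.w3.org/2001/XMLSchema#string", 1000),
    Pattern("http://example.org/Lexeme",
            "http://example.org/hasSense",
            "http://example.org/Sense", 40),
    Pattern("http://example.org/Paper",
            "http://example.org/mentions",
            "http://example.org/Virus", 75),
]

pruned = prune_patterns(sample)
for p in pruned:
    print(p.frequency, p.subject, p.predicate, p.obj)
```

In this sketch the most frequent pattern is removed because it involves OWL and XML Schema IRIs, so the two domain-specific patterns move to the top of the ranking. A stricter or looser matching rule (for example, checking only the subject and object types) would be an equally plausible reading of the abstract.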