
Dataset Open Access

Webis Known-Item Question Corpus 2013 (Webis-KIQC-13)

Hagen, Matthias; Stein, Benno

MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <datafield tag="999" ind1="C" ind2="5">
    <subfield code="x">Matthias Hagen, Daniel Wägner, and Benno Stein. A Corpus of Realistic Known-Item Topics with Associated Web Pages in the ClueWeb09. In Advances in Information Retrieval. 37th European Conference on IR Research (ECIR 2015) volume 9022 of Lecture Notes in Computer Science, pages 741-754, Berlin Heidelberg New York, March 2015. Springer</subfield>
  </datafield>
  <datafield tag="041" ind1=" " ind2=" ">
    <subfield code="a">eng</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">questions</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">yahoo</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">known-item</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Yahoo! Answers</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">annotation</subfield>
  </datafield>
  <controlfield tag="005">20200124192255.0</controlfield>
  <controlfield tag="001">3254421</controlfield>
  <datafield tag="711" ind1=" " ind2=" ">
    <subfield code="g">ECIR 2015</subfield>
    <subfield code="a">Advances in Information Retrieval. 37th European Conference on IR Research</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Bauhaus-Universität Weimar</subfield>
    <subfield code="0">(orcid)0000-0001-9033-2217</subfield>
    <subfield code="a">Stein, Benno</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">1241508</subfield>
    <subfield code="z">md5:459da425d3e0b3fe245d2d9335e6444a</subfield>
    <subfield code="u">https://zenodo.org/record/3254421/files/corpus-webis-kiqc-13.zip</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2015-04-02</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">openaire_data</subfield>
    <subfield code="p">user-webis</subfield>
    <subfield code="o">oai:zenodo.org:3254421</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="u">Bauhaus-Universität Weimar</subfield>
    <subfield code="0">(orcid)0000-0002-9733-2890</subfield>
    <subfield code="a">Hagen, Matthias</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">Webis Known-Item Question Corpus 2013 (Webis-KIQC-13)</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">user-webis</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u">https://creativecommons.org/licenses/by/4.0/legalcode</subfield>
    <subfield code="a">Creative Commons Attribution 4.0 International</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2">opendefinition.org</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;The Webis Known-Item Question Corpus 2013 (Webis-KIQC-13) contains annotations for 2,755 questions posted on Yahoo! Answers. For each question, 2 annotators were asked to categorize the question as having a known-item information need or not, to identify a ClueWeb09 website representing the known item, and whether false memories are contained in the description of the need. The corpus represents the decisions of the annotators who had discussions for the few questions on which they did not agree initially.&lt;/p&gt;

&lt;p&gt;The corpus contains the IDs of the ClueWeb09 documents representing the known item and an annotated categorization and correction for questions with a false memory.&lt;/p&gt;</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isVersionOf</subfield>
    <subfield code="a">10.5281/zenodo.3254420</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.5281/zenodo.3254421</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">dataset</subfield>
  </datafield>
</record>
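
Field 856 of the export carries the archive's URL, its size in bytes, and an MD5 checksum. As a minimal sketch, assuming Python's standard library and a well-formed excerpt of the record, the following shows one way to read those subfields and check a downloaded archive against them. The names `SAMPLE`, `file_info`, and `matches_checksum` are hypothetical helpers introduced for illustration; they are not part of Zenodo or of the corpus.

```python
import hashlib
import xml.etree.ElementTree as ET

# Namespace declared on the <record> element of the export.
MARC_NS = {"marc": "http://www.loc.gov/MARC21/slim"}

# Well-formed excerpt of the record (field 856 only), embedded for illustration.
SAMPLE = """<record xmlns="http://www.loc.gov/MARC21/slim">
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">1241508</subfield>
    <subfield code="z">md5:459da425d3e0b3fe245d2d9335e6444a</subfield>
    <subfield code="u">https://zenodo.org/record/3254421/files/corpus-webis-kiqc-13.zip</subfield>
  </datafield>
</record>"""


def file_info(marc_xml):
    """Return URL, size, and checksum from the first 856 field of a record."""
    root = ET.fromstring(marc_xml)
    field = root.find("marc:datafield[@tag='856']", MARC_NS)
    subs = {sf.get("code"): sf.text for sf in field.findall("marc:subfield", MARC_NS)}
    # Subfield z has the form "<algorithm>:<hex digest>".
    algo, _, digest = subs["z"].partition(":")
    return {"url": subs["u"], "size": int(subs["s"]), "algo": algo, "digest": digest}


def matches_checksum(data, info):
    """Check that the bytes have the recorded length and digest."""
    if len(data) != info["size"]:
        return False
    return hashlib.new(info["algo"], data).hexdigest() == info["digest"]


info = file_info(SAMPLE)
print(info["url"], info["size"], info["algo"])
```

After downloading the archive, `matches_checksum(open(path, "rb").read(), info)` would report whether the local copy agrees with the record; `path` here stands for wherever the file was saved.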
                   All versions   This version
Views              227            227
Downloads          21             21
Data volume        26.1 MB        26.1 MB
Unique views       220            220
Unique downloads   21             21

