Conference paper Open Access

Performance Comparison of Ad-hoc Retrieval Models over Full-text vs. Titles of Documents

Saleh, Ahmed; Beck, Tilman; Galke, Lukas; Scherp, Ansgar


MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nam##2200000uu#4500</leader>
  <datafield tag="942" ind1=" " ind2=" ">
    <subfield code="a">2019-11-25</subfield>
  </datafield>
  <datafield tag="041" ind1=" " ind2=" ">
    <subfield code="a">eng</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Information Retrieval</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Learning to Rank</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Deep Learning</subfield>
  </datafield>
  <controlfield tag="005">20200120171528.0</controlfield>
  <datafield tag="500" ind1=" " ind2=" ">
    <subfield code="a">This is the author's version of the work. It is posted here for your personal use, not for redistribution. The definitive Version of Record was published in the proceedings of the International Conference on Asian Digital Libraries ICADL 2018,   https://doi.org/10.1007/978-3-030-04257-8_30.</subfield>
  </datafield>
  <controlfield tag="001">2547476</controlfield>
  <datafield tag="711" ind1=" " ind2=" ">
    <subfield code="d">19-22 November 2018</subfield>
    <subfield code="g">ICADL 2018</subfield>
    <subfield code="a">International Conference on Asian Digital Libraries</subfield>
    <subfield code="c">Hamilton, New Zealand</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">ZBW - Leibniz Information Centre for Economics</subfield>
    <subfield code="a">Beck, Tilman</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">ZBW - Leibniz Information Centre for Economics</subfield>
    <subfield code="a">Galke, Lukas</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">University of Stirling</subfield>
    <subfield code="a">Scherp, Ansgar</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">395709</subfield>
    <subfield code="z">md5:f741c99d346f3cde927ceec06921fcec</subfield>
    <subfield code="u">https://zenodo.org/record/2547476/files/2018-ICADL-Comparison-of-the-Performance-of-Ad-hoc-Retrieval-Models-over-Titles-vs-Fulltexts.pdf</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="y">Conference website</subfield>
    <subfield code="u">https://icadl2018.org</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2018-11-15</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">openaire</subfield>
    <subfield code="p">user-moving-h2020</subfield>
    <subfield code="o">oai:zenodo.org:2547476</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="u">ZBW - Leibniz Information Centre for Economics</subfield>
    <subfield code="a">Saleh, Ahmed</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">Performance Comparison of Ad-hoc Retrieval Models over Full-text vs. Titles of Documents</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">user-moving-h2020</subfield>
  </datafield>
  <datafield tag="536" ind1=" " ind2=" ">
    <subfield code="c">693092</subfield>
    <subfield code="a">Training towards a society of data-savvy information professionals to enable open leadership innovation</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u">https://creativecommons.org/licenses/by/4.0/legalcode</subfield>
    <subfield code="a">Creative Commons Attribution 4.0 International</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2">opendefinition.org</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;pre&gt;While there are many studies on information retrieval models using full-text, there are presently no comparison studies of full-text retrieval vs. retrieval only over the titles of documents. On the one hand, the full-text of documents like scientific papers is not always available due to, e.,g., copyright policies of academic publishers. &lt;/pre&gt;

&lt;pre&gt;On the other hand, conducting a search based on titles alone has strong limitations. Titles are short and therefore may not contain enough information to yield satisfactory search results. In this paper, we compare different retrieval models regarding their search performance on the full-text vs. only titles of documents. &lt;/pre&gt;

&lt;pre&gt;We use different datasets, including the three digital library datasets:  EconBiz, IREON, and PubMed. The results show that it is possible to build effective title-based retrieval models that provide competitive results comparable to full-text retrieval. The difference between the average evaluation results of the best title-based retrieval models is only 3% less than those of the best full-text-based retrieval models. &lt;/pre&gt;

&lt;pre&gt;&amp;nbsp;&lt;/pre&gt;</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.1007/978-3-030-04257-8_30</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">publication</subfield>
    <subfield code="b">conferencepaper</subfield>
  </datafield>
</record>
47
38
views
downloads
Views 47
Downloads 38
Data volume 15.0 MB
Unique views 42
Unique downloads 38

Share

Cite as