Dataset Open Access

ArguAna TripAdvisor

Henning Wachsmuth; Martin Trenkmann; Benno Stein; Gregor Engels; Tsvetomira Palakarska


MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nmm##2200000uu#4500</leader>
  <controlfield tag="005">20200814095421.0</controlfield>
  <controlfield tag="001">3973241</controlfield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Bauhaus-Universität Weimar</subfield>
    <subfield code="a">Martin Trenkmann</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Bauhaus-Universität Weimar</subfield>
    <subfield code="a">Benno Stein</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Paderborn University</subfield>
    <subfield code="a">Gregor Engels</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Bauhaus-Universität Weimar</subfield>
    <subfield code="a">Tsvetomira Palakarska</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">12095900</subfield>
    <subfield code="z">md5:ef11039ebbd5088784cdf2d37bc0b65f</subfield>
    <subfield code="u">https://zenodo.org/record/3973241/files/arguana-tripadvisor-annotated-plus-software-v1.zip</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">9608511</subfield>
    <subfield code="z">md5:a450bbdbbf888fb171783b62aa81e332</subfield>
    <subfield code="u">https://zenodo.org/record/3973241/files/arguana-tripadvisor-annotated-v2.zip</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">257404859</subfield>
    <subfield code="z">md5:85e7c4f4142fc6bfdec1ad671cd78cdb</subfield>
    <subfield code="u">https://zenodo.org/record/3973241/files/arguana-tripadvisor-unannotated-v2.zip</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2014-04-01</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">openaire_data</subfield>
    <subfield code="o">oai:zenodo.org:3973241</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="u">Paderborn University</subfield>
    <subfield code="a">Henning Wachsmuth</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">ArguAna TripAdvisor</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u">https://creativecommons.org/licenses/by/4.0/legalcode</subfield>
    <subfield code="a">Creative Commons Attribution 4.0 International</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2">opendefinition.org</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;An&amp;nbsp;English&amp;nbsp;corpus for studying local sentiment flows and aspect-based sentiment analysis.&amp;nbsp;It contains 2100 hotel reviews balanced with respect to the reviews&amp;rsquo; sentiment scores. All reviews are segmented into subsentence-level statements that have then been manually classified as a fact, a positive, or a negative opinion. Also, all hotel aspects mentioned in the reviews have been annotated as such:&lt;/p&gt;

&lt;ul&gt;
	&lt;li&gt;arguana-tripadvisor-annotated-plus-software-v1.zip&lt;/li&gt;
	&lt;li&gt;arguana-tripadvisor-annotated-v2.zip&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;In addition, we provide nearly 200k further hotel reviews without manual annotations:&lt;/p&gt;

&lt;ul&gt;
	&lt;li&gt;v1 upon request&lt;/li&gt;
	&lt;li&gt;arguana-tripadvisor-unannotated-v2.zip&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The corpus is free-to-use for scientific purposes, not for commercial applications. In version 2,&amp;nbsp;the annotated XMI files have been changed according to a new underlying type system that is more easily extendable. Notice that some adaptations of the software of version 1 are necessary to make it work with version 2.&lt;/p&gt;

&lt;p&gt;In case you publish any results related to the ArguAna TripAdvisor corpus, please cite our CICLing 2014 &lt;a href="https://webis.de/publications.html?q=bibid:stein_2014b"&gt;paper&lt;/a&gt;.&lt;/p&gt;</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isVersionOf</subfield>
    <subfield code="a">10.5281/zenodo.3973240</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.5281/zenodo.3973241</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">dataset</subfield>
  </datafield>
</record>
87
30
views
downloads
All versions This version
Views 8787
Downloads 3030
Data volume 1.8 GB1.8 GB
Unique views 7474
Unique downloads 2020

Share

Cite as