Dataset Open Access

Fair RecSys Datasets

Kowald Dominik


MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nmm##2200000uu#4500</leader>
  <datafield tag="041" ind1=" " ind2=" ">
    <subfield code="a">eng</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">multimedia recommender systems</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">fairness</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">popularity bias</subfield>
  </datafield>
  <controlfield tag="005">20220302155528.0</controlfield>
  <controlfield tag="001">6123879</controlfield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">2099899</subfield>
    <subfield code="z">md5:537b5cdaf8c02e34a2552cd47eb58a82</subfield>
    <subfield code="u">https://zenodo.org/record/6123879/files/anime.zip</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">2985860</subfield>
    <subfield code="z">md5:96cddcdc4dbb8b62ea1e7b96933415e7</subfield>
    <subfield code="u">https://zenodo.org/record/6123879/files/book.zip</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">9161489</subfield>
    <subfield code="z">md5:57a773a0c30c097dfc987a3fdb0b322e</subfield>
    <subfield code="u">https://zenodo.org/record/6123879/files/lfm.zip</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">2040657</subfield>
    <subfield code="z">md5:6a879d1fc781e0b37c42bbbdc5f27deb</subfield>
    <subfield code="u">https://zenodo.org/record/6123879/files/ml.zip</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2022-02-17</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">openaire_data</subfield>
    <subfield code="o">oai:zenodo.org:6123879</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="u">Know-Center GmbH, TU Graz</subfield>
    <subfield code="0">(orcid)0000-0003-3230-6234</subfield>
    <subfield code="a">Kowald Dominik</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">Fair RecSys Datasets</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u">https://creativecommons.org/licenses/by/4.0/legalcode</subfield>
    <subfield code="a">Creative Commons Attribution 4.0 International</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2">opendefinition.org</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;Four multimedia recommender systems datasets to study popularity bias and fairness:&lt;/p&gt;

&lt;ol&gt;
	&lt;li&gt;Last.fm (lfm.zip), based on the LFM-1b dataset of JKU Linz (http://www.cp.jku.at/datasets/LFM-1b/)&lt;/li&gt;
	&lt;li&gt;MovieLens (ml.zip), based on MovieLens-1M dataset (https://grouplens.org/datasets/movielens/1m/)&lt;/li&gt;
	&lt;li&gt;BookCrossing (book.zip), based on the BookCrossing dataset of Uni Freiburg (http://www2.informatik.uni-freiburg.de/~cziegler/BX/)&lt;/li&gt;
	&lt;li&gt;MyAnimeList (anime.zip), based on the MyAnimeList dataset of Kaggle (https://www.kaggle.com/CooperUnion/anime-recommendations-database)&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Each dataset contains of user interactions (user_events.txt) and three user groups that differ in their inclination to popular/mainstream items: LowPop (low_main_users.txt), MedPop (med_main_users.txt), and HighPop (high_main_users.txt).&lt;/p&gt;

&lt;p&gt;The format of the three user files are &amp;quot;user,mainstreaminess&amp;quot;&lt;/p&gt;

&lt;p&gt;The format of the user-events files are &amp;quot;user,item,preference&amp;quot;&lt;/p&gt;

&lt;p&gt;Example Python-code for analyzing the datasets as well as more information on the user groups can be found on Github (https://github.com/domkowald/FairRecSys) and on Arxiv (https://arxiv.org/abs/2203.00376)&lt;/p&gt;

&lt;p&gt;&amp;nbsp;&lt;/p&gt;

&lt;p&gt;&amp;nbsp;&lt;/p&gt;</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isVersionOf</subfield>
    <subfield code="a">10.5281/zenodo.6123878</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.5281/zenodo.6123879</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">dataset</subfield>
  </datafield>
</record>
306
38
views
downloads
All versions This version
Views 306306
Downloads 3838
Data volume 145.6 MB145.6 MB
Unique views 252252
Unique downloads 2121

Share

Cite as