Published January 10, 2022 | Version v1
Dataset Open

Deezer Podcast Dataset for Topic Modeling

  • 1. Deezer Research

Description

We release a new dataset consisting of podcast metadata (title and description) for 29 539 shows. This dataset can be used to reproduce the experiments from the article Topic Modeling on Podcast Short-Text Metadata accepted at the ECIR 2022 conference.

More information about this data and how it should be used in experiments can be found in our paper and GitHub repository.

Please cite our paper if you use the code or data.

Files

Files (12.0 MB)

Name Size Download all
md5:d161ba83e0dfc9efb73f993a6c387dff
12.0 MB Download