Published April 20, 2018 | Version v1
Dataset Open

MMTF-14K: A Multifaceted Movie Trailer Dataset for Recommendation and Retrieval

  • 1. Politecnico di Milano
  • 2. University Politehnica of Bucharest
  • 3. Johannes Kepler University Linz

Description

The MMTF-14K dataset provides a stable and extensive source for devising and evaluating movie recommender systems. MMTF-14K contains audio and visual descriptors in addition to ratings and metadata for 13,623 Hollywood-type movie trailers. The dataset therefore facilitates research on content-based recommender systems, where content refers not only to metadata, but specifically to visual and auditory characteristics of movies. The data comes also with several baselines benchmarking results for uni-modal and multi-modal recommendation systems. The dataset therefore facilitates research on movie recommendation. In addition, the rich data supports the exploration of other multimedia tasks such as popularity prediction, genre classification, or auto-tagging (aka tag prediction).

The MMTF-14K dataset has been created as a joint research work by Yashar Deldjoo (Politecnico di Milano, Italy), Mihai Gabriel Constantin and Bogdan Ionescu (University Politehnica of Bucharest, Romania), Markus Schedl (Johannes Kepler University Linz, Austria), and Paolo Cremonesi (Politecnico di Milano, Italy).

We would like to acknowledge MovieLens here for providing a stable benchmark dataset of movies containing individual user ratings and metadata which is an enabler for doing research on movie recommendation. Please consider the MovieLens-20M web page for more details on the ratings and tags datasets.

For acknowledgments please use our paper:

@inproceedings{deldjooMMTF14K, 
  title={MMTF-14K: A Multifaceted Movie Trailer Feature Dataset for Recommendation and Retrieval}, 
  author={Deldjoo, Yashar and Constantin, Mihai Gabriel and Schedl, Markus and Ionescu, Bogdan and Cremonesi, Paolo}, 
  booktitle={Proceedings of the 9th ACM Multimedia Systems Conference}, 
  year={2018}, 
  organization={ACM}}

For further inquiries you are free to contact Yashar Deldjoo through his email: deldjooy@acm.org .

Notes

The link to the dataset can be also found in: https://mmprj.github.io/mtrm_dataset/index

Files

Files (4.7 GB)

Name Size Download all
md5:f87db6c087d04503aeee821322a7fc94
838.9 MB Download
md5:bd5158ecadc04618b3ce4e50d98989c0
838.9 MB Download
md5:2d12b2d0b2ba4cf388c837e5424561ea
838.9 MB Download
md5:116de11c601b4ca771679681ad2dc625
838.9 MB Download
md5:a938e0712b625f425677e6c631fd14b9
838.9 MB Download
md5:365b06004a1a3ce859397cbe741b28fd
537.9 MB Download