Published March 12, 2022 | Version 0.2
Dataset
Open
MOBO: The MOvie and BOok reviews dataset
Description
MOBO, the MOvie and BOok reviews dataset, is a collection of movie and book reviews, each paired with its related plot.
The reviews come from three publicly available datasets: Stanford's IMDB movie reviews [1], GoodReads [2], and the Amazon reviews dataset [3]. With the help of 15 annotators, we further labeled more than 18,000 review sentences (~6,000 per corpus). Each sentence is marked with its polarity (Positive or Negative), as a description of the plot of its corresponding movie or book (Plot), or as none of the above (None). The dataset folder contains an excerpt of the annotated sentences for each dataset.
Files
| Name | Size | Checksum |
|---|---|---|
| MOBO Dataset.zip | 1.8 MB | md5:08a43c7d18340052e4df3bbaa7b1edd3 |
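The MD5 checksum listed for the archive can be used to confirm that a download is intact. The following is a minimal sketch, not part of the dataset itself: the helper names `md5_of_file` and `matches_listed_checksum` are hypothetical, and the local path of the archive is an assumption.

```python
import hashlib
from pathlib import Path

# Checksum as listed on the dataset page for "MOBO Dataset.zip".
EXPECTED_MD5 = "08a43c7d18340052e4df3bbaa7b1edd3"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with Path(path).open("rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest equals the expected hex string."""
    return md5_of_file(path) == expected.lower()
```

Assuming the archive was saved to the working directory under its original name, `matches_listed_checksum("MOBO Dataset.zip")` should return `True` for an intact download.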
Additional details
Funding
- UK Research and Innovation, grant EP/V020579/1 (Turing AI Fellowship: Event-Centric Framework for Natural Language Understanding)
- UK Research and Innovation, grant EP/V048597/1 (Learning from COVID-19: An AI-enabled evidence-driven framework for claim veracity assessment during pandemics)
- UK Research and Innovation, grant EP/T017112/1 (Twenty20Insight)
References
- A Disentangled Adversarial Neural Topic Model for Separating Opinions from Plots in User Reviews (Pergola et al., NAACL 2021)