Published August 25, 2021 | Version v1
Dataset
Open
IMDB Reviews
Description
IMDB Reviews contains 348,415 user reviews of 50,000 movies. The movie scores, in the range [0, 10], were discretized into 10 classes for classification. The dataset is highly imbalanced.
The files:
texts.txt: Document set (text), one document per line.
score.txt: Document classes, aligned by index with the documents in texts.txt.
split_<k>.pkl: pandas DataFrame with the k-fold cross-validation partition.
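The file layout above suggests a simple way to load the corpus and check the class imbalance. The following is a minimal sketch, not an official loader: it assumes that score.txt holds one label per line, in the same order as texts.txt, and that both files are UTF-8 encoded. The function names `load_corpus` and `class_distribution` are hypothetical, and labels are kept as strings because their exact format is not documented here.

```python
from collections import Counter


def load_corpus(texts_path, scores_path, encoding="utf-8"):
    """Read documents and labels, pairing them by line position.

    Assumption: line i of the scores file is the class of line i of the texts file.
    """
    # newline="\n" keeps a stray "\r" inside a review from starting a new line.
    with open(texts_path, encoding=encoding, newline="\n") as f:
        texts = [line.rstrip("\r\n") for line in f]
    with open(scores_path, encoding=encoding) as f:
        labels = [line.strip() for line in f if line.strip()]
    if len(texts) != len(labels):
        raise ValueError(
            f"{len(texts)} documents but {len(labels)} labels; files are not aligned"
        )
    return texts, labels


def class_distribution(labels):
    """Return (label, share) pairs, most frequent class first."""
    counts = Counter(labels)
    total = sum(counts.values())
    return [(label, n / total) for label, n in counts.most_common()]
```

Calling `class_distribution(labels)` on the loaded labels gives a quick view of how skewed the 10 classes are. The split files can likely be opened with `pandas.read_pickle("split_<k>.pkl")` after substituting the actual value of k; their column layout is not described here, so it is worth inspecting the DataFrame before relying on it.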