There is a newer version of the record available.

Published August 25, 2021 | Version v1
Dataset Open

IMDB Reviews

Creators

Description

IMDB Reviews: contains 348,415 user reviews about 50,000 movies. The scores for the movies, in a range [0,10], were discretized so that 10 classes are considered for classification. This is a highly imbalanced dataset.

The files:
texts.txt: Document set (text). One per line.
score.txt: Document class whose index is associated with texts.txt
split_<k>.pkl:  pandas DataFrame with k-cross validation partition.

Files

score.txt

Files (613.0 MB)

Name Size Download all
md5:5c8a43a6022e6d91df63b95bd14b6d21
760.1 kB Preview Download
md5:80a6359e9bb1018abdc5c23f5a7c9332
16.1 MB Download
md5:667e2056c6170280a0ceaf78943d445a
8.1 MB Download
md5:6522873758814e051d6d03102f4f51db
588.0 MB Preview Download

Additional details