There is a newer version of the record available.

Published May 12, 2023 | Version 2.0
Dataset Open

Polarity Dataset v2.0

  • 1. Cornell

Description

Preliminary steps were taken to remove rating information from the text files, but only the rating information upon which the rating decision was based is guaranteed to have been removed. Thus, if the original review contains several instances of rating information, potentially given in different forms, those not recognized as valid ratings remain part of the review text. The reviews are split into sentences in the .csv file, which are labeled with the review they come from, as well as the sentiment of the overall review.

Files

movie_review.csv

Files (9.2 MB)

Name Size Download all
md5:ba24c5ffac2bdaaf415c12bae4abcd69
9.2 MB Preview Download

Additional details

References

  • Bo Pang and Lillian Lee, A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts, Proceedings of ACL 2004.
  • Bo Pang, Lillian Lee, and Shivakumar Vaithyanathan, Thumbs up? Sentiment Classification using Machine Learning Techniques, Proceedings of EMNLP 2002.