Dataset Open Access

News Headline Sentiment Dataset

Chang Wei Tan; Christoph Bergmeir; Francois Petitjean; Geoffrey I Webb


JSON-LD (schema.org) Export

{
  "description": "<p>This dataset is part of the Monash, UEA &amp;&nbsp;UCR time series regression repository.&nbsp;<a href=\"http://tseregression.org/\">http://tseregression.org/</a></p>\n\n<p>The goal of this dataset is to predict sentiment score for news headline.&nbsp;This dataset contains 83164 time series obtained from the News Popularity in Multiple Social Media Platforms dataset from the UCI repository.&nbsp;This is a large data set of news items and their respective social feedback on multiple platforms: Facebook, Google+ and LinkedIn.&nbsp;The collected data relates to a period of 8 months, between November 2015 and July 2016, accounting for about 100,000 news items on four different topics: economy, microsoft, obama and palestine.&nbsp;This data set is tailored for evaluative comparisons in predictive analytics tasks, although allowing for tasks in other research areas such as topic detection and tracking, sentiment analysis in short text, first story detection or news recommendation.&nbsp;The time series has 3 dimensions.&nbsp;<br>\n<br>\nPlease refer to <a href=\"https://archive.ics.uci.edu/ml/datasets/News+Popularity+in+Multiple+Social+Media+Platforms\">https://archive.ics.uci.edu/ml/datasets/News+Popularity+in+Multiple+Social+Media+Platforms</a>&nbsp;for more details<br>\n<br>\nCitation request<br>\nNuno Moniz and Luis Torgo (2018), Multi-Source Social Feedback of Online News Feeds, CoRR</p>", 
  "license": "https://creativecommons.org/licenses/by/4.0/legalcode", 
  "creator": [
    {
      "affiliation": "Monash University", 
      "@id": "https://orcid.org/0000-0001-8377-3241", 
      "@type": "Person", 
      "name": "Chang Wei Tan"
    }, 
    {
      "affiliation": "Monash University", 
      "@id": "https://orcid.org/0000-0002-3665-9021", 
      "@type": "Person", 
      "name": "Christoph Bergmeir"
    }, 
    {
      "affiliation": "Monash University", 
      "@id": "https://orcid.org/0000-0001-5334-3574", 
      "@type": "Person", 
      "name": "Francois Petitjean"
    }, 
    {
      "affiliation": "Monash University", 
      "@id": "https://orcid.org/0000-0001-9963-5169", 
      "@type": "Person", 
      "name": "Geoffrey I Webb"
    }
  ], 
  "url": "https://zenodo.org/record/3902718", 
  "datePublished": "2020-06-21", 
  "keywords": [
    "time series", 
    "regression"
  ], 
  "@context": "https://schema.org/", 
  "distribution": [
    {
      "contentUrl": "https://zenodo.org/api/files/9f262219-d72e-47f2-8a47-43119943d965/NewsHeadlineSentiment_TEST.ts", 
      "encodingFormat": "ts", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/9f262219-d72e-47f2-8a47-43119943d965/NewsHeadlineSentiment_TRAIN.ts", 
      "encodingFormat": "ts", 
      "@type": "DataDownload"
    }
  ], 
  "identifier": "https://doi.org/10.5281/zenodo.3902718", 
  "@id": "https://doi.org/10.5281/zenodo.3902718", 
  "@type": "Dataset", 
  "name": "News Headline Sentiment Dataset"
}
2,244
1,184
views
downloads
All versions This version
Views 2,2442,244
Downloads 1,1841,184
Data volume 44.7 GB44.7 GB
Unique views 2,0462,046
Unique downloads 794794

Share

Cite as