Dataset Open Access

Past Written Texts Dataset

John Ellul; Marina Polycarpou


JSON-LD (schema.org) Export

{
  "description": "<p>The dataset consists of features extracted from older adults&rsquo; text.</p>\n\n<p>The texts were written by the older person either in an electronic mean (eg. older e-mail), or in paper form and were transcribed by the project&#39;s clinical nurses.</p>\n\n<p>The texts were then translated to English using the MyMemory service (https://mymemory.translated.net/), and a series of features were generated that can be used for sentiment analysis.</p>\n\n<p>The list of fields of this dataset is presented below:</p>\n\n<p>- <strong>Part_id</strong>: The user ID, which should be a 4-digit number</p>\n\n<p>- <strong>Date</strong>: The recording date, which follows the &ldquo;DD-MM-YY&rdquo; format (eg. 14 September 2017, is formatted as 14-09-17)</p>\n\n<p>- <strong>Clinical_visit</strong>: As several clinical evaluations were performed to each older adult, this number shows for which clinical evaluation these measurements refer to</p>\n\n<p>- <strong>Transcript</strong>: If the text was written by the older adult (0) or was transcribed by a nurse (1)</p>\n\n<p>- <strong>Language</strong>: The original language of the text (0 = Greek)</p>\n\n<p>- <strong>Text_length, Number_of_sentences, Number_of_words, Number_of_words_per_sentence, Text_entropy</strong>: Statistical Measures</p>\n\n<p>- <strong>Desc_image_ENG_sentiment, Desc_event_sentiment, Prev_text_ENG_sentiment</strong>: Sentiment Analysis</p>\n\n<p>- <strong>Tf-XX</strong>: Term frequency &ndash; Inverse document frequency</p>\n\n<p>- <strong>Tf-pos-XX</strong>: Part of Speech analysis, using tf-idf methodology</p>", 
  "license": "http://creativecommons.org/licenses/by/4.0/legalcode", 
  "creator": [
    {
      "affiliation": "University of Patras", 
      "@type": "Person", 
      "name": "John Ellul"
    }, 
    {
      "affiliation": "Materia Group Cyprus", 
      "@type": "Person", 
      "name": "Marina Polycarpou"
    }
  ], 
  "url": "https://zenodo.org/record/2670061", 
  "datePublished": "2019-05-07", 
  "contributor": [
    {
      "affiliation": "Univerity of Patras", 
      "@type": "Person", 
      "name": "Evangelia I. Zacharaki"
    }
  ], 
  "keywords": [
    "social media sensing", 
    "sentiment analysis", 
    "text-based sentiment analysis"
  ], 
  "@context": "https://schema.org/", 
  "distribution": [
    {
      "contentUrl": "https://zenodo.org/api/files/9e269077-eed4-4254-b56f-39971d3a728c/Social Media Sensing Texts.csv", 
      "@type": "DataDownload", 
      "fileFormat": "csv"
    }
  ], 
  "identifier": "https://doi.org/10.5281/zenodo.2670061", 
  "@id": "https://doi.org/10.5281/zenodo.2670061", 
  "@type": "Dataset", 
  "name": "Past Written Texts Dataset"
}
56
30
views
downloads
All versions This version
Views 5656
Downloads 3030
Data volume 86.8 kB86.8 kB
Unique views 4343
Unique downloads 2525

Share

Cite as