Dataset Open Access

Past Written Texts Dataset

John Ellul; Marina Polycarpou

JSON-LD ( Export

  "description": "<p>The dataset consists of features extracted from older adults&rsquo; text.</p>\n\n<p>The texts were written by the older person either in an electronic mean (eg. older e-mail), or in paper form and were transcribed by the project&#39;s clinical nurses.</p>\n\n<p>The texts were then translated to English using the MyMemory service (, and a series of features were generated that can be used for sentiment analysis.</p>\n\n<p>The list of fields of this dataset is presented below:</p>\n\n<p>- <strong>Part_id</strong>: The user ID, which should be a 4-digit number</p>\n\n<p>- <strong>Date</strong>: The recording date, which follows the &ldquo;DD-MM-YY&rdquo; format (eg. 14 September 2017, is formatted as 14-09-17)</p>\n\n<p>- <strong>Clinical_visit</strong>: As several clinical evaluations were performed to each older adult, this number shows for which clinical evaluation these measurements refer to</p>\n\n<p>- <strong>Transcript</strong>: If the text was written by the older adult (0) or was transcribed by a nurse (1)</p>\n\n<p>- <strong>Language</strong>: The original language of the text (0 = Greek)</p>\n\n<p>- <strong>Text_length, Number_of_sentences, Number_of_words, Number_of_words_per_sentence, Text_entropy</strong>: Statistical Measures</p>\n\n<p>- <strong>Desc_image_ENG_sentiment, Desc_event_sentiment, Prev_text_ENG_sentiment</strong>: Sentiment Analysis</p>\n\n<p>- <strong>Tf-XX</strong>: Term frequency &ndash; Inverse document frequency</p>\n\n<p>- <strong>Tf-pos-XX</strong>: Part of Speech analysis, using tf-idf methodology</p>", 
  "license": "", 
  "creator": [
      "affiliation": "University of Patras", 
      "@type": "Person", 
      "name": "John Ellul"
      "affiliation": "Materia Group Cyprus", 
      "@type": "Person", 
      "name": "Marina Polycarpou"
  "url": "", 
  "datePublished": "2019-05-07", 
  "keywords": [
    "social media sensing", 
    "sentiment analysis", 
    "text-based sentiment analysis"
  "contributor": [
      "affiliation": "Univerity of Patras", 
      "@type": "Person", 
      "name": "Evangelia I. Zacharaki"
  "@context": "", 
  "distribution": [
      "contentUrl": " Media Sensing Texts.csv", 
      "encodingFormat": "csv", 
      "@type": "DataDownload"
  "identifier": "", 
  "@id": "", 
  "@type": "Dataset", 
  "name": "Past Written Texts Dataset"
All versions This version
Views 225226
Downloads 139139
Data volume 402.4 kB402.4 kB
Unique views 195196
Unique downloads 124124


Cite as