Dataset Open Access

Past Written Texts Dataset

John Ellul; Marina Polycarpou

Citation Style Language JSON Export

  "publisher": "Zenodo", 
  "DOI": "10.5281/zenodo.2670061", 
  "author": [
      "family": "John Ellul"
      "family": "Marina Polycarpou"
  "issued": {
    "date-parts": [
  "abstract": "<p>The dataset consists of features extracted from older adults&rsquo; text.</p>\n\n<p>The texts were written by the older person either in an electronic mean (eg. older e-mail), or in paper form and were transcribed by the project&#39;s clinical nurses.</p>\n\n<p>The texts were then translated to English using the MyMemory service (, and a series of features were generated that can be used for sentiment analysis.</p>\n\n<p>The list of fields of this dataset is presented below:</p>\n\n<p>- <strong>Part_id</strong>: The user ID, which should be a 4-digit number</p>\n\n<p>- <strong>Date</strong>: The recording date, which follows the &ldquo;DD-MM-YY&rdquo; format (eg. 14 September 2017, is formatted as 14-09-17)</p>\n\n<p>- <strong>Clinical_visit</strong>: As several clinical evaluations were performed to each older adult, this number shows for which clinical evaluation these measurements refer to</p>\n\n<p>- <strong>Transcript</strong>: If the text was written by the older adult (0) or was transcribed by a nurse (1)</p>\n\n<p>- <strong>Language</strong>: The original language of the text (0 = Greek)</p>\n\n<p>- <strong>Text_length, Number_of_sentences, Number_of_words, Number_of_words_per_sentence, Text_entropy</strong>: Statistical Measures</p>\n\n<p>- <strong>Desc_image_ENG_sentiment, Desc_event_sentiment, Prev_text_ENG_sentiment</strong>: Sentiment Analysis</p>\n\n<p>- <strong>Tf-XX</strong>: Term frequency &ndash; Inverse document frequency</p>\n\n<p>- <strong>Tf-pos-XX</strong>: Part of Speech analysis, using tf-idf methodology</p>", 
  "title": "Past Written Texts Dataset", 
  "type": "dataset", 
  "id": "2670061"
All versions This version
Views 225226
Downloads 139139
Data volume 402.4 kB402.4 kB
Unique views 195196
Unique downloads 124124


Cite as