Dataset Open Access

The Clarity Software Documentation Dataset

Anonymous Authors


JSON Export

{
  "files": [
    {
      "links": {
        "self": "https://zenodo.org/api/files/ed73a9e4-4d5e-4c2c-b9cf-98b9c742b571/Clarity-Data.zip"
      }, 
      "checksum": "md5:b0017a0ed1495c5942c33835889172fa", 
      "bucket": "ed73a9e4-4d5e-4c2c-b9cf-98b9c742b571", 
      "key": "Clarity-Data.zip", 
      "type": "zip", 
      "size": 12328146854
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/ed73a9e4-4d5e-4c2c-b9cf-98b9c742b571/README.md"
      }, 
      "checksum": "md5:34d826f7f0a64d475eb88d4e6736294a", 
      "bucket": "ed73a9e4-4d5e-4c2c-b9cf-98b9c742b571", 
      "key": "README.md", 
      "type": "md", 
      "size": 1493
    }
  ], 
  "owners": [
    58208
  ], 
  "doi": "10.5281/zenodo.5822884", 
  "stats": {
    "version_unique_downloads": 14.0, 
    "unique_views": 60.0, 
    "views": 71.0, 
    "version_views": 82.0, 
    "unique_downloads": 11.0, 
    "version_unique_views": 68.0, 
    "volume": 86297039922.0, 
    "version_downloads": 18.0, 
    "downloads": 15.0, 
    "version_volume": 123281480484.0
  }, 
  "links": {
    "doi": "https://doi.org/10.5281/zenodo.5822884", 
    "conceptdoi": "https://doi.org/10.5281/zenodo.5821839", 
    "bucket": "https://zenodo.org/api/files/ed73a9e4-4d5e-4c2c-b9cf-98b9c742b571", 
    "conceptbadge": "https://zenodo.org/badge/doi/10.5281/zenodo.5821839.svg", 
    "html": "https://zenodo.org/record/5822884", 
    "latest_html": "https://zenodo.org/record/5822884", 
    "badge": "https://zenodo.org/badge/doi/10.5281/zenodo.5822884.svg", 
    "latest": "https://zenodo.org/api/records/5822884"
  }, 
  "conceptdoi": "10.5281/zenodo.5821839", 
  "created": "2022-01-05T18:53:32.554837+00:00", 
  "updated": "2022-01-06T01:48:51.156272+00:00", 
  "conceptrecid": "5821839", 
  "revision": 2, 
  "id": 5822884, 
  "metadata": {
    "access_right_category": "success", 
    "doi": "10.5281/zenodo.5822884", 
    "description": "<p>This repository holds the Clarity Dataset which is a companion to the SANER&#39;22 entitled &quot;An Empirical Investigation into the Use of Image Captioning for Automated Software Documentation&quot;. The dataset consists of 45,998 captions&nbsp;10,204 GUI screenshots and xml metadata files (akin to the &quot;html&quot; for stipulating GUIs)&nbsp;of Android applications.&nbsp;The NL captions were obtained from human labelers, underwent several quality control mechanisms, and contain both high- (screen-level) and low-(component)&nbsp;level descriptions of screen functionality. This dataset is meant as a new source of data to augment techniques for software documentation that can take advantage of the rich pixel-based information contained within screenshots.</p>", 
    "license": {
      "id": "CC-BY-4.0"
    }, 
    "title": "The Clarity Software Documentation Dataset", 
    "relations": {
      "version": [
        {
          "count": 2, 
          "index": 1, 
          "parent": {
            "pid_type": "recid", 
            "pid_value": "5821839"
          }, 
          "is_last": true, 
          "last_child": {
            "pid_type": "recid", 
            "pid_value": "5822884"
          }
        }
      ]
    }, 
    "version": "1.0", 
    "keywords": [
      "Software Documentation", 
      "Android", 
      "Screenshots", 
      "Captions"
    ], 
    "publication_date": "2022-01-05", 
    "creators": [
      {
        "affiliation": "Anonymous", 
        "name": "Anonymous Authors"
      }
    ], 
    "access_right": "open", 
    "resource_type": {
      "type": "dataset", 
      "title": "Dataset"
    }, 
    "related_identifiers": [
      {
        "scheme": "doi", 
        "identifier": "10.5281/zenodo.5821839", 
        "relation": "isVersionOf"
      }
    ]
  }
}
82
18
views
downloads
All versions This version
Views 8271
Downloads 1815
Data volume 123.3 GB86.3 GB
Unique views 6860
Unique downloads 1411

Share

Cite as