Video/Audio Open Access

Standardizing linguistic data: method and tools for annotating(pre-orthographic) French

Gabay, Simon; Clérice, Thibault; Camps, Jean-Baptiste; Tanguy, Jean-Baptiste; Gille-Levenson, Matthias


JSON Export

{
  "files": [
    {
      "links": {
        "self": "https://zenodo.org/api/files/a4303161-da39-499e-a82c-696a7ff983af/Gabay-al-1.webm"
      }, 
      "checksum": "md5:bdb7905bc80f09612a21f6d967723254", 
      "bucket": "a4303161-da39-499e-a82c-696a7ff983af", 
      "key": "Gabay-al-1.webm", 
      "type": "webm", 
      "size": 36479539
    }
  ], 
  "owners": [
    124694
  ], 
  "doi": "10.5281/zenodo.4084499", 
  "stats": {
    "version_unique_downloads": 702.0, 
    "unique_views": 25.0, 
    "views": 25.0, 
    "version_views": 25.0, 
    "unique_downloads": 702.0, 
    "version_unique_views": 25.0, 
    "volume": 32393830632.0, 
    "version_downloads": 888.0, 
    "downloads": 888.0, 
    "version_volume": 32393830632.0
  }, 
  "links": {
    "doi": "https://doi.org/10.5281/zenodo.4084499", 
    "conceptdoi": "https://doi.org/10.5281/zenodo.4084498", 
    "bucket": "https://zenodo.org/api/files/a4303161-da39-499e-a82c-696a7ff983af", 
    "conceptbadge": "https://zenodo.org/badge/doi/10.5281/zenodo.4084498.svg", 
    "html": "https://zenodo.org/record/4084499", 
    "latest_html": "https://zenodo.org/record/4084499", 
    "badge": "https://zenodo.org/badge/doi/10.5281/zenodo.4084499.svg", 
    "latest": "https://zenodo.org/api/records/4084499"
  }, 
  "conceptdoi": "10.5281/zenodo.4084498", 
  "created": "2020-10-13T08:29:28.644310+00:00", 
  "updated": "2020-10-13T08:29:30.088722+00:00", 
  "conceptrecid": "4084498", 
  "revision": 1, 
  "id": 4084499, 
  "metadata": {
    "access_right_category": "success", 
    "doi": "10.5281/zenodo.4084499", 
    "description": "<p>With the development of big corpora of various periods, it becomescrucial to standardise linguistic annotation (e.g.lemmas, POS tags,morphological annotation) to increase the interoperability of the dataproduced, despite diachronic variations. In the present paper, wedescribe both methodologically (by proposing annotation principles)and technically (by creating the required training data and therelevant models) the production of a linguistic tagger for (early)modern French (16-18th c.), taking as much as possible into accountalready existing standards for contemporary and, especially, medievalFrench</p>", 
    "language": "eng", 
    "title": "Standardizing linguistic data: method and tools for annotating(pre-orthographic) French", 
    "license": {
      "id": "CC-BY-4.0"
    }, 
    "relations": {
      "version": [
        {
          "count": 1, 
          "index": 0, 
          "parent": {
            "pid_type": "recid", 
            "pid_value": "4084498"
          }, 
          "is_last": true, 
          "last_child": {
            "pid_type": "recid", 
            "pid_value": "4084499"
          }
        }
      ]
    }, 
    "keywords": [
      "linguistic annotation, pre-orthographic language, lemmatisation,POS-tagging"
    ], 
    "publication_date": "2020-10-13", 
    "creators": [
      {
        "affiliation": "Universit\u00e9s de Neuch\u00e2tel et de Gen\u00e8ve", 
        "name": "Gabay, Simon"
      }, 
      {
        "affiliation": "\u00c9cole des Chartes", 
        "name": "Cl\u00e9rice, Thibault"
      }, 
      {
        "affiliation": "\u00c9cole des Chartes", 
        "name": "Camps, Jean-Baptiste"
      }, 
      {
        "affiliation": "Sorbonne Universit\u00e9", 
        "name": "Tanguy, Jean-Baptiste"
      }, 
      {
        "affiliation": "\u00c9cole normale sup\u00e9rieure de Lyon", 
        "name": "Gille-Levenson, Matthias"
      }
    ], 
    "access_right": "open", 
    "resource_type": {
      "type": "video", 
      "title": "Video/Audio"
    }, 
    "related_identifiers": [
      {
        "scheme": "doi", 
        "identifier": "10.5281/zenodo.4084498", 
        "relation": "isVersionOf"
      }
    ]
  }
}
25
888
views
downloads
All versions This version
Views 2525
Downloads 888888
Data volume 32.4 GB32.4 GB
Unique views 2525
Unique downloads 702702

Share

Cite as