Published February 22, 2018 | Version v2
Technical note Open

Annotated Corpus for Occitan: Corpus Description

  • 1. Université de Toulouse, CLLE, UT2J & CNRS
  • 2. Université de Toulouse, CLLE, CNRS

Description

This technical note describes a corpus of texts in several dialects of Occitan (lengadocian, gascon, provençau, vivaro-aupenc, auvernhàs, lemosin), manually annotated with parts of speech and lemmas. The corpus itself is available at:

DOI:10.5281/zenodo.1182949.

 

Files

Files (36.9 kB)

md5:f662c17d2bfff0f4f06699acd23a3742 (36.9 kB)