# Lo Congrès websites Corpus v1.0 (June 2024)

This corpus was automatically compiled from the Lo Congrès websites (locongres.org, dicodoc.eu, revirada.eu, votz.eu, lengasocietat.eu, lafarga.eu, ninon.eu, api.locongres.com, afichas.locongres.com, premsa.locongres.com, and some private online tools). It was produced by Lo Congrès permanent de la lenga occitana (https://locongres.org) as part of its "Còrpus" project (http://abrac.at/corpusproject).

It contains CSV files in which Occitan sentences are aligned with their French translations. Each record has four fields separated by the § character, in this order:
occitan sentence§occitan variety code§translation§translation language
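
The four-field layout above can be read with a few lines of Python. This is only a minimal sketch under stated assumptions: the files are assumed to be UTF-8 encoded, to have no header row, to use no quoting, and never to contain the § character inside a sentence. The names `AlignedPair` and `read_pairs` are hypothetical, and the sample line is invented for illustration (including the `fr` value for the translation language); it is not taken from the corpus.

```python
from typing import Iterable, Iterator, NamedTuple


class AlignedPair(NamedTuple):
    """One record: the four fields in the order given in this README."""
    occitan: str
    variety: str
    translation: str
    translation_lang: str


def read_pairs(lines: Iterable[str]) -> Iterator[AlignedPair]:
    """Yield one AlignedPair per well-formed line.

    Assumption: fields are separated by '§' with no quoting or escaping.
    Blank lines and lines without exactly four fields are skipped.
    """
    for raw in lines:
        line = raw.rstrip("\r\n")
        if not line:
            continue
        fields = line.split("§")
        if len(fields) != 4:
            continue  # malformed line: skipped rather than guessed at
        yield AlignedPair(*fields)


if __name__ == "__main__":
    # Invented sample line, for illustration only.
    sample = ["Bonjorn§oc-lengadoc-grclass§Bonjour§fr\n"]
    for pair in read_pairs(sample):
        print(pair.variety, "|", pair.occitan, "->", pair.translation)
```

To read an actual file, an open file object can be passed directly, for example `read_pairs(open(path, encoding="utf-8"))`, where `path` is whichever CSV file of the corpus you want to load.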

The language codes used for Occitan varieties are:
oc-aranes-grclass : Aranese Gascon Occitan
oc-auvern-grclass : Auvergnat Occitan
oc-cisaup-grclass : Cisalpine Vivaro-Alpine Occitan
oc-gascon-grclass : Gascon Occitan (except Aranese)
oc-lemosin-grclass : Limousin Occitan
oc-lengadoc-grclass : Languedocien Occitan
oc-provenc-grclass : Provençal Occitan
oc-vivaraup-grclass : Vivaro-Alpine Occitan (except Cisalpine)
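
As a rough illustration of how these codes might be used, the sketch below keeps them in a lookup table and counts how many records carry each variety code. The names `VARIETIES` and `count_by_variety` are hypothetical, and the function assumes the variety codes have already been extracted from the second field of each record.

```python
from collections import Counter
from typing import Iterable, List, Tuple

# Variety codes and labels as listed in this README.
VARIETIES = {
    "oc-aranes-grclass": "Aranese Gascon Occitan",
    "oc-auvern-grclass": "Auvergnat Occitan",
    "oc-cisaup-grclass": "Cisalpine Vivaro-Alpine Occitan",
    "oc-gascon-grclass": "Gascon Occitan (except Aranese)",
    "oc-lemosin-grclass": "Limousin Occitan",
    "oc-lengadoc-grclass": "Languedocien Occitan",
    "oc-provenc-grclass": "Provençal Occitan",
    "oc-vivaraup-grclass": "Vivaro-Alpine Occitan (except Cisalpine)",
}


def count_by_variety(codes: Iterable[str]) -> Tuple[Counter, List[str]]:
    """Count records per variety code.

    Returns the counts and a sorted list of codes that are not in
    VARIETIES, so that unexpected values are reported, not dropped.
    """
    counts = Counter(codes)
    unknown = sorted(code for code in counts if code not in VARIETIES)
    return counts, unknown


if __name__ == "__main__":
    # Invented input, for illustration only.
    counts, unknown = count_by_variety(
        ["oc-gascon-grclass", "oc-gascon-grclass", "oc-provenc-grclass"]
    )
    for code, n in counts.items():
        print(VARIETIES.get(code, "unknown code"), n)
```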

This corpus includes some of the sentences that also appear in the Occitan Corpus from Lo Congrès news.

## License
Lo Congrès websites Corpus is distributed under the Creative Commons Attribution 4.0 License (https://creativecommons.org/licenses/by/4.0).

