===========================================================
README — Dihunamb ! Issue No. 01 (1905) – Parallel Corpus
===========================================================

TITLE
------
DIHUNAMB – Breton Periodical, Issue No. 01 (1905) – Parallel Corpus (Breton Vannetais / French)

AUTHORS (Original publication, 1905)
------------------------------------
Loeiz Herrieu (Editor-in-chief) and Andréu Mellag (Director)

SOURCE
------
Title: DIHUNAMB !
Series/Collection: DIHUNAMB – Breton periodical (Issue No. 01, 1905)
Author: Loeiz HERRIEU (Editor-in-chief) and Andréu MELLAG (Director)
Edition Note: First year of publication (1905-1907), Issue No. 01 (1905)
Publisher/Printer: Burèu en dastumaden, Ru Karnot, en Oriant (Lorient)
Language: Breton (Vannetais dialect)
Document Type: Periodical / Journal
Date of Publication: 1905
Digitized Source: Gallica – Bibliothèque nationale de France
URL: https://gallica.bnf.fr/ark:/12148/bpt6k6377266g#

DATASET CURATION (2025)
-----------------------
Michel Mermet (Project manager, IAgwened project)
Institut Culturel de Bretagne

LICENCE
--------
CC-BY 4.0 International — The data may be reused, modified, and redistributed, provided that appropriate credit is given.

RIGHTS
------
All rights reserved for the original printed version / Peb gwir miret strizh.
The parallel corpus is released under CC-BY 4.0 International.

PROJECT
-------
IAgwened – Breton Language and AI Project  
Independent research and development group creating open linguistic resources for the Breton language, with a focus on the Vannetais (Gwenedeg) dialect.  
Zenodo community: https://zenodo.org/communities/iagwened (to be confirmed)

DESCRIPTION
------------
This file contains a sentence-aligned parallel corpus in Breton Vannetais and French.
The content is derived from the periodical *Dihunamb !* (Issue No. 01, 1905) and adapted for research use in natural language processing, machine translation, and speech technologies.

FILE STRUCTURE
---------------
- The first section includes metadata lines prefixed by "# ".
- Each subsequent line contains a Breton Vannetais sentence and its French translation, separated by a vertical bar "|" (written with a space on each side, as in the example below).
- Encoding: UTF-8
- Example:
    Dihunamb ! | Réveillons-nous !
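
The layout described above can be read with a few lines of Python. This is a minimal sketch, not part of the dataset: the function name `parse_corpus` and the sample lines are hypothetical, and it assumes that each data line contains at least one "|" and that the Breton side never contains a "|" itself.

```python
from typing import Iterable, Iterator, Tuple


def parse_corpus(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (breton, french) sentence pairs from corpus lines."""
    for raw in lines:
        line = raw.strip()
        # Skip blank lines and metadata lines (anything starting with "#").
        if not line or line.startswith("#"):
            continue
        # Split on the first vertical bar only.
        # Assumption: the Breton side never contains a "|".
        breton, sep, french = line.partition("|")
        if not sep:
            # Assumption: a line without a separator is not a sentence pair.
            continue
        yield breton.strip(), french.strip()


# Hypothetical sample mirroring the structure described in this README.
sample = [
    "# Title: DIHUNAMB !",
    "Dihunamb ! | Réveillons-nous !",
]
pairs = list(parse_corpus(sample))
print(pairs)
```

To read a local copy of the corpus, pass an open file handle instead of the sample list, for example `open(path, encoding="utf-8")`, where `path` points to your own copy of the file.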

CREDITS / REVIEWERS (2025)
--------------------------
Annotations and linguistic revision were carried out by:

MMT = Michel Mermet (Project coordination)
LC  = Loïc Cheveau (Translation, linguistic review)
FL  = François Louis (Translation, linguistic review)
FJ  = Fañch Jestin (Data validation)
PLF = Pascal Le Folgoc (Data validation)

LEXICAL REFERENCES
------------------
FF = FAVEREAU, Francis. 1992, Dictionnaire du breton contemporain bilingue, Morlaix : Skol Vreizh.
AB = Albert Boché (1927–2012) – neologisms collected and published in:
     MERMET, Michel. 2022, Goulavarioù Albert Boché – Contribution au lexique breton-français, Queven : Skoliuz.

Initials are used within the annotated corpus (KorpusGW_FINAL_ANNOTATED_1.0.txt) to indicate the reviewer responsible for each comment.


ANNOTATION LANGUAGE
-------------------
All annotations within the file `KorpusGW_FINAL_ANNOTATED_1.0.txt` are written in French.
This choice reflects the linguistic working environment of the IAgwened project, focused on Breton (Vannetais dialect) and French parallel corpora.

RECOMMENDED CITATION
---------------------
MERMET, Michel. (2025).
DIHUNAMB – Breton Periodical, Issue No. 01 (1905) – Parallel Corpus (Breton Vannetais / French) [Data set].
Institut Culturel de Bretagne (IAgwened Project) / Zenodo.
https://doi.org/10.5281/zenodo.17931307

Original source edited by Loeiz Herrieu and Andréu Mellag (1905).


License: CC-BY 4.0 International.

CONTACT
--------
IAgwened Project  
Contact: Michel Mermet  
Email: michel.mermet56@gmail.com

===========================================================
Version: 1.0 — 2025-12-21
===========================================================