Dataset Open Access

French Novel Corpus (ELTeC-fra): April 2021 release

Christof Schöch; Lou Burnard

Data curator(s)
Geißel, Pia; Fileva, Evgenia

This is the French novel corpus for the ELTeC, the European Literary Text Collection, produced by the COST Action Distant Reading for European Literary History (CA16204, The current version is v1.0.1.

An overview over the authors and works represented in the collection can be gained here:


  • Collection editors: Christof Schöch and Lou Burnard
  • Contributors: Pia Geißel, Rezearta Murati, Evegnia Fileva
  • Sources: Bibliothèque nationale de France (Gallica), Ebooks libres et gratuits / Bibliothèque électronique du Québec, CLiGS textbox, Wikisource,, Atramenta, OBVIL, Project Gutenberg.


All texts included in this collection are in the public domain. No claim to copyright or similar protections is made for the composition of the corpus, the collection and presentation of the metadata, or the transcription and encoding of the texts.

Citation suggestion

If you use this corpus in your research or teaching, please follow good scholarly practice and use the following citation suggestion to acknowledge your source:

  • French Novel Corpus (ELTeC-fra), edited by Christof Schöch and Lou Burnard. Version v1.0.1, April 2021. In: European Literary Text Collection (ELTeC). COST Action Distant Reading for European Literary History. DOI:

Release v1.0.1

Release v1.0.1 brings minor improvements to corpus metadata (and data on translations).


ELTeC has been created in the context of the COST Action 'Distant Reading for European Literary History' (CA16204), a COST Action funded by the COST Association as part of the Horizon 2020 Framework Programme of the EU.
Files (37.4 MB)
Name Size
37.4 MB Download
21.8 kB Download
3.8 kB Download
All versions This version
Views 422129
Downloads 5431
Data volume 995.8 MB523.5 MB
Unique views 338110
Unique downloads 4724


Cite as