Published September 20, 2023
| Version v2023-09-20
Dataset
Restricted
Collection de romans français du vingtième siècle (1970-1999)
Description
Corpus of 600 French novels from three time periods (1970s, 1980s, 1990s) and belonging to four groups (crime fiction, sentimental novels, science-fiction novel, general literary novel). Each combined group is represented with 50 novels for a total of 600 novels. This version of the corpus has been linguistically annotated using TreeTagger in TXM (lemma, pos) and is archived here in a binary format suitable for import into TXM.
Note that the TDM exception in the EU-DSM Directive allows building and using, but not publicly sharing, this corpus of in-copyright texts.
Files
Additional details
Related works
- Is supplement to
- Other: https://www.zeta-project.eu (URL)