Software Open Access

cligs/textbox: Almost Summer Release

Christof Schöch; Ulrike; José Calvo; KatrinB

In this release, the existing collections have been updated to be more consistent and to improve validation. The main changes are:

  • The keywords in the text classification section of the TEI header have been normalized and hierarchized in all the collections.
  • A TEI keyword list and a schematron file controlling term values have been created for each collection.
  • The schemas for TEI master files and annotated versions have been merged into a single, common schema for all TEI files. See also where the schema files are hosted.
  • In the schema and TEI files, a CLiGS namespace has been introduced for:
    • a CLiGS-specific attribute, importance, which indicates the importance of each genre assignment when a text has several different assignments
    • non-TEI sentence-level and word-level attributes resulting from NLP annotation with FreeLing
  • Folder names have been adjusted to be consistent for all collections in the textbox.
  • For details, see the version history in the "next" branch.
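
To make the namespace change and the controlled keyword values more concrete, here is a minimal Python sketch of how namespaced genre keywords might be read from a TEI header and checked against a term list. Only the TEI namespace URI is standard. The CLiGS namespace URI, the sample header, the importance values, the allowed-term set, and the function names are all assumptions made for illustration; the real values are defined in the collection's schema, keyword list, and schematron file.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"
# Assumption: placeholder URI, not the namespace actually used by CLiGS.
CLIGS_NS = "https://example.org/ns/cligs"

# Hypothetical, abbreviated header fragment (not taken from the collections).
SAMPLE = f"""<TEI xmlns="{TEI_NS}" xmlns:cligs="{CLIGS_NS}">
  <teiHeader>
    <profileDesc>
      <textClass>
        <keywords>
          <term type="genre" cligs:importance="2">historical novel</term>
          <term type="genre" cligs:importance="1">adventure novel</term>
          <term type="form">prose</term>
        </keywords>
      </textClass>
    </profileDesc>
  </teiHeader>
</TEI>"""

# Hypothetical controlled vocabulary standing in for a collection's keyword list.
ALLOWED_GENRES = {"historical novel", "sentimental novel"}


def genre_terms(xml_text):
    """Return (label, importance) for every term carrying the namespaced attribute."""
    root = ET.fromstring(xml_text)
    found = []
    for term in root.iter(f"{{{TEI_NS}}}term"):
        # Namespaced attributes are addressed in Clark notation: {uri}name.
        importance = term.get(f"{{{CLIGS_NS}}}importance")
        if importance is not None:
            found.append((term.text.strip(), importance))
    return found


def unknown_genres(xml_text, allowed):
    """Return genre labels that are not in the controlled vocabulary."""
    return [label for label, _ in genre_terms(xml_text) if label not in allowed]


if __name__ == "__main__":
    for label, importance in genre_terms(SAMPLE):
        print(f"{label}: importance {importance}")
    print("not in term list:", unknown_genres(SAMPLE, ALLOWED_GENRES))
```

In the release itself, term values are controlled by a schematron file per collection; the set lookup above only mimics that idea in plain Python and is not a substitute for schema validation.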
