Published May 28, 2018 | Version v.4.0.0
Software Open

cligs/textbox: Almost Summer Release

  • 1. University of Trier
  • 2. University of Würzburg

Description

In this release, the existing collections have been updated to be more consistent and to improve validation. The main changes are:

  • The keywords in the text classification section of the TEI header have been normalized and hierarchized in all the collections.
  • A TEI keyword list and a schematron file controlling term values have been created for each collection.
  • The schemas for TEI master files and annotated versions have been merged into a single, common schema for all TEI files. The schema files are hosted at https://github.com/cligs/reference.
  • In the schema and TEI files, a CLiGS namespace has been introduced for:
    • a CLiGS-specific attribute, importance, which indicates the importance of each genre assignment when a text has several different assignments
    • non-TEI sentence and word level attributes resulting from NLP annotations with FreeLing
  • Folder names have been adjusted to be consistent for all collections in the textbox.
  • For details, see the version history in the "next" branch: https://github.com/cligs/textbox/commits/next.
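
As a rough illustration of how the namespaced importance attribute might be read from a TEI header, here is a minimal Python sketch. The CLiGS namespace URI, the sample header, the placement of the attribute on term elements, and the attribute values are all assumptions made for this example, not taken from the release. The actual definitions are in the schema at https://github.com/cligs/reference.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"
# Hypothetical placeholder: look up the real CLiGS namespace URI in the schema.
CLIGS_NS = "https://example.org/ns/cligs"

# Hypothetical sample header; element placement and values are assumptions.
SAMPLE = f"""<TEI xmlns="{TEI_NS}" xmlns:cligs="{CLIGS_NS}">
  <teiHeader>
    <profileDesc>
      <textClass>
        <keywords>
          <term type="genre" cligs:importance="1">novel</term>
          <term type="genre" cligs:importance="2">historical novel</term>
          <term type="form">prose</term>
        </keywords>
      </textClass>
    </profileDesc>
  </teiHeader>
</TEI>"""


def genre_assignments(xml_text, cligs_ns=CLIGS_NS):
    """Return (term text, importance) pairs for terms carrying the attribute."""
    root = ET.fromstring(xml_text)
    pairs = []
    for term in root.iter(f"{{{TEI_NS}}}term"):
        # Namespaced attributes are addressed as "{namespace-uri}localname".
        importance = term.get(f"{{{cligs_ns}}}importance")
        if importance is not None:
            pairs.append((term.text, importance))
    return pairs


if __name__ == "__main__":
    for text, importance in genre_assignments(SAMPLE):
        print(importance, text)
```

Passing the namespace URI as a parameter keeps the sketch usable once the real URI has been taken from the schema files.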

Files

cligs/textbox-v.4.0.0.zip

Files (526.1 MB)

md5:9a59e69e8d1aa90574fb2ad4e757283b
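
The published MD5 value can be used to check a downloaded copy of the archive. The following is a minimal sketch using Python's standard hashlib module. The local file name textbox-v.4.0.0.zip is an assumption about where the download was saved, and the helper names are illustrative.

```python
import hashlib

# Checksum as published in this record.
EXPECTED_MD5 = "9a59e69e8d1aa90574fb2ad4e757283b"


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_checksum(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest equals the expected value."""
    return md5_of_file(path) == expected


if __name__ == "__main__":
    # Assumed local path of the downloaded archive.
    archive = "textbox-v.4.0.0.zip"
    print("checksum OK" if matches_published_checksum(archive) else "checksum mismatch")
```

Reading in chunks avoids loading the whole 526.1 MB archive into memory at once.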
