Planned intervention: On Wednesday June 26th 05:30 UTC Zenodo will be unavailable for 10-20 minutes to perform a storage cluster upgrade.
Published November 30, 2020 | Version v1
Conference paper Open

Skema: A New Tool for Corpus-driven Lexicography

  • 1. Instituut voor de Nederlandse Taal, The Netherlands
  • 2. University of Pavia, Italy

Description

In this paper, we describe the development of Skema and its features. Skema [ˈskiːmə] is a new corpus pattern editor system which supports the manual annotation of concordance lines with user-defined labels (each concordance has its own set of labels) and the editing of the corresponding patterns in terms of slots, attributes, examples and other features following the lexicographic technique of Corpus Pattern Analysis. Skema is integrated into the web-based Sketch Engine and can be used by any user for annotating both preloaded and user corpora. Each annotation label is linked to the pattern structure (stored in JSON format) which can be easily customized to individual projects, a generic pattern structure (i.e. a list of user-defined attributes) being available by default. The paper illustrates the use of Skema in three specific projects, i.e. Woordcombinaties for Dutch verbs, Typed Predicate-Argument Structures for Italian Verbs (T-PAS) and its sister project for Croatian Verbs (CROATPAS).

Files

EURALEX2020_ProceedingsBook-p523-528.pdf

Files (896.0 kB)

Name Size Download all
md5:997b604d17dd95565d30d545f8c13c4d
896.0 kB Preview Download

Additional details

Funding

ELEXIS – European Lexicographic Infrastructure 731015
European Commission