Skema: A New Tool for Corpus-driven Lexicography
Creators
- 1. Instituut voor de Nederlandse Taal, The Netherlands
- 2. University of Pavia, Italy
Description
In this paper, we describe the development of Skema and its features. Skema [ˈskiːmə] is a new corpus pattern editor system which supports the manual annotation of concordance lines with user-defined labels (each concordance has its own set of labels) and the editing of the corresponding patterns in terms of slots, attributes, examples and other features following the lexicographic technique of Corpus Pattern Analysis. Skema is integrated into the web-based Sketch Engine and can be used by any user for annotating both preloaded and user corpora. Each annotation label is linked to the pattern structure (stored in JSON format) which can be easily customized to individual projects, a generic pattern structure (i.e. a list of user-defined attributes) being available by default. The paper illustrates the use of Skema in three specific projects, i.e. Woordcombinaties for Dutch verbs, Typed Predicate-Argument Structures for Italian Verbs (T-PAS) and its sister project for Croatian Verbs (CROATPAS).
Files
EURALEX2020_ProceedingsBook-p523-528.pdf
Files
(896.0 kB)
Name | Size | Download all |
---|---|---|
md5:997b604d17dd95565d30d545f8c13c4d
|
896.0 kB | Preview Download |