Modelling Frequency and Attestations for OntoLex-Lemon
Creators
- 1. Goethe-Universität Frankfurt am Main
- 2. Instituut voor de Nederlandse Taal
- 3. Istituto di Linguistica Computazionale "A. Zampolli" (ILC-CNR) Pisa
- 4. Leiden University
- 5. DFKI GmbH
- 6. National University of Ireland Galway
Description
The OntoLex vocabulary enjoys increasing popularity as a means of publishing lexical resources with RDF and as Linked Data. The recent publication of a new OntoLex module for lexicography, lexicog, reflects its increasing importance for digital lexicography. However, not all aspects of digital lexicography have been covered to the same extent. In particular, supplementary information drawn from corpora such as frequency information, links to attestations, and collocation data were considered to be beyond the scope of lexicog. Therefore, the OntoLex community has put forward the proposal for a novel module for frequency, attestation and corpus information (FrAC), that not only covers the requirements of digital lexicography, but also accommodates essential data structures for lexical information in natural language processing. This paper introduces the current state of the OntoLex-FrAC vocabulary, describes its structure, some selected use cases, elementary concepts and fundamental definitions, with a focus on frequency and attestations.
Files
chiarcos2020modelling.pdf
Files
(485.1 kB)
Name | Size | Download all |
---|---|---|
md5:c51e041792bba816f451dcd063496f0e
|
485.1 kB | Preview Download |