Published December 4, 2014 | Version v1
Publication Open

Formalizing MultiWords as Catenae in a Treebank and in a Lexicon

  • 1. Institute of Information and Communication Technologies, BAS

Description

The paper presents formalization of multiwords as catenae in a treebank and in a lexicon. We view catenae as a dependency subtree, which reflects non-constituents and non-standard dependencies. Since the multiword classifications vary to great extent, starting from very narrow ones and proliferating to extended ones which include also valences, the focus in the paper is not on the multiword typology per se, but on the general formalization of multi-words.

Files

206_tlt13-proceedings.pdf

Files (153.7 kB)

Name Size Download all
md5:6e8b1ac38ad23ae6b056618c6a93f8ae
153.7 kB Preview Download