Published August 1, 2018 | Version v1
Conference paper Open

Constructing an Annotated Corpus of Verbal MWEs for English

  • 1. ADAPT Centre, Dublin City University
  • 2. U.S. Army Research Laboratory
  • 3. University of Alberta
  • 4. Insight Centre for Data Analytics, National University of Ireland Galway
  • 5. Georgetown University

Description

This paper describes the construction and annotation of a corpus of verbal MWEs for English as part of the PARSEME Shared Task 1.1 on automatic identification of verbal MWEs. The criteria for corpus selection, the categories of MWEs used, and the training process are discussed, along with the particular issues that led to revisions in edition 1.1 of the annotation guidelines. Finally, an overview of the characteristics of the final annotated corpus is presented, as well as some discussion on inter-annotator agreement.

Files

walsh2018constructing.pdf

Files (144.7 kB)

Name Size Download all
md5:eea1dcb56c52dd50528a858fcbd437d7
144.7 kB Preview Download