Constructing an Annotated Corpus of Verbal MWEs for English

Abigail Walsh; Claire Bonial; Kristina Geeraert; John P. McCrae; Nathan Schneider; Clarissa Somers

doi:10.18653/v1/W18-4921

Published August 1, 2018 | Version v1

Conference paper Open

Constructing an Annotated Corpus of Verbal MWEs for English

1. ADAPT Centre, Dublin City University
2. U.S. Army Research Laboratory
3. University of Alberta
4. Insight Centre for Data Analytics, National University of Ireland Galway
5. Georgetown University

This paper describes the construction and annotation of a corpus of verbal MWEs for English as part of the PARSEME Shared Task 1.1 on automatic identification of verbal MWEs. The criteria for corpus selection, the categories of MWEs used, and the training process are discussed, along with the particular issues that led to revisions in edition 1.1 of the annotation guidelines. Finally, an overview of the characteristics of the final annotated corpus is presented, as well as some discussion on inter-annotator agreement.

Files

walsh2018constructing.pdf

Files (144.7 kB)

Name	Size	Download all
walsh2018constructing.pdf md5:eea1dcb56c52dd50528a858fcbd437d7	144.7 kB	Preview Download

Views

Downloads

Show more details

	All versions	This version
Views	33	32
Downloads	20	20
Data volume	2.9 MB	2.9 MB

More info on how stats are collected....

DOI

Resource type

Conference paper

Publisher

Zenodo

Imprint

Proceedings of Joint Workshop on Linguistic Annotation, Multiword Expressions and Constructions (LAW-MWE-CxG-2018).

Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: March 20, 2019
Modified: July 24, 2024

Constructing an Annotated Corpus of Verbal MWEs for English

Creators

Description

Files

walsh2018constructing.pdf

Files (144.7 kB)