Book section Open Access

# Lexical encoding formats for multi-word expressions: The challenge of irregular'' regularities

Timm Lichte; Simon Petitjean; Agata Savary; Jakub Waszczuk

### DataCite XML Export

<?xml version='1.0' encoding='utf-8'?>
<identifier identifierType="DOI">10.5281/zenodo.2579033</identifier>
<creators>
<creator>
<creatorName>Timm Lichte</creatorName>
<affiliation>University of Düsseldorf</affiliation>
</creator>
<creator>
<creatorName>Simon Petitjean</creatorName>
</creator>
<creator>
<creatorName>Agata Savary</creatorName>
</creator>
<creator>
<creatorName>Jakub Waszczuk</creatorName>
</creator>
</creators>
<titles>
<title>Lexical encoding formats for multi-word expressions: The challenge of irregular'' regularities</title>
</titles>
<publisher>Zenodo</publisher>
<publicationYear>2019</publicationYear>
<dates>
<date dateType="Issued">2019-02-27</date>
</dates>
<language>en</language>
<resourceType resourceTypeGeneral="Text">Book section</resourceType>
<alternateIdentifiers>
<alternateIdentifier alternateIdentifierType="url">https://zenodo.org/record/2579033</alternateIdentifier>
</alternateIdentifiers>
<relatedIdentifiers>
<relatedIdentifier relatedIdentifierType="DOI" relationType="IsVersionOf">10.5281/zenodo.2579032</relatedIdentifier>
<relatedIdentifier relatedIdentifierType="URL" relationType="IsPartOf">https://zenodo.org/communities/langscipress</relatedIdentifier>
</relatedIdentifiers>
<rightsList>
<rights rightsURI="info:eu-repo/semantics/openAccess">Open Access</rights>
</rightsList>
<descriptions>
<description descriptionType="Abstract">&lt;p&gt;This chapter contributes a general overview and discussion of lexical encoding formats for multi-word expressions (MWEs) that can be used in NLP systems, in particular with large-scale grammars. The presentation is kept general in the sense that we will try to elicit basic aspects of lexical encoding and then elaborate on the specific sorts of challenges encountered when dealing with MWEs, especially the irregular&amp;#39;&amp;#39; regularities mentioned in the title. These insights will eventually be used to classify and evaluate different approaches to encoding. Even though this kind of evaluation cannot be conclusive given the diversity of languages and tastes, we will nevertheless argue in favor of fully flexible encoding formats exemplified with PATR-II and XMG, as opposed to the fixed encoding formats of DuELME and Walenty.&lt;/p&gt;</description>
</descriptions>
</resource>

11
10
views
All versions This version
Views 1111
Data volume 2.7 MB2.7 MB
Unique views 99