There is a newer version of the record available.

Published January 14, 2026 | Version v2.0
Software Open

OMW Data

  • 1. Palacký University Olomouc

Description

Overview

This release of OMW uses the WN-LMF 1.4 schema, which adds an index attribute on LexicalEntry elements and an n attribute on Sense elements (only for WNDB-derived lexicons, namely the English wordnets). These attributes allow one to get a precise ordering of words and senses aligned to what the original Princeton WordNet does for improved reproducibility. Exceptional forms are now only paired with forms matching in case, so, for instance, Buffalo (the city) does not get an alternative form buffaloes.

This release also updates and fixes some issues with the MCR and Arabic wordnets and removes duplicate entries in many wordnets. Finally, this release also includes English wordnets derived from pre-3.0 versions of the Princeton WordNet:

  • WordNet 1.5
  • WordNet 1.6
  • WordNet 1.7
  • WordNet 1.7.1
  • WordNet 2.0
  • WordNet 2.1

See below for a more granular list of changes.

What's Changed

  • Update the MCR wordnets to the 2016 version by @ekaf in https://github.com/omwn/omw-data/pull/27
  • Fix issues with char escapes by @goodmami in https://github.com/omwn/omw-data/pull/40
  • Update wndb2lmf to build Pre-3.0 WordNets by @goodmami in https://github.com/omwn/omw-data/pull/42
  • TSV cleanup scripts by @goodmami in https://github.com/omwn/omw-data/pull/48
  • Update sum-rel.py to summarize-release.py by @goodmami in https://github.com/omwn/omw-data/pull/44
  • Remove redundant lemmas found with clean.sh by @goodmami in https://github.com/omwn/omw-data/pull/50
  • 52 allow the input file to have counts and pronunciation by @fcbond in https://github.com/omwn/omw-data/pull/53
  • Allow alternative forms for Arabic by @goodmami in https://github.com/omwn/omw-data/pull/56
  • Handle lexical gaps marked by GAP! or PSEUDOGAP! by @goodmami in https://github.com/omwn/omw-data/pull/57
  • use more environment variables so the scripts are more portable by @fcbond in https://github.com/omwn/omw-data/pull/54
  • Gh 24 unexpected identifiers by @goodmami in https://github.com/omwn/omw-data/pull/59
  • Release 2.0 by @goodmami in https://github.com/omwn/omw-data/pull/61
  • Release 2.0 Part 2 by @goodmami in https://github.com/omwn/omw-data/pull/62

New Contributors

  • @ekaf made their first contribution in https://github.com/omwn/omw-data/pull/27
  • @fcbond made their first contribution in https://github.com/omwn/omw-data/pull/53

Full Changelog: https://github.com/omwn/omw-data/compare/v1.4...v2.0

Notes

Please cite this dataset using the metadata from 'preferred-citation'.

Files

omwn/omw-data-v2.0.zip

Files (43.5 MB)

Name Size Download all
md5:dd7b8e433b8431be98d085369bf59ab8
43.5 MB Preview Download

Additional details

Related works

Is supplement to
Software: https://github.com/omwn/omw-data/tree/v2.0 (URL)

Software