Published March 20, 2023 | Version v1.2
Software Open

Adjectiveness dataset for past participles in Dutch

Description

This dataset contains the adjectiveness values of, reasonably, almost all participles in Dutch. 'Adjectiveness' is a measure which expresses how often a past participle is used as an adjective. The value ranges from 0 (never used as an adjective) to 1 (always used as an adjective).

The dataset contains four columns:

  • participle: the participle for which adjectiveness is computed
  • adjectiveness: the adjectiveness of said participle (undeclensed adjectives only)
  • declensed_adjectiveness: the adjectiveness of said participle (declensed adjectives only)
  • total_adjectiveness: the adjectiveness of said participle (undeclensed and declensed adjectives)

Declensed adjectiveness is a very bad measure and should not be used. Adjectiveness and total adjectiveness (so also including declensed forms) correlate very strongly ($\rho$ = 0.99), so which measure you use depends on your linguistic viewpoints.

To find out how this file was generated, please consult the README of this repository.

Notes

If you use this software, please cite it as below.

Files

AntheSevenants/Adjectiveness-v1.2.zip

Files (16.7 kB)

Name Size Download all
md5:3e7759aa40fb9c8e9d78cc9d7970acc9
16.7 kB Preview Download

Additional details