Published June 20, 2020 | Version v0.1.1
Software Open

tresoldi/distfeat: A Model of Distinctive Features for Computer-Assisted Language Comparison

  • 1. Max-Planck-Institut für Menschheitsgeschichte

Description

A model of segmental/distinctive features for the symbolic representation of sounds, covering almost 600 segments from CLTS (List et al., 2019), each mapped to a unique set of bivalent features. It is being designed as an alternative input to presence/absence vectors built from BIPA descriptors, and is analogous to other feature matrices such as the one by Phoible (Moran & McCloy, 2019). While still under development, it can already be used both for training machine-learning and statistical models, notably decision trees, and for bootstrapping language- and process-specific models, aided by a "universal" and concise reference. A supporting Python library, distfeat, is available on PyPI.
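As a rough illustration of what "bivalent features mapped to unique sets" can mean in practice, the sketch below builds a tiny feature table by hand and turns each segment into a +1/-1 vector of the kind a decision tree could take as input. It deliberately does not use the distfeat library's API, which is not described here. The feature names, the five segments, and their values are hypothetical toy choices and are not taken from the model itself.

```python
# Toy bivalent feature table. Feature names and values are illustrative
# assumptions, not data copied from the distfeat model.
FEATURES = ["consonantal", "voice", "nasal", "labial"]

TOY_MODEL = {
    "p": {"consonantal": True, "voice": False, "nasal": False, "labial": True},
    "b": {"consonantal": True, "voice": True, "nasal": False, "labial": True},
    "m": {"consonantal": True, "voice": True, "nasal": True, "labial": True},
    "t": {"consonantal": True, "voice": False, "nasal": False, "labial": False},
    "a": {"consonantal": False, "voice": True, "nasal": False, "labial": False},
}


def to_vector(segment):
    """Return the segment as a list of +1/-1 values, in FEATURES order."""
    values = TOY_MODEL[segment]
    return [1 if values[name] else -1 for name in FEATURES]


def is_unique(model):
    """Check that no two segments share the same feature vector."""
    vectors = [tuple(to_vector(segment)) for segment in model]
    return len(set(vectors)) == len(vectors)


def differing_features(first, second):
    """List the features on which two segments take opposite values."""
    return [
        name
        for name in FEATURES
        if TOY_MODEL[first][name] != TOY_MODEL[second][name]
    ]


if __name__ == "__main__":
    for segment in TOY_MODEL:
        print(segment, to_vector(segment))
    print("unique:", is_unique(TOY_MODEL))
    print("p vs b:", differing_features("p", "b"))
```

Unlike a presence/absence vector built from descriptors, every feature here carries an explicit positive or negative value for every segment, so two segments can be compared feature by feature.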

Files

tresoldi/distfeat-v0.1.1.zip (140.1 kB)
md5:f98e2a1806e2aa210b6d6ae28528bce3

Additional details

Funding

European Commission: CALC – Computer-Assisted Language Comparison: Reconciling Computational and Classical Approaches in Historical Linguistics (grant 715618)