Published June 20, 2020 | Version v0.1.1
Software Open

tresoldi/distfeat: A Model of Distinctive Features for Computer-Assisted Language Comparison

  • 1. Max-Planck-Institut für Menschheitsgeschichte


A model of segmental/distinctive features for the symbolic representation of sounds, covering almost 600 segments from CLTS (List et al., 2019) mapped to unique sets of bivalent features. It is being designed as an alternative input to vectors of presence/absence built from BIPA descriptors, analogous to other feature matrices like the one by Phoible (Moran & McCloy, 2019). While still under development, it can already be used both for training models of machine learning and statistics, notably decision trees, and for bootstrapping language- and process-specific models, aided by an "universal" and concise reference. A supporting Python library, distfeat, is available on PyPI.



Files (140.1 kB)

Name Size Download all
140.1 kB Preview Download

Additional details

Related works


CALC – Computer-Assisted Language Comparison: Reconciling Computational and Classical Approaches in Historical Linguistics 715618
European Commission