Software Open Access

tresoldi/distfeat: A Model of Distinctive Features for Computer-Assisted Language Comparison

Tiago Tresoldi

A model of segmental/distinctive features for the symbolic representation of sounds, covering almost 600 segments from CLTS (List et al., 2019) mapped to unique sets of bivalent features. It is being designed as an alternative input to vectors of presence/absence built from BIPA descriptors, analogous to other feature matrices like the one by Phoible (Moran & McCloy, 2019). While still under development, it can already be used both for training models of machine learning and statistics, notably decision trees, and for bootstrapping language- and process-specific models, aided by an "universal" and concise reference. A supporting Python library, distfeat, is available on PyPI.

Files (140.1 kB)
Name Size
140.1 kB Download
All versions This version
Views 141141
Downloads 66
Data volume 840.4 kB840.4 kB
Unique views 134134
Unique downloads 66


Cite as