There is a newer version of the record available.

Published January 27, 2022 | Version r20220127.1
Software Open

A Global Lexical Dataset (GLED) with cognate annotation and phonological alignments

Description

This repository comprises a dataset developed from a subset of ASJP, in which all lemmas are presented in a broad phonological transcription, automatically annotated for cognacy, and phonologically aligned. Per-family NEXUS files with binary annotation of presence/absence of cognate sets are also available. The dataset is intended to facilitate prototyping studies and methods in quantitative historical linguistics.

Notes

If you use this dataset, please cite it as below.

Files

tresoldi/gled-r20220127.1.zip

Files (36.4 MB)

Name Size Download all
md5:f97619d49c8168a3a1669f35b93122fb
36.4 MB Preview Download

Additional details

Related works