There is a newer version of the record available.

Published January 27, 2022 | Version r20221127
Software Open

A Global Lexical Dataset (GLED) with cognate annotation and phonological alignments

Authors/Creators

Description

This work presents a lexical database encompassing most natural languages, with cognate annotation and phonological alignment, along with per-family and global phylogenetic resources. The lexical data is organized in a single and easy-to-use tabular file, and all resources are built following best practices and state-of-the-art algorithms for historical linguistics. It was developed to provide a source for prototyping studies, developing new methods, as well as bootstrapping analyses, and to allow for the community to engage in research in computational historical linguistics. The data is expected to be updated regularly, with additions and improvements. All resources are freely available for download for all interested researchers.

Notes

If you use this dataset, please cite it as below.

Files

tresoldi/gled-r20221127.zip

Files (84.9 MB)

Name Size Download all
md5:dbb072a5f258e9d09b313bc2acfa9ac1
84.9 MB Preview Download

Additional details

Related works