Published February 6, 2024 | Version 1.0
Dataset Open

Phonotacticon

  • 1. Otaru University of Commerce
  • 2. Hong Kong Polytechnic University

Description

Phonotacticon is a database containing the basic phonotactic information of spoken lects.

The first version, Phonotacticon 1.0, contains 516 lects spoken in Eurasia.

This record contains two files:

  • Phonotacticon1_0.csv: the first version of Phonotacticon
  • PanPhonPhonotacticon1_0.csv: a slightly modified version of PanPhon (Mortensen et al. 2016), consisting of IPA segments used in Phonotacticon 1.0 and their featural values
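As a starting point for working with the two files, the following is a minimal sketch of loading either CSV with the Python standard library. It assumes the files are comma-delimited, UTF-8 encoded, and start with a header row; none of this is stated in the record, so check the files before relying on it. The helper names `load_csv_rows` and `summarize` are hypothetical, and no column names are assumed.

```python
import csv
from pathlib import Path


def load_csv_rows(path, encoding="utf-8-sig"):
    """Read a CSV file into a list of dicts keyed by its header row.

    Assumption: comma-delimited with a header row. "utf-8-sig" reads
    plain UTF-8 as well as UTF-8 with a byte-order mark, which matters
    because the data contain IPA characters.
    """
    with Path(path).open(newline="", encoding=encoding) as handle:
        return list(csv.DictReader(handle))


def summarize(rows):
    """Return (row count, column names) without assuming any column."""
    columns = list(rows[0].keys()) if rows else []
    return len(rows), columns
```

For example, `summarize(load_csv_rows("Phonotacticon1_0.csv"))` would report how many rows the file holds and which columns it actually provides, which is a sensible first check before any analysis.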

When using this database, please consult and cite the following article:

Joo, Ian and Yu-Yin Hsu (2025). “Phonotacticon: a cross-linguistic phonotactic database”. In: Linguistic Typology, 29.2, pp. 405-431. DOI: 10.1515/lingty-2023-0094

References
Mortensen, David R., et al. "PanPhon: A resource for mapping IPA segments to articulatory feature vectors." Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers. 2016.

Files

Files (2.7 MB)

Checksum | Size
md5:3860f61fa9dcbc4423f21b045579a829 | 2.3 MB
md5:a715af11e1ba27e95895b046eae80dad | 344.1 kB
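The MD5 checksums listed above can be used to confirm that a download is intact. The sketch below computes a file's MD5 digest with Python's `hashlib` and compares it with a listed value. The helper names `md5_hex` and `matches_listed_md5` are hypothetical, and because the listing does not show which checksum belongs to which file, it is left to the reader to compare each file against both values.

```python
import hashlib


def md5_hex(path, chunk_size=1 << 20):
    """Return the hexadecimal MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large files need not fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_md5(path, listed):
    """Compare a file's digest with a listed value such as 'md5:3860...'."""
    # Drop an optional "md5:" prefix and normalize case and whitespace.
    expected = listed.strip().split(":", 1)[-1].lower()
    return md5_hex(path) == expected
```

A call such as `matches_listed_md5("Phonotacticon1_0.csv", "md5:a715af11e1ba27e95895b046eae80dad")` returns `True` only if the local file's digest equals that listed value.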

Additional details

Related works

Is described by
Publication: 10.1515/lingty-2023-0094 (DOI)