Software Open Access
Suomela, Jukka
types2 is a tool for analysing textual diversity, richness, and productivity in text corpora and other data sets.
With this tool, we can analyse data sets from the perspective of the following statistics:
We are usually interested in comparing the number of types or hapaxes vs. the number of words or tokens. With types2, it is possible to analyse the relationship between types, hapaxes, words, and tokens.
The tool can be used for visualisation, statistical hypothesis testing, and exploratory data analysis. In the statistical analysis, we use nonparametric methods (more specifically, Monte Carlo permutation tests). The only modelling assumption is that, under the null hypothesis, individual “samples” are exchangeable.
The software is written by Jukka Suomela, and the system is designed and developed in collaboration with Tanja Säily.
Name | Size | |
---|---|---|
types-v2-release3.zip
md5:537aa469c20e3b952eee753c6c477da7 |
762.5 kB | Download |
All versions | This version | |
---|---|---|
Views | 158 | 158 |
Downloads | 14 | 14 |
Data volume | 10.7 MB | 10.7 MB |
Unique views | 156 | 156 |
Unique downloads | 14 | 14 |