Measures of divergence for binary data used in biodistance studies
Creators
- 1. The Cyprus Institute
- 2. Aristotle University of Thessaloniki
Description
Biodistance analysis can elucidate various aspects of past population structure. The most commonly adopted measure of divergence when estimating biodistances is the mean measure of divergence (MMD). The MMD is an unbiased estimator of population divergence but this property is lost when the dataset includes variables with very high or low frequency. In the present paper, we examine new measures of divergence based on untransformed binary data and the logit and probit transformations. It is shown that a measure of divergence based on untransformed data is a better unbiased estimator of population divergence. The conventional MMD is a satisfactory distance measure for binary data; however, it may produce biased estimations of population divergence when there are many traits with frequencies lower than 0.1 or/and greater than 0.9. Finally, the measures of divergence based on the probit and logit transformations are usually biased estimators.
Files
LICENSE.pdf
Files
(2.0 MB)
Name | Size | Download all |
---|---|---|
md5:cd0dde22d8b0e42d42d3f4a3b35b6673
|
484.1 kB | Preview Download |
md5:78b5bb7f53ac034dc15f06614369a10d
|
1.5 MB | Preview Download |