Published February 9, 2021 | Version v1
Journal article Open

Measures of divergence for binary data used in biodistance studies

  • 1. The Cyprus Institute
  • 2. Aristotle University of Thessaloniki

Description

Biodistance analysis can elucidate various aspects of past population structure. The most commonly adopted measure of divergence when estimating biodistances is the mean measure of divergence (MMD). The MMD is an unbiased estimator of population divergence but this property is lost when the dataset includes variables with very high or low frequency. In the present paper, we examine new measures of divergence based on untransformed binary data and the logit and probit transformations. It is shown that a measure of divergence based on untransformed data is a better unbiased estimator of population divergence. The conventional MMD is a satisfactory distance measure for binary data; however, it may produce biased estimations of population divergence when there are many traits with frequencies lower than 0.1 or/and greater than 0.9. Finally, the measures of divergence based on the probit and logit transformations are usually biased estimators.

Files

LICENSE.pdf

Files (2.0 MB)

Name Size Download all
md5:cd0dde22d8b0e42d42d3f4a3b35b6673
484.1 kB Preview Download
md5:78b5bb7f53ac034dc15f06614369a10d
1.5 MB Preview Download

Additional details

Funding

Promised – Promoting Archaeological Science in the eastern Mediterranean 811068
European Commission