Published January 25, 2022 | Version v1
Software Open

ModEst - Precise estimation of genome size from NGS data

  • 1. Senckenberg Society for Nature Research

Description

Accurate genome size estimates are important parameters for both theoretical and practical biodiversity genomics. We present a fast, easy-to-implement and precise method to estimate genome size from the number of bases sequenced and the mean sequencing depth. To estimate the latter, we exploit the fact that the Poisson distribution parameter lambda can be estimated precisely from truncated data, that is, from only the part of the sequencing depth distribution that represents the true underlying distribution. Simulations showed that reasonable genome size estimates can be obtained even from low-coverage (10X), highly discontinuous genome drafts. A comparison of estimates from a wide range of taxa and sequencing strategies with flow-cytometry estimates for the same individuals showed a very good fit and suggested that both methods yield comparable, interchangeable results.
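The idea described above can be illustrated with a minimal Python sketch. It is not the ModEst code itself: the function names, the histogram input format, and the truncation bounds `lo` and `hi` are assumptions made for illustration only. The sketch fits lambda by maximum likelihood to a depth histogram restricted to a window of depths (for a truncated Poisson distribution this amounts to matching the observed mean in the window to the truncated model mean), then divides the number of sequenced bases by the fitted lambda.

```python
import math


def truncated_poisson_mean(lam, lo, hi):
    """Mean of a Poisson(lam) variable conditioned on lo <= k <= hi."""
    depths = range(lo, hi + 1)
    # Log-weights avoid overflow in lam**k / k! at high depths.
    log_w = [k * math.log(lam) - math.lgamma(k + 1) for k in depths]
    shift = max(log_w)
    w = [math.exp(x - shift) for x in log_w]
    return sum(k * wk for k, wk in zip(depths, w)) / sum(w)


def estimate_lambda(depth_hist, lo, hi, iterations=200):
    """Fit lambda to a histogram {depth: n_sites} using only depths in [lo, hi].

    Hypothetical helper: the window [lo, hi] is chosen by the caller and is
    assumed to cover the part of the distribution that follows the Poisson model.
    """
    window = range(lo, hi + 1)
    n_sites = sum(depth_hist.get(k, 0) for k in window)
    if n_sites <= 0:
        raise ValueError("no sites inside the truncation window")
    obs_mean = sum(k * depth_hist.get(k, 0) for k in window) / n_sites
    if not lo < obs_mean < hi:
        raise ValueError("observed mean lies on a window boundary")

    # The truncated mean increases monotonically with lambda, so bracket the
    # root and bisect.
    low, high = 1e-3, float(max(hi, 1))
    while truncated_poisson_mean(low, lo, hi) > obs_mean:
        low /= 2.0
    while truncated_poisson_mean(high, lo, hi) < obs_mean:
        high *= 2.0
    for _ in range(iterations):
        mid = 0.5 * (low + high)
        if truncated_poisson_mean(mid, lo, hi) < obs_mean:
            low = mid
        else:
            high = mid
    return 0.5 * (low + high)


def estimate_genome_size(total_bases, lam):
    """Genome size as number of sequenced bases divided by mean depth."""
    return total_bases / lam


if __name__ == "__main__":
    # Synthetic histogram: Poisson(30) depths plus artificial excess at very
    # low and very high depth, which the truncation window ignores.
    hist = {k: 1e6 * math.exp(k * math.log(30.0) - 30.0 - math.lgamma(k + 1))
            for k in range(0, 121)}
    hist[1] += 5e4
    hist[200] = 2e4
    lam_hat = estimate_lambda(hist, lo=20, hi=40)
    print(lam_hat, estimate_genome_size(3.0e9, lam_hat))
```

Because only depths inside the window enter the fit, excess sites at very low or very high depth do not bias the estimate, provided the window itself is dominated by the Poisson component. How the window is chosen in practice is not covered by this sketch.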

Files

md5:2671220d4e95f36a093fb5f916cc5e9b
1.3 kB

Additional details

Related works

Is source of
10.5061/dryad.dr7sqvb0j (DOI)