Published July 16, 2024 | Version v1
Dataset Open

simulated datasets for evaluating polygenic detection methods

  • 1. ROR icon National Institute of Biomedical Genomics
  • 2. ROR icon Regional Centre for Biotechnology

Contributors

Supervisor:

  • 1. National Institute of Biomedical Genomics

Description

This dataset contains simulation files corresponding to a combination of each demographic model (1/2/3), environment (linear/quadratic), selection duration (200/400/600/800/1000), and simulation replicate(1-20). This resulted in 600 simulation files with 600 unique combinations of demographic models, environments, selection durations, and simulation replicates. For each individual in the genotype data file, we have the files containing the values of selective pressure(linear and quadratic environment) in the metadata folder. 

The variant position are 1-based which is default SLiM output. To compare the results with the causal loci user must make the positions 0-based (i.e. POS-1). The details are provided in a github tutorial.

Please refer to the documentation for a detailed description of the files and folder structure.

The article describing the simulated data and its application is accepted for publication in Nucleic Acids Research (https://doi.org/10.1093/nar/gkae1027).

Files

metadata.zip

Files (9.9 GB)

Name Size Download all
md5:d64dd528dda1212b5b97bd6dc331f803
1.9 kB Preview Download
md5:9c9f2f780a681d7087b9f8350756b269
240.5 kB Download
md5:893903acaf156e37b727da79b6cb78e5
3.6 GB Preview Download
md5:d3a16c36b629571242868fcfeb8c79bd
6.3 GB Preview Download