simulated datasets for evaluating polygenic detection methods
Authors/Creators
Description
This dataset contains simulation files corresponding to a combination of each demographic model (1/2/3), environment (linear/quadratic), selection duration (200/400/600/800/1000), and simulation replicate(1-20). This resulted in 600 simulation files with 600 unique combinations of demographic models, environments, selection durations, and simulation replicates. For each individual in the genotype data file, we have the files containing the values of selective pressure(linear and quadratic environment) in the metadata folder.
The variant position are 1-based which is default SLiM output. To compare the results with the causal loci user must make the positions 0-based (i.e. POS-1). The details are provided in a github tutorial.
Please refer to the documentation for a detailed description of the files and folder structure.
The article describing the simulated data and its application is accepted for publication in Nucleic Acids Research (https://doi.org/10.1093/nar/gkae1027).