Dataset Open Access

Relate-estimated coalescence rates, allele ages, and selection p-values for the 1000 Genomes Project

Speidel, Leo; Forest, Marie; Shi, Sinan; Myers, Simon R.


JSON-LD (schema.org) Export

{
  "inLanguage": {
    "alternateName": "eng", 
    "@type": "Language", 
    "name": "English"
  }, 
  "description": "<p><strong>Overview</strong></p>\n\n<p>Coalescence rates, allele ages, and p-values for evidence of positive selection calculated for 2478&nbsp;samples of the&nbsp;1000 Genomes Project&nbsp;using Relate.</p>\n\n<p>We estimated the joint genealogy of all 1000 GP populations and then extracted the embedded genealogy for each population.<br>\nFor the genealogy of each population, we jointly estimated the population size history and branch lengths.&nbsp;<br>\nVariants segregating in more than one&nbsp;population&nbsp;therefore have&nbsp;correlated but different allele ages in each population.</p>\n\n<p>Please refer to&nbsp;<a href=\"https://www.nature.com/articles/s41588-019-0484-x\">Speidel et al.&nbsp;Nature Genetics (2019)</a>&nbsp;for more details or email leo.speidel@outlook.com for any queries.</p>\n\n<p><strong>Coalescence rates</strong></p>\n\n<p>The zipped directory&nbsp;coalescence_rates.zip&nbsp;contains coalescence rates for 26 populations in the 1000 Genomes Project data set.</p>\n\n<ul>\n\t<li>The .coal files show the haploid coalescence rates, please refer to the&nbsp;<a href=\"https://myersgroup.github.io/relate/modules.html#PopulationSizeScript_FileFormats\">Relate documentation</a>&nbsp;for the file format.</li>\n\t<li>The popsize.RData file is an R data frame storing the diploid population sizes (0.5/coalescence rate) calculated using the .coal files. The columns of this data frame, named &quot;pop_size&quot;,&nbsp;are\n\t<ul>\n\t\t<li>gens_ago: Time in generations at which epoch starts. (To get years from generations, we multiply by 28.)</li>\n\t\t<li>population_size: Diploid population size in this epoch.</li>\n\t\t<li>population: Name of population&nbsp;</li>\n\t\t<li>region: Name of region (AFR, AMR, EAS, EUR, SAS)</li>\n\t</ul>\n\t</li>\n</ul>\n\n<p><strong>Allele ages and selection p-values</strong></p>\n\n<p>The zipped directories&nbsp;allele_ages_*.zip&nbsp;contain&nbsp;R&nbsp;data frames for each 1000GP population storing allele ages and selection p-values.<br>\nPlease note that only mutations that segregate in the population and map to a unique branch in the Relate-estimated marginal trees are included. Selection p-values are only provided for mutations of DAF &gt; 2 that pass quality filters (see Speidel et al., 2019).&nbsp;</p>\n\n<p>To get an age estimate for a neutral mutation, use&nbsp;0.5*(lower_age + upper_age). To get years from generations, we multiply by 28.</p>\n\n<p>The columns of these&nbsp;data frames, named &quot;allele_ages&quot;,&nbsp;are</p>\n\n<ul>\n\t<li>CHR: chromosome index</li>\n\t<li>BP: base-pair position (GRCh37)</li>\n\t<li>ID: id of SNP</li>\n\t<li>lower_age: Age in generations of coalescence event at the lower end of the branch onto which the mutation maps</li>\n\t<li>upper_age: Age in generations of coalescence event at the upper end of the branch onto which the mutation maps</li>\n\t<li>ancestral/derived: Ancestral/derived allele</li>\n\t<li>upstream: Upstream (5&#39;) allele</li>\n\t<li>downstream: Downstream (3&#39;) allele</li>\n\t<li>DAF: Derived-allele frequency</li>\n\t<li>pvalue: log10 p-value for selection evidence</li>\n</ul>", 
  "license": "http://creativecommons.org/licenses/by/4.0/legalcode", 
  "creator": [
    {
      "affiliation": "Department of Statistics, University of Oxford", 
      "@id": "https://orcid.org/0000-0002-4644-8033", 
      "@type": "Person", 
      "name": "Speidel, Leo"
    }, 
    {
      "affiliation": "Universit\u00e9 du Qu\u00e9bec \u00e0 Montr\u00e9al, Montr\u00e9al, Canada", 
      "@type": "Person", 
      "name": "Forest, Marie"
    }, 
    {
      "affiliation": "Department of Statistics, University of Oxford", 
      "@type": "Person", 
      "name": "Shi, Sinan"
    }, 
    {
      "affiliation": "Department of Statistics, University of Oxford", 
      "@id": "https://orcid.org/0000-0002-2585-9626", 
      "@type": "Person", 
      "name": "Myers, Simon R."
    }
  ], 
  "url": "https://zenodo.org/record/3234689", 
  "datePublished": "2019-05-29", 
  "version": "v1.0.0", 
  "keywords": [
    "Genetics", 
    "Genealogy", 
    "Population size", 
    "Allele age", 
    "Positive selection", 
    "1000 Genomes Project"
  ], 
  "@context": "https://schema.org/", 
  "distribution": [
    {
      "contentUrl": "https://zenodo.org/api/files/00eb7c26-b189-45be-af7b-ec9b8d2ab4d7/allele_ages_AFR.zip", 
      "encodingFormat": "zip", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/00eb7c26-b189-45be-af7b-ec9b8d2ab4d7/allele_ages_AMR.zip", 
      "encodingFormat": "zip", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/00eb7c26-b189-45be-af7b-ec9b8d2ab4d7/allele_ages_EAS.zip", 
      "encodingFormat": "zip", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/00eb7c26-b189-45be-af7b-ec9b8d2ab4d7/allele_ages_EUR.zip", 
      "encodingFormat": "zip", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/00eb7c26-b189-45be-af7b-ec9b8d2ab4d7/allele_ages_SAS.zip", 
      "encodingFormat": "zip", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/00eb7c26-b189-45be-af7b-ec9b8d2ab4d7/coalescence_rates.zip", 
      "encodingFormat": "zip", 
      "@type": "DataDownload"
    }
  ], 
  "identifier": "https://doi.org/10.5281/zenodo.3234689", 
  "@id": "https://doi.org/10.5281/zenodo.3234689", 
  "@type": "Dataset", 
  "name": "Relate-estimated coalescence rates, allele ages, and selection p-values for the 1000 Genomes Project"
}
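
The unit conversions mentioned in the description above can be sketched in a few lines. This is a minimal, hypothetical Python illustration, not code shipped with the dataset (the data themselves are R data frames). The function names are invented for this example, and the numeric inputs are made-up values, not taken from the files. It assumes, as the description states, that population size is 0.5 divided by the haploid coalescence rate, that a neutral allele age is the midpoint of lower_age and upper_age, that one generation is 28 years, and that the pvalue column stores log10(p).

```python
# Hypothetical helpers illustrating the conversions stated in the description.
# None of these names come from Relate or from the dataset itself.

GENERATION_TIME_YEARS = 28  # multiplier given in the dataset description


def diploid_population_size(haploid_coalescence_rate: float) -> float:
    """Diploid population size = 0.5 / haploid coalescence rate."""
    if haploid_coalescence_rate <= 0:
        raise ValueError("coalescence rate must be positive")
    return 0.5 / haploid_coalescence_rate


def neutral_allele_age(lower_age: float, upper_age: float) -> float:
    """Midpoint of the branch onto which the mutation maps, in generations."""
    return 0.5 * (lower_age + upper_age)


def generations_to_years(generations: float) -> float:
    """Convert generations to years using 28 years per generation."""
    return generations * GENERATION_TIME_YEARS


def pvalue_from_log10(log10_pvalue: float) -> float:
    """Recover p from the stored value, assuming the column holds log10(p)."""
    return 10.0 ** log10_pvalue


if __name__ == "__main__":
    # Made-up inputs, for illustration only.
    age_generations = neutral_allele_age(lower_age=100.0, upper_age=300.0)
    print(age_generations)                        # midpoint in generations
    print(generations_to_years(age_generations))  # same age in years
    print(diploid_population_size(2.5e-5))        # roughly 20,000 diploids
    print(pvalue_from_log10(-3.0))                # roughly 0.001
```

The same arithmetic applies directly to the columns of the "pop_size" and "allele_ages" data frames once they are loaded in R or exported to another format.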