Published June 15, 2021 | Version 0.1
Dataset Open

snpQT reference data

  • 1. Christina
  • 2. Benjamin
  • 3. Andrew
  • 4. William

Description

This dataset is a collection of other publicly available data, formatted for use with the snpQT pipeline, including:

  • An extra filtered 1,000 genome project plink2 dataset initially  downloaded from (http://dx.doi.org/10.5524/100516) for population stratification
  • EBI's latest release of 1,000 human genome VCF files for imputation and phasing (along with index files)
  • Reference FASTA, VCF and chain files for fixing strand issues, imputation, phasing and human genome build conversion
  • A TXT file including the high linkage disequilibrium regions in the human genome in hg37
  • Auxiliary TXT files for intermediate snpQT processes

Files

Files (33.7 GB)

Name Size Download all
md5:f98583e4460e5af6969be5b5323d881f
18.6 GB Download
md5:3252fe22e9df016ab6cf1c413885fc5d
15.0 GB Download

Additional details

Related works

Requires
Software: 10.5281/zenodo.4945217 (DOI)