There is a newer version of the record available.

Published March 18, 2022 | Version 2
Dataset Open

Simulated exome-sequencing data for a family study of lymphoid cancer

  • 1. Simon Fraser University

Description

This repository contains all the data files for a simulated exome-sequencing study of 150 families ascertained to contain at least four members affected with lymphoid cancer.

The simulated data can be found in the files section below. The files are:

  1. SLiM_output.txt - contains the SLiM-simulated, exome-wide SNV data generated under an American-admixture demographic model, for the American-admixed sub-population only.
  2. SLiM_output_chr8&9.txt - contains the SLiM-simulated data above for all source populations as well as the American-admixed sub-population, but only for chromosomes 8 and 9.
  3. sample_info.txt - contains pedigree information of all the disease-affected individuals and individuals connecting them along a line of descent, for all 150 ascertained pedigrees.
  4. Genotypes.zip - a zipfile that contains 22 text files of genotypes, one per chromosome. The genotypes are for simulated single-nucleotide variants on the exome and are in gene-dosage format.
  5. SNVmaps.zip - a zipfile that contains 22 text files, one per chromosome, giving the single-nucleotide variant information.
  6. familial_cRV.txt - contains the familial causal rare variants for all 150 ascertained pedigrees.
  7. study_peds.txt - contains the 150 pedigrees ascertained to contain four or more relatives affected with lymphoid cancer.
  8. PLINKfiles.zip - a zipfile that contains PLINK .fam, .bim and .bed files for all 22 chromosomes.
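As a small illustration of what "gene-dosage format" means for the genotype files, the sketch below codes each genotype as the number of copies of the variant allele an individual carries (0, 1 or 2) at each SNV. The function name `to_dosage` and the toy haplotypes are hypothetical; the actual row and column layout of the files in Genotypes.zip is not described here, so this shows only the coding, not how to parse those files.

```python
def to_dosage(maternal, paternal):
    """Sum two 0/1 haplotype vectors into per-SNV dosages (0, 1 or 2).

    Each input holds one entry per SNV: 1 if that haplotype carries
    the variant allele, 0 otherwise.
    """
    if len(maternal) != len(paternal):
        raise ValueError("haplotypes must cover the same SNVs")
    return [m + p for m, p in zip(maternal, paternal)]


# Toy example (invented values): four SNVs on one individual.
maternal = [0, 1, 0, 1]
paternal = [0, 0, 1, 1]
dosage = to_dosage(maternal, paternal)
print(dosage)  # [0, 1, 1, 2]
```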

All the scripts used to generate these data can be found in the GitHub repository archived at https://zenodo.org/record/6347546.

We have also uploaded one intermediate .Rdata file, Chromwide.Rdata, to save the user substantial time when running the associated RMarkdown script for the simulation. We recommend loading Chromwide.Rdata into your R workspace rather than generating it from scratch.

Notes

Funding acknowledgement: Natural Sciences and Engineering Research Council of Canada, RGPIN-04296-2018

Files

Files (7.8 GB)

MD5 checksum                          Size
md5:29e6927379eaf60481b000db956aec51  217.0 MB
md5:a7a59e9f3e604a8e91d1d4e4df2b95f1  2.4 kB
md5:1643e249b8e59cccdfddebb7c8fece8b  5.4 MB
md5:32b18df1794242f117f6fb15c27baadb  37.8 kB
md5:f1a4f477fd5ae1fd1b6fcaa414afd9c7  6.3 GB
md5:d9cd34ec8cb4b856f48955429aa42f09  1.3 GB
md5:9d4da1a8f016e76f3d2b9901e0920475  2.5 MB
md5:5e3acf233c02bad40a897d2ecf41502a  552.7 kB
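
The MD5 checksums listed for the files can be used to confirm that a download completed without corruption. Below is a minimal sketch using only Python's standard library. The helper names `md5_of_file` and `matches_listed_checksum` are hypothetical, and which checksum belongs to which file must be read from the record page itself, since the file names are not paired with the checksums in this listing.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks
    so that multi-gigabyte files do not have to fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path, listed):
    """Compare a file's MD5 with a checksum given as 'md5:<hex>' or '<hex>'."""
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

For a downloaded file, a call such as `matches_listed_checksum("path/to/downloaded_file", "md5:<checksum from the record page>")` returns `True` when the digests agree; a `False` result suggests re-downloading the file.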