There is a newer version of the record available.

Published September 16, 2024 | Version v2
Dataset Restricted

MASPOT panel GBS genotype data - v6.1 reference

  • 1. Aalborg University
  • 2. Cornell University

Description

Genotyping-by-sequencing (GBS) data for the MASPOT panel clones (762 clones in total, 755 with phenotypes), generated by Illumina sequencing of leaf tissue as described in Sverrisdottir et al. (2017). Biallelic variants were called relative to the v6.1 Phureja doubled monoploid (DM) reference genome. The SNPs were filtered to a root mean square mapping quality > 30, minor allele frequency (MAF) > 1 %, missing data < 50 %, and a minimum read depth of 5x. This leaves 175435 variants.
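As an illustration only, the four filtering thresholds can be expressed as a small per-variant check. This is a minimal sketch, not the pipeline used to produce the files: the function name passes_thresholds, its arguments, the default ploidy of 4, and the choice to treat genotype calls below the minimum depth as missing are all assumptions made for the example.

```python
from typing import Optional, Sequence


def passes_thresholds(
    mq: float,
    dosages: Sequence[Optional[int]],
    depths: Sequence[int],
    ploidy: int = 4,          # assumption: tetraploid clones
    min_mq: float = 30.0,     # RMS mapping quality must exceed this
    min_maf: float = 0.01,    # minor allele frequency must exceed this
    max_missing: float = 0.5, # missing fraction must stay below this
    min_depth: int = 5,       # assumption: applied per genotype call
) -> bool:
    """Return True if a biallelic variant passes all four thresholds.

    dosages holds the alternate-allele count per sample (None = missing);
    depths holds the read depth per sample at this site.
    """
    if not dosages or len(dosages) != len(depths):
        return False

    # Keep only calls that are present and sufficiently covered.
    called = [
        d for d, dp in zip(dosages, depths)
        if d is not None and dp >= min_depth
    ]

    missing_rate = 1.0 - len(called) / len(dosages)
    if missing_rate >= max_missing:
        return False

    # Alternate-allele frequency across all called chromosome copies.
    alt_freq = sum(called) / (ploidy * len(called))
    maf = min(alt_freq, 1.0 - alt_freq)

    return mq > min_mq and maf > min_maf
```

For example, passes_thresholds(40.0, [0, 1, 2, None], [10, 8, 6, 0]) evaluates a variant with one missing call out of four and an alternate-allele frequency of 0.25. The actual filtering may have applied the depth threshold differently (for instance per site rather than per call).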

This record does not include the GBS data of the 18 MASPOT parent lines, which are in preparation.

F1 sample names are in HEADER.SAMPLES; SNP identifiers and coordinates are in FILT3.KEY (both are outlined in the new readme.txt). The genotypes are in the SNP_V1.0_DMv6.vcf.FILT3_FINAL.DISC.gz file. The snp_counts.txt file lists the SNP counts for the full set of filtrations we have computed. For our purposes, we use only the discrete genotypes and the FILT3 filtration in version 1, which contains the 175435 variants.

Please see the readme.txt file for an explanation of the contents of each file.

For our analysis, we are using the FILT3 filtration settings (outlined above) and discrete scale genotypes. 

Files

Restricted

The record is publicly accessible, but files are restricted to users with access.