MASPOT panel GBS genotype data - v6.1 reference
Authors/Creators
- 1. Aalborg University
- 2. Cornell University
Description
Genotype-by-sequencing data of the MASPOT panel clones (762 clones in total, 755 with phenotypes) generated by Illumina sequencing of leaf tissue cf. (Sverrisdottir et al., 2017). Biallelic variants have been called relative to the v6.1 Phureja double monoploid reference genome. The SNPs have been filtered to root mean square mapping quality of > 30, MAF > 1 %, missing data < 50 %, and minimum reading depth of 5x. This leaves 175435 variants.
This does not include the GBS data of the 18 MASPOT parent lines. These are in preparation.
F1 sample names are in HEADER.SAMPLES, SNP identifyer and coordinates are in FILT3.KEY (both outlined in the new readme.txt. The genotypes are in the SNP_V1.0_DMv6.vcf.FILT3_FINAL.DISC.gz file. The snp_counts.txt is a file of the SNP counts in the full sets of filtration we have computed. For our purpose, we use only the discrete genotypes and the FILT3 filtration in version 1. This has the 175435 variants.
Please see the readme.txt file for explanation of the content of each file.
For our analysis, we are using the FILT3 filtration settings (outlined above) and discrete scale genotypes.