Published June 12, 2023 | Version v1
Dataset Open

A common resequencing‐based genetic marker dataset for global maize diversity

Description

Maize (Zea mays ssp. mays) populations exhibit vast amounts of genetic and phenotypic diversity. As sequencing costs have declined, an increasing number of projects have sought to measure genetic differences between and within maize populations using whole genome resequencing strategies, identifying millions of segregating single-nucleotide polymorphisms (SNPs) and insertions/deletions (InDels). Unlike older genotyping strategies like microarrays and genotyping by sequencing, resequencing should, in principle, frequently identify and score common genetic variants. However, in practice, different projects frequently employ different analytical pipelines, often employ different reference genome assemblies, and consistently filter for minor allele frequency within the study population. This constrains the potential to reuse and remix data on genetic diversity generated from different projects to address new biological questions in new ways. Here we employ resequencing data from 1,276 previously published maize samples and 239 newly resequenced maize samples to generate a single unified marker set of ~366 million segregating variants and ~46 million high confidence variants scored across crop wild relatives, landraces as well as tropical and temperate lines from different breeding eras. We demonstrate that the new variant set provides increased power to identify known causal flowering time genes using previously published trait datasets, as well as the potential to track changes in the frequency of functionally distinct alleles across the global distribution of modern maize.

Notes

Files can be processed with software such as bcftools, bedtools, Plink, TASSEL, or basic Linux programs like awk or sed.

*_imputed.vcf.gz

Files with variants for 1,515 maize individuals, each for one chromosome.

WiDiv.vcf.gz

File with variants for 752 maize inbred lines from Wisconsin Diversity Panel.

Funding provided by: U.S. Department of Energy
Crossref Funder Registry ID: http://dx.doi.org/10.13039/100000015
Award Number: DE-SC0020355

Funding provided by: National Science Foundation
Crossref Funder Registry ID: http://dx.doi.org/10.13039/100000001
Award Number: OIA-182678

Funding provided by: National Institute of Food and Agriculture
Crossref Funder Registry ID: http://dx.doi.org/10.13039/100005825
Award Number: 2021-67021-35329

Funding provided by: Foundation for Food and Agriculture Research
Crossref Funder Registry ID: http://dx.doi.org/10.13039/100011929
Award Number: 602757

Funding provided by: Narodowym Centrum Nauki
Crossref Funder Registry ID: http://dx.doi.org/10.13039/501100004442
Award Number: 2012/05/B/NZ9/03407

Funding provided by: Narodowym Centrum Nauki
Crossref Funder Registry ID: http://dx.doi.org/10.13039/501100004442
Award Number: 2017/27/B/NZ9/00995

Files

README.md

Files (12.6 GB)

Name Size Download all
md5:078486ccbd23f64adb8f0e7b2bf7499f
696.7 MB Download
md5:7ab194fc9fa0f165f7840f494438985a
143.5 kB Download
md5:271d9213182175649fba9ec0fdc1ca48
1.4 GB Download
md5:45216c3ac045ad88aad9b16ddcc87be6
288.8 kB Download
md5:e1b7f4c2d7b6618077a8044a52bafb83
1.1 GB Download
md5:21a6df586e6fd487b2cffd9e28213528
228.0 kB Download
md5:ec9133243adcccd1c195734404c39e5c
1.1 GB Download
md5:ccf2db3bd590a6a706ad68f8e4894d9b
222.7 kB Download
md5:357a5fd02413ee34319925b184c00154
1.1 GB Download
md5:c491aa1b1976df2b3fd33a04f6ce9e0c
235.1 kB Download
md5:5f929b473378afacf0438d9f943015a1
958.2 MB Download
md5:0e5b43fa167772238f2fe1425da7a774
212.4 kB Download
md5:041d0a8ad300e60b1058a1807a566ae5
748.7 MB Download
md5:ea4c1bd7f1006c27e947be06a408aaca
168.1 kB Download
md5:52c00b145769a5445baa6e96d1ca5518
800.2 MB Download
md5:4a85aee9d589cb18efd6cfca7b18d3d9
172.0 kB Download
md5:58ac8d259fafda56ab5efe6df052df2b
802.5 MB Download
md5:bde89c8a829a86f7d851eeab3d088b35
172.1 kB Download
md5:ffd7f8dc010852b80582c91c93534eb0
746.3 MB Download
md5:95cf96d7a46855710693e1ad1b318318
152.0 kB Download
md5:6ab64f8ee3225ac7fd0c55e8e04bb77a
1.0 kB Preview Download
md5:52925f51561b94118b2b03ebcc0841db
3.1 GB Download
md5:286e5cb871cc4d7727cc0dbe8fe8526e
1.8 MB Download

Additional details

Related works

Is cited by
10.1111/tpj.16123 (DOI)