Published January 18, 2022 | Version v1

Chromosome evolution and the genetic basis of agronomically important traits in greater yam

Description

The nutrient-rich tubers of the greater yam, Dioscorea alata L., provide food and income security for millions of people around the world. Despite its global importance, however, greater yam remains an 'orphan crop.' Here we address this resource gap by presenting a highly contiguous chromosome-scale genome assembly of D. alata combined with a dense genetic map derived from African breeding populations. The genome sequence reveals an ancient allotetraploidization in the Dioscorea lineage, followed by extensive genome-wide reorganization. Using our new genomic tools we find quantitative trait loci for resistance to anthracnose, a damaging fungal pathogen of yam, and several tuber quality traits. Genomic analysis of breeding lines reveals both extensive inbreeding as well as regions of extensive heterozygosity that may represent interspecific introgression during domestication. These tools and insights will enable yam breeders to unlock the potential of this staple crop and take full advantage of its adaptability to varied environments.

Notes

Phenotyping datasets Yam anthracnose disease (YAD) severity scale:
1 0%, no symptoms (highly resistant)
2 1–25% (moderately resistant)
3 25–50% (resistant)
4 50–75% (susceptible)
5 >75% (highly susceptible)
- Missing datum
YAD field assay: 

Visual scoring three months after planting of TDa1401, TDa1402, TDa1403, TDa1419 and TDa1427 for years 2017 and 2018. Up to three plants per genotype scored and averaged per year. (Scaled phenotype measurements not used)

YAD detached leaf assay (DLA): 

Leaf infection area measured for three ~3 month-old leaves per plant. Populations evaluated: TDa1401, TDa1402, TDa1403, TDa1419, TDa1427, TDa1506, and TDa1512. (Scaled phenotype measurements not used)

 

Tuber traits:
FreshWeightGrams Tuber fresh weight (grams).
DryWeightGrams Tuber weight after 16 hrs drying at 105 C (grams).
Oxy0Mins Oxidative browning after 0 minutes after cutting (MAC).
Oxy30Mins Oxidative browning after 30 MAC.
Oxy60Mins Oxidative browning after 60 MAC.
Oxy180Mins Oxidative browning after 180 MAC.
VisualColor Qualitative color of tuber (white, cream, orange, purple).
L CIELAB lightness reading. >0 = lighter; <0 = darker.
A CIELAB red/green reading. >0 = redder; <0 = greener.
B CIELAB yellow/blue reading. >0 = yellower; <0 = bluer.
H Munsell (HVC) Hue reading. Basic color degree: 0–100).
V Munsell (HVC) Value reading. >0 = lighter; 0 = dark.
C Munsell (HVC) Chroma reading. >0 = intense color; 0 = grey.
CORM Presence or absence of corm. 0 = Absent; 1 = Present.
CORSEP The ability of corm to separate. 0 = No; 1 = Yes.
CORTYP Corm type. 1 = regular; 2 = transversally elongated; 3 = branched.
TBRS Tuber shape. 1 = spherical/round; 2 = oval; 3 = cylindrical; 5 = irregular.
TBRSZ Tuber size. 1 = small (less than 15 cm length); 2 = medium (between 15 and 25 cm in length); 3 = big (more than 25 cm in length).
TBRST Tuber surface texture. 1 = smooth; 2 = rough.
RTBS Roots on tuber. 0 = no roots; 2 = Few; 3 = Many.
PRTBS Position of roots on tuber. 1 = Lower; 2 = Middle; 3 = Upper; 4 = Entire tuber.

Missing values encoded as "-".
 

DArTseq genotyping datasets Metadata columns in the file:
AlleleID Unique identifier for the sequence in which the SNP marker occurs.
AlleleSequence In 1 row format: the sequence of the Reference allele. In 2 rows format: the sequence of the Reference allele is in the Ref row, the sequence of the SNP allele in the SNP row.
AvgCountRef The sum of the tag read counts for all samples, divided by the number of samples with non-zero tag read counts, for the Reference allele row.
AvgCountSnp The sum of the tag read counts for all samples, divided by the number of samples with non-zero tag read counts, for the SNP allele row.
AvgPIC The average of the polymorphism information content (PIC) of the Reference and SNP allele rows.
CallRate The proportion of samples for which the genotype call is either "1" or "0", rather than "-".
FreqHets The proportion of samples which score as heterozygous.
FreqHomRef The proportion of samples which score as homozygous for the Reference allele.
FreqHomSnp The proportion of samples which score as homozygous for the SNP allele.
OneRatioRef The proportion of samples for which the genotype score is "1", in the Reference allele row.
OneRatioSnp The proportion of samples for which the genotype score is "1", in the SNP allele row.
PICRef The polymorphism information content (PIC) for the Reference allele row.
PICSnp The polymorphism information content (PIC) for the SNP allele row.
RepAvg The proportion of technical replicate assay pairs for which the marker score is consistent.
SNP In 1 row format: contains the base position and base variant details. In 2 rows format: this column is blank in the Reference row, and contains the base position and base variant details in the SNP row.
SnpPosition The position (zero indexed) in the sequence tag at which the defined SNP variant base occurs.
TrimmedSequence Same as the full sequence, but with removed adapters in short marker tags.

 

Blast columns (each column starting with is):
AlnCnt_* Total count of aligning markers / tags with selection criteria described below.
AlnEvalue_* E value of the best alignment to an existing model genome.
ChromPos_* Position(s) on contig(s) with the best alignment of marker / tag to an existing model genome.
Chrom_* Contig(s) with the best alignment of marker / tag to an existing model genome.

 

Header rows:
1 Order number where sample belongs to - important for multi-orders reports.
2 DArT plate barcode.
3 Client plate barcode.
4 Well row position.
5 Well column position.
6 Sample comments.
7 Genotype name.

 

Genotyping calls (SNP 1-row format):
0 Reference allele homozygote.
1 SNP allele homozygote.
2 Heterozygote.
- Double null/null allele homozygote (absence of fragment with SNP in genomic representation).

 

Genotyping calls (SNP 2-row format):

Each allele scored in a binary fashion. Heterozygotes are therefore scored as 1/1 (presence for both alleles/both rows).

0 Allele absent.
1 Allele present.

 

Genetic linkage maps

Genetic linkage maps are in PLINK MAP format: https://www.cog-genomics.org/plink/1.9/formats#map

Columns in MAP file:
Chr Name of genomic scaffold.
Marker Genetic marker identifier.
Genetic position Genetic linkage group position (centiMorgans).
Genomic position Genomic scaffold position (bp).
 

Funding provided by: National Science Foundation
Crossref Funder Registry ID: http://dx.doi.org/10.13039/100000001
Award Number: 1543967

Files

TDa1401.csv

Files (183.4 MB)

Name Size Download all
md5:3e785c6256816a36afdc65a9f45ee408
2.5 MB Download
md5:2ed3aa30c09f3df235843a05b1761228
3.0 MB Download
md5:a43b166fa9bfa3f63fb693315db0a21b
619.8 kB Download
md5:72209a89b3818d5d69855111972bfcea
82.8 kB Download
md5:98be249c2066d35606fd61c82a9588df
67.3 kB Download
md5:3a1be4b83c43598683957e72c1dc5e6f
46.4 kB Download
md5:39f0e084ff0c67fb9e4a3303ba812858
14.4 MB Preview Download
md5:552b77a988ea03059114faac402e3fa3
29.6 MB Preview Download
md5:49470666ca3522457f00a5dcec2fc8b9
13.3 MB Preview Download
md5:091f45b33a7173aa782045007e54bd50
29.2 MB Preview Download
md5:75809f48356fc89d14729b5ca01309fe
7.6 MB Preview Download
md5:9f49e33570129f8aec7139b2ebff87c7
16.8 MB Preview Download
md5:42bdc4ad78578ab893877f10f7492240
18.1 MB Preview Download
md5:c92281e06772cbcb84fe05c2ec5c0956
10.9 MB Preview Download
md5:b62baec7ba215b82b4735d646c21f929
4.5 MB Preview Download
md5:7ac7c58d33b4cd7c8b7ebd32d93363e2
11.3 MB Preview Download
md5:e84d08dab7d4ebd22f25f5d6e8ea2e4a
6.9 MB Preview Download
md5:d707608f4fe0b7dbfa617c6c5981ed55
4.7 MB Preview Download
md5:c53235014be9f75ccc77a6325ebe62cd
9.7 MB Preview Download

Additional details

Related works

Is cited by
10.1101/2021.04.14.439117 (DOI)