Published August 11, 2020 | Version v1
Dataset Open

Generation of a chromosome-scale genome assembly of the insect-repellant terpenoid-producing Lamiaceae species, Callicarpa americana

Description

Background: Plants exhibit wide chemical diversity due to production of specialized metabolites which function as pollinator attractants, defensive compounds, and signaling molecules. Lamiaceae (mints) are known for their chemodiversity and have been cultivated for use as culinary herbs and as sources of insect repellents, health-promoting compounds, and fragrance.

Findings: We report the chromosome-scale genome assembly of Callicarpa americana L. (American beautyberry), a species within the early diverging Callicarpoideae clade of the Lamiaceae, known for its metallic purple fruits and use as an insect repellent due to its production of terpenoids. Using long reads and Hi-C scaffolding, we generated a 506.1 Mb assembly spanning 17 pseudomolecules with an N50 contig and N50 scaffold size of 7.5 Mb and 29.0 Mb, respectively. A total of 32,164 genes was annotated, including 53 candidate terpene synthases and 47 putative clusters of specialized metabolite biosynthetic pathways. Whole genome duplication analyses revealed three putative events, which, together with local tandem duplication events, contributed to gene family expansion of terpene synthases. Kolavenyl diphosphate is a gateway to many of C. americana's bioactive terpenoids; experimental validation confirmed that CamTPS2 encodes kolavenyl diphosphate synthase. Syntenic analyses with Tectona grandis L. f. (teak), a member of the Tectonoideae clade of Lamiaceae known for exceptionally strong wood resistant to insects, revealed 963 collinear blocks and 21,297 C. americana syntelogs.

Conclusions: Access to the C. americana genome provides a roadmap for rapid discovery of genes encoding plant-derived agrichemicals and a key resource to understand the evolution of chemical diversity in Lamiaceae.
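The N50 values quoted above summarize assembly contiguity: N50 is the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total assembly length. The following is a minimal sketch of that calculation, not the authors' pipeline; the function name `n50` and the toy contig lengths are illustrative assumptions and are not taken from the dataset.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Hypothetical helper for illustration; not part of the published dataset.
    """
    if not lengths:
        raise ValueError("no sequence lengths given")
    half_total = sum(lengths) / 2
    running = 0
    # Walk from the longest sequence down, accumulating length until
    # at least half of the total assembly length is covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if running >= half_total:
            return length


# Toy contig lengths (arbitrary units), invented for this example.
toy_contigs = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_contigs))  # prints 70: 80 + 70 = 150, half of the 300 total
```

Applied to the lengths of the real contigs or scaffolds, the same function would be expected to reproduce the reported contig and scaffold N50 values, assuming the same sequence set and definition were used.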

 

Files

Callicarpa_RNA-Seq_TPM_expression_matrix.txt

Files (1.2 GB)

MD5 checksum                             Size
md5:c8d5e4bc93b9e4f40ce430865ac76d18    7.2 MB
md5:44b455b285974819aa2d0b58736e486b    6.1 MB
md5:6969e2c4444e3d97a08d2c6cf7a1df4c    135.5 MB
md5:954cd1d05e31415c3ae5fa4722cac39f    88.0 MB
md5:c5b6439d014d9b35196af97f1bc73fd7    68.5 MB
md5:db46267c0299d91ea45f7240835315d4    30.2 MB
md5:0d51957167d44310525b159b0795c14a    578.5 kB
md5:9e567df76ede06a06638d166bfb11288    28.0 MB
md5:7533619fd444b8ebfa9b175d7916aea9    13.6 MB
md5:77e9d1aee488bbc14be46d649ebd0868    139.6 MB
md5:659ecb13f863e21e71c2d6d016e5e087    91.3 MB
md5:e8a7ef891b2dcc40ee7962d99072e7cd    3.7 MB
md5:9448f8eff9262812095977ec47eb2292    73.7 MB
md5:8a26a5b8d7850fa3a74a05e27eb578c1    31.3 MB
md5:4a6979e43d6e2c44a9aefcdda02e6f93    514.8 MB
md5:6aecbf9f1e0cd312eccdaa311aba1246    84.0 kB
md5:7c1bbe3bdd021ba9150e41aa422a54fd    1.1 kB
md5:c116fc549c754b68e6732e35caa8dce5    48.4 kB
md5:ebc2c24492246785db41ab70c2b3b28b    617 Bytes
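
The MD5 checksums listed above can be used to confirm that a downloaded file arrived intact. The sketch below uses Python's standard `hashlib` module; the helper names `md5_of_file` and `matches_listing` are assumptions introduced for illustration, and which checksum belongs to which file must be taken from the record itself.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks.

    Hypothetical helper; chunked reading keeps memory use low for the
    larger files in the listing.
    """
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, listed):
    """Compare a local file against a listed value such as 'md5:c8d5...'."""
    # Drop the optional 'md5:' prefix used in the file listing.
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

For example, `matches_listing("downloaded_file", "md5:...")` would return `True` only if the local copy hashes to the listed value; both arguments here are placeholders, to be replaced with a real local path and the checksum shown for that file.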