Published August 14, 2020 | Version v1
Dataset Open

Modeling cannabinoids from a large-scale sample of Cannabis sativa chemotypes

  • 1. University of Colorado Boulder
  • 2. Front Range Biosciences*

Description

The widespread legalization of Cannabis has opened the industry to using contemporary analytical techniques for chemotype analysis. Chemotypic data has been collected on a large variety of oil profiles inherent to the cultivars that are commercially available. The unknown gene regulation and pharmacokinetics of dozens of cannabinoids offer opportunities of high interest in pharmacology research.  Retailers in many medical and recreational jurisdictions are typically required to report chemical concentrations of at least some cannabinoids. Commercial cannabis laboratories have collected large chemotype datasets of diverse Cannabis cultivars. In this work a data set of 17,600 cultivars tested by Steep Hill Inc., is examined using machine learning techniques to interpolate missing chemotype observations and cluster cultivars into groups based on chemotype similarity.   The results indicate cultivars cluster based on their chemotypes, and that some imputation methods work better than others at grouping these cultivars based on chemotypic identity. Due to the missing data and to the low signal to noise ratio for some less common cannabinoids, their behavior could not be accurately predicted. These findings have implications for characterizing complex interactions in cannabinoid biosynthesis and improving phenotypical classification of Cannabis cultivars.

Notes

Funding provided by: Agricultural Genomics Foundation
Crossref Funder Registry ID: http://dx.doi.org/None

Funding provided by: Natural Hazards Center, University of Colorado Boulder
Crossref Funder Registry ID: http://dx.doi.org/10.13039/100008618
Award Number: gift fund 13401977-Fin8

Files

DEN.csv

Files (16.2 MB)

Name Size Download all
md5:fb8e1b521cac07cef46afb9d81d0c1ea
63.6 kB Download
md5:ffc59e5541bce2978bf30f35e8865ff2
5.7 MB Download
md5:d50dc19f7e124190a14e86c69801e504
535.5 kB Preview Download
md5:de32e192d98426e6e37d7b55f738b6c7
501.7 kB Preview Download
md5:7310ffa31ad5950b57a4cf027ec95ab9
486.5 kB Download
md5:23f069541e8423cda9c97bd3fbdde766
559.5 kB Download
md5:2b3ef8727d56682859ffe6302b0de8bc
38.2 kB Preview Download
md5:76ed82749662cc678fa40d7ad72f6bfb
2.9 MB Preview Download
md5:f396def473a04408a96733ac6db844b4
728.0 kB Preview Download
md5:7638ac8701aead3ef0c7d9ed0611dea6
455.2 kB Preview Download
md5:12713a6a78ae7837cc893205eaf43920
1.8 MB Preview Download
md5:8387f2f974c693beb90d5c6c6ca9e707
136.8 kB Download
md5:45d10516664feeae6d3df9b6393ecf72
457.2 kB Preview Download
md5:03837e6368ba87e702f667cef32b8236
1.9 MB Preview Download

Additional details

Related works