Published March 31, 2020 | Version v3
Other Open

Model-driven generation of artificial yeast promoters

  • 1. Stanford University Dept. of Bioengineering
  • 2. Stanford University Dept. of Bioengineering, Chan Zuckerberg Biohub

Description

Supplementary Data for "Model-driven generation of artificial yeast promoters" (Kotopka BJ, Smolke CD. Model-driven generation of artificial yeast promoters. Nat Commun. 2020;11(1):2113.)

Abstract:

Promoters play a central role in controlling gene regulation; however, a small set of promoters is used for most genetic construct design in the yeast Saccharomyces cerevisiae. Generating and utilizing models that accurately predict protein expression from promoter sequences would enable rapid generation of useful promoters and facilitate synthetic biology efforts in this model organism. We measure the gene expression activity of over 675,000 sequences in a constitutive promoter library, and over 327,000 sequences in an inducible promoter library. Training an ensemble of convolutional neural networks jointly on the two datasets enables very high (R2 > 0.79) predictive accuracies on multiple sequence-activity prediction tasks. We describe model-guided design strategies which yield large, sequence-diverse sets of promoters exhibiting activities higher than those represented in training data and similar to current best-in-class sequences. Our results show the value of model-guided design as an approach for generating useful DNA parts.

Notes

Updated files uploaded in Version 3 reflect changes to the analysis and presentation of data in a new draft of the manuscript (2020-03).

Files

Promoter Design Data_2020-03.zip

Files (1.1 GB)

Name Size Download all
md5:e9e022a8ca5d39e7736fe9c2c5910bdb
1.1 GB Preview Download
md5:36f4b4609497c03666063dbfe8428305
4.0 MB Download

Additional details

References

  • Kotopka BJ, Smolke CD. Model-driven generation of artificial yeast promoters. Nat Commun. 2020;11(1):2113.