There is a newer version of the record available.

Published October 6, 2023 | Version v1
Conference paper Open

Polygraph: A Software Framework for the Systematic Assessment of Synthetic Regulatory DNA Elements

  • 1. College of Computing, Georgia Institute of Technology
  • 2. Department of Biology Research | AI Development, Genentech Research and Early Development

Description

The design of regulatory elements is pivotal in numerous therapeutic interventions, including gene and cell therapy, wherein the typical objective is to engineer DNA sequences exhibiting specific attributes like cell-type specificity and elevated expression levels. However, the systematic assessment of these constructed DNA sequences remains challenging due to the absence of robust metrics and an integrated software framework. Here, we introduce Polygraph, a Python framework for evaluating synthetic DNA sequences. Polygraph provides a variety of features to streamline the synthesis and scrutiny of regulatory elements, incorporating features like a diversity index, motif and k-mer composition, similarity to endogenous regulatory sequences, and screening with predictive and foundational models. Consequently, Polygraph stands as a formidable instrument for the assessment of synthetic regulatory sequences, enabling expedited progress in therapeutic applications and enhancing our comprehension of gene regulatory mechanisms.

Code can be found at the following repository: github.com/Genentech/polygraph

Files

human_seqs.txt

Files (363.3 MB)

Name Size Download all
md5:b9d725df369863522c2f3721e7a7c616
61.8 MB Download
md5:7ee1327effe2d17b98be461eac36ea46
288.4 MB Download
md5:7afe4fcc52265fc7d22c58d8e442b4d9
2.7 MB Preview Download
md5:8edbf14802a7876a321b9d9787167ac7
22.7 kB Preview Download
md5:6c51676ea63169e1fef99b243fc5b249
133.2 kB Preview Download
md5:6a242663c568862b0492a6219b6ae8c3
10.0 MB Download
md5:fff9544ad52d7745b7f53f6a61311404
192.1 kB Preview Download