Published August 24, 2015 | Version v1
Dataset Open

DeepSEA model files

Description

This CNN is based on the DeepSEA model from Zhou and Troyanskaya (2015). The model has been converted to a pytorch model on a modified version of https://github.com/clcarwin/convert_torch_to_pytorch Use this model only for predictions of sequences, not for variant effect prediction. The model automatically generates reverse-complement and averages over forward and reverse-complement to results from the website. To predict variant effects use the DeepSEA/variantEffects model. It categorically predicts 919 cell type-specific epigenetic features from DNA sequence. The model is trained on publicly available ENCODE and Roadmap Epigenomics data and on DNA sequences of size 1000bp. The input of the tensor has to be (N, 4, 1, 1000) for N samples, 1000bp window size and 4 nucleotides. Per sample, 919 probabilities of a specific epigentic feature will be predicted.

Notes

This dataset is used in the Kipoi model zoo for the DeepSEA model. License: CC-BY 3.0

Files

Files (422.7 MB)

Name Size Download all
md5:89e640bf6bdbe1ff165f484d9796efc7
211.4 MB Download
md5:35956ab9c28960b5a3693f470fe980c1
211.4 MB Download