There is a newer version of the record available.

Published July 16, 2018 | Version v1
Dataset Open

Deep learning sequence-based ab initio prediction of variant effects on expression and disease risk (sequence model release)

Creators

  • 1. Simons Foundation and Princeton University

Description

This is the deep learning sequence model used in 

Jian Zhou, Chandra L. Theesfeld, Kevin Yao, Kathleen M. Chen, Aaron K. Wong, and Olga G. Troyanskaya, Deep learning sequence-based ab initio prediction of variant effects on expression and disease risk, Nature Genetics, 2018.

Note the full software is available from https://github.com/FunctionLab/ExPecto and this release is created for the convenience of use and under the same non-commercial license. The model is in serialized torch t7 format that can be loaded in pytorch too with load_lua function. We also provide a web server for browsing mutations with strong predicted effects at https://hb.flatironinstitute.org/expecto/, which are currently limited to mutations within 1kb to TSS or are 1000 Genomes variants.

Trivia: we code-named our models with whale names. This model has an unofficial codename DeepSEA "Beluga".

Files

Files (1.3 GB)

Name Size Download all
md5:8ddcd81fa2afb0b59e2b3e83cc36dfb9
1.3 GB Download