Dataset Open Access
Edoardo Giacopuzzi;
Niko Popitsch;
Jenny C Taylor
GREEN-DB is a standardized collection of 2.4 million regulatory elements in the human genome. Where available, information on the controlled gene(s), tissue(s), and associated phenotype(s) is provided for each element. We also calculated a variation constraint metric (range 0-1) for these regulatory regions and showed that genes controlled by constrained regions are enriched for disease-associated genes and for essential genes identified in mouse knock-out screens.
The database also includes information from ENCODE TFBS and DNase peaks, ultra-conserved non-coding elements (UCNE), and super-enhancers (dbSuper).
This release includes 2 files: GREEN-DB_v2_bedfiles.tar.gz (BED files) and GREEN-DB_v2_SQLite.tar.gz (SQLite database).
To annotate a VCF file with information from GREEN-DB, you can use our GREEN-VARAN tools (https://github.com/edg1983/GREEN-VARAN).
For more information on GREEN-DB, please refer to our publication (https://doi.org/10.1101/2020.09.17.301960) and to the online documentation (https://green-varan.readthedocs.io/en/latest/).
GREEN-DB is free to use for academic users; please refer to the attached LICENSE file for details.
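Because one of the archives ships the database in SQLite format, a quick way to get oriented is to list the tables and columns it actually contains before writing any queries. The following is a minimal sketch using only Python's standard `sqlite3` module. It does not assume any particular GREEN-DB table or column names. The function name `list_tables` and the file name in the usage comment are hypothetical; substitute the path of the database file extracted from the archive.

```python
import sqlite3


def list_tables(db_path):
    """Return {table_name: [column names]} for every table in an SQLite file."""
    # Open the file read-only through a URI so the database is never modified.
    conn = sqlite3.connect(f"file:{db_path}?mode=ro", uri=True)
    try:
        names = [
            row[0]
            for row in conn.execute(
                "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
            )
        ]
        schema = {}
        for name in names:
            # PRAGMA table_info returns one row per column; index 1 is the column name.
            columns = conn.execute(f'PRAGMA table_info("{name}")').fetchall()
            schema[name] = [col[1] for col in columns]
        return schema
    finally:
        conn.close()


# Usage (hypothetical file name, adjust to the extracted database file):
# for table, columns in list_tables("GREEN-DB_v2.db").items():
#     print(table, columns)
```

The authoritative description of the schema is in the online documentation; this sketch only reports what is present in the file.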
Name | Size | MD5 |
---|---|---|
GREEN-DB_v2_bedfiles.tar.gz | 604.7 MB | af16ac7b7284845a630366fe9a165a7c |
GREEN-DB_v2_SQLite.tar.gz | 6.4 GB | a0c42f280c3ad826a916ee8b86a248c6 |
LICENSE.pdf | 69.4 kB | 9e3ec86b01cca0759f05da79676ce931 |
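The MD5 checksums listed for each file can be used to confirm that a download completed without corruption, which is worth doing for the multi-gigabyte SQLite archive. The following is a minimal sketch, assuming the files were saved under their original names. The function names `md5_of_file` and `verify_download` are illustrative and not part of any GREEN-DB tooling.

```python
import hashlib
from pathlib import Path

# MD5 checksums as listed for the files in this record.
EXPECTED_MD5 = {
    "GREEN-DB_v2_bedfiles.tar.gz": "af16ac7b7284845a630366fe9a165a7c",
    "GREEN-DB_v2_SQLite.tar.gz": "a0c42f280c3ad826a916ee8b86a248c6",
    "LICENSE.pdf": "9e3ec86b01cca0759f05da79676ce931",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Chunked reading keeps memory use flat even for the 6.4 GB archive.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path):
    """Return True if the file's MD5 matches the checksum listed for its name."""
    path = Path(path)
    expected = EXPECTED_MD5.get(path.name)
    if expected is None:
        raise KeyError(f"No checksum listed for {path.name}")
    return md5_of_file(path) == expected


if __name__ == "__main__":
    # Check whichever of the listed files are present in the current directory.
    for name in EXPECTED_MD5:
        if Path(name).exists():
            print(name, "OK" if verify_download(name) else "MISMATCH")
```

A mismatch usually indicates an interrupted or corrupted download, in which case downloading the file again is the simplest fix.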