GREEN-DB: Genomic Regulatory Elements ENcyclopedia
- 1. Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
Description
GREEN-DB is a standardized collection of 2.4 million regulatory elements in the human genome. Information on controlled gene(s), tissue(s) and associated phenotype(s) are provided for each element when possible. We also calculated a variation constraint metric (range 0-1) for these regulatory regions and showed that genes controlled by constrained regions are enriched for disease-associated genes and essential genes from mouse knock-out screenings.
The database also includes information from ENCODE TFBS and DNase peaks; ultra-conserved non-coding elements (UCNE), and super-enhancers (dbSuper).
This release includes 2 files:
- RegulatoryRegions.db.gz: The full database is available in SQLite format
- GREEN-DB_v2_bedfiles.tar.gz: Contains 2 BED files describing the regulatory regions and assciated information useful for variant annotations (controlled genes, closest gene, constraint metric).
To annotate a VCF file with information from GREEN-DB you can use our tools GREEN-VARAN (https://github.com/edg1983/GREEN-VARAN).
For more information on the GREEN-DB please refer to our publication (https://doi.org/10.1101/2020.09.17.301960) and to online documentation (https://green-varan.readthedocs.io/en/latest/)
GREEN-DB is free to use for academic users, please refer to the attached LICENSE file.
Files
LICENSE.pdf
Files
(7.0 GB)
Name | Size | Download all |
---|---|---|
md5:af16ac7b7284845a630366fe9a165a7c
|
604.7 MB | Download |
md5:a0c42f280c3ad826a916ee8b86a248c6
|
6.4 GB | Download |
md5:9e3ec86b01cca0759f05da79676ce931
|
69.4 kB | Preview Download |
Additional details
Related works
- Is documented by
- Preprint: 10.1101/2020.09.17.301960 (DOI)