There is a newer version of the record available.

Published August 12, 2020 | Version v2.0
Dataset Open

GREEN-DB: Genomic Regulatory Elements ENcyclopedia

  • 1. Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK

Description

GREEN-DB is a standardized collection of 2.4 million regulatory elements in the human genome. Where available, information on the controlled gene(s), tissue(s) and associated phenotype(s) is provided for each element. We also calculated a variation constraint metric (range 0-1) for these regulatory regions and showed that genes controlled by constrained regions are enriched for disease-associated genes and for essential genes from mouse knock-out screens.

The database also includes information from ENCODE TFBS and DNase peaks, ultra-conserved non-coding elements (UCNE), and super-enhancers (dbSuper).

This release includes 2 files:

  • RegulatoryRegions.db.gz: The full database in SQLite format.
  • GREEN-DB_v2_bedfiles.tar.gz: Contains 2 BED files describing the regulatory regions and associated information useful for variant annotation (controlled genes, closest gene, constraint metric).
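As a minimal sketch of how the SQLite file could be inspected after download, the Python below decompresses a gzip archive and lists the tables it contains. It makes no assumption about the GREEN-DB schema, which is not described in this record. The helper names `decompress_gzip` and `list_tables` are illustrative and not part of the release.

```python
import gzip
import shutil
import sqlite3


def decompress_gzip(src_path, dst_path):
    """Decompress a .gz file to dst_path, streaming so large files fit in memory."""
    with gzip.open(src_path, "rb") as src, open(dst_path, "wb") as dst:
        shutil.copyfileobj(src, dst)


def list_tables(db_path):
    """Return the names of all tables in an SQLite database, sorted by name."""
    con = sqlite3.connect(db_path)
    try:
        # sqlite_master is SQLite's built-in catalogue of schema objects.
        rows = con.execute(
            "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
        ).fetchall()
    finally:
        con.close()
    return [name for (name,) in rows]
```

Assuming the archive was saved in the working directory, `decompress_gzip("RegulatoryRegions.db.gz", "RegulatoryRegions.db")` followed by `list_tables("RegulatoryRegions.db")` would show which tables are available before any queries are written.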

To annotate a VCF file with information from GREEN-DB, you can use our GREEN-VARAN tools (https://github.com/edg1983/GREEN-VARAN).

For more information on GREEN-DB, please refer to our publication (https://doi.org/10.1101/2020.09.17.301960) and to the online documentation (https://green-varan.readthedocs.io/en/latest/).

GREEN-DB is free to use for academic users; please refer to the attached LICENSE file.

Files

LICENSE.pdf

Files (7.0 GB)

  • md5:af16ac7b7284845a630366fe9a165a7c (604.7 MB)
  • md5:a0c42f280c3ad826a916ee8b86a248c6 (6.4 GB)
  • md5:9e3ec86b01cca0759f05da79676ce931 (69.4 kB)
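The MD5 values listed above can be used to check that a download completed without corruption. The following is a minimal sketch in Python: it hashes a file in chunks, so the multi-gigabyte archive does not have to fit in memory, and compares the result with a value written in the `md5:<hex>` form used in this listing. The function names are illustrative. The pairing of a checksum with a local file path is left to the caller.

```python
import hashlib


def md5_of_file(path, chunk_size=1024 * 1024):
    """Return the hexadecimal MD5 digest of a file, read in fixed-size chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        # iter() with a sentinel stops when read() returns an empty bytes object.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_md5(path, listed):
    """Compare a file's MD5 with a value given as 'md5:<hex>' or as plain hex."""
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

A call such as `matches_listed_md5(local_path, "md5:af16ac7b7284845a630366fe9a165a7c")` returns `True` only when the local copy is byte-for-byte identical to the published file with that checksum.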

Additional details

Related works

Is documented by
Preprint: 10.1101/2020.09.17.301960 (DOI)