Dataset Open Access

Transcriptional decomposition reveals active chromatin architectures and cell specific regulatory interactions

Rennie, Sarah; Dalby, Maria; van Duin, Lucas; Andersson, Robin


Resource data across the 76 cell types from FANTOM5 analysed in our paper titled "Transcriptional decomposition reveals active chromatin architectures and cell specific regulatory interactions".

Project abstract

Gene transcription is influenced by favourable chromosome positioning and chromatin architectures bringing regulatory elements in close proximity. However, it is unclear to what extent transcription is attributable to topological organisation or to gene-specific regulatory programs. Here, we develop a strategy to transcriptionally decompose expression data into two main components reflecting the positional relationship of neighbouring transcriptional units and effects independent from their positioning. 

We demonstrate that the positionally dependent component is highly informative of topological domain activity and organisation, revealing boundaries and chromatin compartments. Furthermore, features derived from transcriptional components can accurately predict individual chromatin interactions. We systematically investigate regulatory interactions and observe different transcriptional attributes governing long- and short-range interactions. Finally, we assess differences in regulatory organisations across 76 human cell types. In all, we demonstrate a close relationship between transcription and topological chromatin architecture and provide an unprecedented resource for investigations of regulatory organisations across cell types.

Included files

PD_component_76_cell_types.tar.gz - Contains the positionally dependent (PD) components for 76 human cell types.

PD_sd_component_76_cell_types.tar.gz - Contains the standard deviations of the positionally dependent (PD_sd) components for 76 human cell types.

PI_component_76_cell_types.tar.gz - Contains the positionally independent (PI) components for 76 human cell types.

PI_sd_component_76_cell_types.tar.gz - Contains the standard deviations  of the positionally independent (PI_sd) components for 76 human cell types.

predicted_EP_interactions_76_cell_types.tar.gz - Contains predicted enhancer-promoter interactions for 76 human cell types.

predicted_boundaries_76CT.txt - Matrix with 76 columns representing human cell types and 10kb genome-wide bins as rows, coded as 0 or 1 according to whether a XAD boundary was predicted in the bin for a given cell type.

Files (1.7 GB)
Name Size
PD_component_76_cell_types.tar.gz md5:b5312a5b40bfb53c634fff0aa4992bda 435.3 MB Download
PD_sd_component_76_cell_types.tar.gz md5:63a5879768284cf819c957a1cd3b18fc 408.0 MB Download
PI_component_76_cell_types.tar.gz md5:a9e42f33b072d33af60af93156ecf874 452.1 MB Download
PI_component_sd_76_cell_types.tar.gz md5:c362c9c04935a58a860b36bad285e663 331.4 MB Download
predicted_boundaries_76CT.txt md5:6985254210fe9833036cffdcaeb732aa 53.6 MB Download
predicted_EP_interactions_76_cell_types.tar.gz md5:1f1e43a00632a5d34952eac118adf515 20.1 MB Download


Cite as