Prediction and functional interpretation of inter-chromosomal genome architecture from DNA sequence with TwinC
Authors/Creators
-
Jha, Anupama
(Contact person)1, 2
-
Hristov, Borislav
(Project member)2
- Wang, Xiao (Project member)2
- Wang, Sheng (Project member)2
-
Greenleaf, William
(Project member)3
-
Kundaje, Anshul
(Project member)3
- Lieberman Aiden, Erez S (Project member)4
-
Bertero, Alessandro
(Project leader)5
-
Noble, William Stafford
(Project leader)2
Description
Three-dimensional nuclear DNA architecture comprises well-studied intra-chromosomal (cis) folding and less characterized inter-chromosomal (trans) interfaces. Current predictive models of 3D genome folding can effectively infer pairwise cis-chromatin interactions from the primary DNA sequence but generally ignore trans contacts. There is an unmet need for robust models of trans-genome organization that provide insights into their underlying principles and functional
relevance. We present TwinC, an interpretable convolutional neural network model that reliably predicts trans contacts measurable through proximity ligation-dependent (in situ and intact Hi-C) and independent (DNA SPRITE) genome-wide chromatin conformation assays. TwinC uses a paired sequence design from replicate Hi-C experiments to learn single-base pair relevance in trans interactions across two stretches of DNA. The method achieves high predictive accuracy
(AUROC=0.80) on a cross-chromosomal test set from in situ and intact Hi-C experiments in heart tissue. Furthermore, we train TwinC using in situ Hi-C data from the widely used GM12878 cell line and validate its performance with orthogonal DNA SPRITE assay in the same cell type. Mechanistically, the neural network learns the importance of compartments, chromatin accessibility, clustered transcription factor binding and G-quadruplexes in forming trans contacts. In summary, TwinC models and interprets trans genome architecture, shedding light on this poorly understood aspect of gene regulation.
Files
Files
(9.1 GB)
| Name | Size | Download all |
|---|---|---|
|
md5:62b8ac74f17fa0151777f3b76f596867
|
3.3 GB | Download |
|
md5:e2d9ca985a12dd514b3ed67926f61ea5
|
5.6 GB | Download |
|
md5:977ef6446011486927046b813c451a8e
|
243.4 MB | Download |
Additional details
Software
- Repository URL
- https://github.com/Noble-Lab/twinc_paper
- Programming language
- Python
- Development Status
- Active