Published April 25, 2022 | Version 0.1.3
Software Open

EagleC: A deep-learning framework for detecting a full range of structural variations from bulk and single-cell contact maps

  • 1. Department of Biochemistry and Molecular Genetics, Feinberg School of Medicine Northwestern University, Chicago, Illinois, USA.

Description

Hi-C technique has been shown to be a promising method to detect structural variations (SVs) in human genomes. However, algorithms that can use Hi-C data for a full-range SV detection have been severely lacking. Current methods can only identify inter-chromosomal translocations and long-range intra-chromosomal SVs (>1Mb) at less-than-optimal resolution. Therefore, we develop EagleC, a framework that combines deep-learning and ensemble-learning strategies to predict a full-range of SVs at high-resolution. Importantly, we show that EagleC can uniquely capture a set of fusion genes that are missed by WGS or nanopore. Furthermore, EagleC also effectively captures SVs in other chromatin interaction platforms, such as HiChIP, ChIA-PET, and capture Hi-C. We apply EagleC in over 100 cancer cell lines and primary tumors, and identify a valuable set of high-quality SVs. Finally, we demonstrate that EagleC can be applied to single-cell Hi-C and used to study the SV heterogeneity in primary tumors.

Files

Files (3.8 MB)

Name Size Download all
md5:49e6c846e3d85f8d2e1b2c72b849e78b
3.1 MB Download
md5:274e05bb0ecf1e9aabdacb5e59a0b487
666.3 kB Download
md5:fb787b44244fc69b4360507697cf2a05
1.4 kB Download
md5:95acf3f174a1d1d293916b05714a592c
21.3 kB Download