Published August 2, 2024 | Version v3.0
Dataset Open

Robust estimation of cancer and immune cell-type proportions from bulk tumor ATAC-Seq data.

  • 1. Department of Oncology, Ludwig Institute for Cancer Research, University of Lausanne, Lausanne, Switzerland
  • 2. Department of Pathology and Immunology Faculty of Medicine, University of Geneva, Geneva, Switzerland

Description

Bulk ATAC-seq data of tumour samples result in an averaged signal across different cell-types (cancer, stromal, vascular and immune cells). We propose a deconvolution framework called EPIC-ATAC (https://doi.org/10.7554/eLife.94833.1), which relies on newly identified cell-type specific ATAC-Seq marker peaks and reference profiles for all major cancer-relevant cell-types to predict the proportions of each cell-type.

To evaluate EPIC-ATAC, we generated a bulk ATAC-Seq dataset from peripheral blood mononuclear cells (PBMCs) samples, from which the number of cells in each cell-type has been estimated using flow cytometry, as ground truth for cell proportions. The data provided in this Zenodo deposit correspond to:

- The raw counts matrix for each peak called in this ATAC-Seq dataset: PBMC_counts.txt

- The normalized (TPM-like) counts matrix for each peak called in this ATAC-Seq dataset: PBMC_counts_norm.txt

- The cell fractions of each cell type in each sample: PBMC_cell_fractions.txt

- The peaks called in each sample using MACS2 (*narrow.peaks): *_normalized.narrowPeak

- Bed files listing ATAC-Seq fragments for each sample: *.bed

We also evaluated EPIC-ATAC on multiple pseudobulks generated from single-cell ATAC-Seq data. We provide rds files containing the pseudobulks data used in our work for the evaluation of EPIC-ATAC. The rds files are located in the zip file "pseudobulks.zip".

The file "additional_data.zip" contains additional files used to generate the reference profiles in EPIC-ATAC and to reproduce the main analyses performed in the manuscript: https://doi.org/10.7554/eLife.94833.1. These files are required to run the code available on the following GitHub repository: GfellerLab/EPIC-ATAC_manuscript. 

Files

additional_data.zip

Files (11.1 GB)

Name Size Download all
md5:2f92a61788bca7f5c89842a0707f3c51
4.3 GB Preview Download
md5:281bf2792c81e29ec729348862ea3316
1.1 kB Preview Download
md5:30af03ce45b3bc5e983e3f9a161490f4
4.0 MB Preview Download
md5:8a8afdf7bc23568fd5060ee62cce693c
11.5 MB Preview Download
md5:4baf5f7c5a93b6e7427c6732354004bf
165.9 MB Preview Download
md5:1c4b474dedcce0c97c5f7e534c78ff79
757.5 MB Download
md5:88198cadd260ac43787c6a1338a2d489
11.6 MB Download
md5:49823566a093a7c6337e69fd32998f93
1.6 GB Download
md5:467876fc4d111f92e223085ca93eb197
12.7 MB Download
md5:6767791425b90515eafa3a174be6e55f
1.2 GB Download
md5:1f0bb8486acf1cb0b203101c09d01b46
17.9 MB Download
md5:fbee65cab098598c560aac39e24fcbea
1.6 GB Download
md5:cf4ef2e03918ad48e3f31b6e5e860a8c
14.4 MB Download
md5:ad2654f3d809cef8f5500da714c074a6
1.4 GB Download
md5:cd7e863fb1fab60157d947cf28abb9f0
17.2 MB Download