There is a newer version of the record available.

Published June 27, 2024 | Version v12
Dataset Open

ecDNA machine learning modeling

Authors/Creators

  • 1. SYSUCC

Description

1. Today (2024-06-27), we discovered an issue with the labeling of sample groups in one of the supplementary figures (Supplementary Figure 14c) in our published article. We have corrected the figure and present it here, and we extend our apologies to all readers for any confusion this may have caused (although no report received).

2. The source data of supplementary figure 13 in the accompanying article table has been found to have issues, which were identified as a result of improper Excel operation. Here, we have uploaded the correct data table

--------------------------------------------------

 

1. ecDNA_cargo_gene_modeling_data.csv.gz

The dataset contains features from 386 TCGA tumors for modeling ecDNA cargo gene prediction. It was converted from R data format with the following code. NOTE: columns 'sample' and 'gene_id' are not used for actual modeling but for identifying, and sampling purposes.

library(data.table)

data = readRDS("~/../Downloads/ecDNA_cargo_gene_modeling_data.rds")

colnames(data)[3] = "total_cn"

data.table::fwrite(data, file = "~/../Downloads/ecDNA_cargo_gene_modeling_data.csv.gz", sep = ",")

 

2. gcap_pcawg_WGS_result.tar.gz

GCAP analysis results for PCAWG allele-specific copy number profiles derived from WGS.

 

3. gcap_tcga_snp6_result.tar.gz

GCAP analysis results for TCGA allele-specific copy number profiles derived from SNP6 array.

 

4. gcap_Changkang_WES_result.tar.gz

GCAP analysis results for SYSUCC Changkang allele-specific copy number profiles derived from tumor-normal paired WES.

 

5. tcga_overlap_gene_wgs.rds, tcga_overlap_gene_snp.rds and tcga_overlap_gene_wes.rds

These datasets contain TCGA gene-level copy number results in R data format from overlapping samples (dataset above). WGS from PCAWG, SNP array, and WES from GDC portal.

 

6. cellline-batch1.zip & cellline-batch1.zip

 

GCAP results of cell line batch 1 and batch 2.

 

7. AA_cellline_wgs.zip

AA software results for cell line batch 1.

 

8. Batch2_AA_summary.xlsx

AA software results for cell line batch 2.

 

9. FISH-for-supp-file.zip

Extended raw FISH images from 12 CRC samples.

 

10. SNU216.zip

Extended AA and GCAP analysis on SNU216.

 

11. aa_ffpe.zip and AA_summary_table_of_6_erbb2_ffpe_samples.xlsx

Extended AA running files (all results) and result summary data for 6 GCAP predicted ERBB2 amp clinical samples.

 

12. source data of fig.4

 

13. source data of supp fig.2 subplots

 

13. source data of supp fig.15

 

14. GCAP result data objects for three ICB cohorts. Both gene-level and sample-level data included.

 

15. PDX-P68: processed (AA and CNV) data of P68 from WGS and WES data.

 

16. source data of supp fig.13

 

17. updated supplementary figure 14

Files

AA_cellline_wgs.zip

Files (5.9 GB)

Name Size Download all
md5:269c2038c3bb2639ef05e1265857c479
71.0 MB Preview Download
md5:b0968f6ef55f121039af34981d7cbc6d
17.7 MB Preview Download
md5:da49912400f7679a5d320509f948390a
12.0 kB Download
md5:ed17bb64c60476db3b10b3cc19d61107
53.0 kB Download
md5:02fc5af26802bcd1e6ef79254d87c06a
2.7 kB Preview Download
md5:cfbbd32bf5747a73ebe7efc0298a2870
3.5 kB Preview Download
md5:34d4c0a74f234e66f79bf9db3a28d805
60.0 MB Download
md5:e63fe6ebd4056628ebfc8e9a3747d7ad
74.0 MB Preview Download
md5:0d5c5cc83ff5406d1c7af5ce32bcd22f
213.3 MB Download
md5:67719caa3057a0165aee72e7139f51ed
613.5 MB Download
md5:4bedfba907336dadd6869feac26463cf
2.0 GB Download
md5:541fb3a1871133c64472a94b3945fcc6
428.8 kB Download
md5:25c0fb1faebd77aeebf866267e222d86
80.8 kB Download
md5:b3a6ca42cebb7247e49ada317fb48999
59.6 kB Download
md5:5949d83230ca9db24f822c4c3379a61d
18.7 MB Preview Download
md5:57324054b0fcd715cffd3620215358cc
788.8 kB Preview Download
md5:54ad3d9a9461fedeb09cb27708103c72
147.8 kB Preview Download
md5:1bfdcd7b319bd91d2783f8becb85a7c9
16.7 MB Preview Download
md5:e6385491527f7bbe3772c9362288d169
27.4 MB Preview Download
md5:8166e8fea498294ea390492c6e228300
17.3 MB Preview Download
md5:7dd4c51758a0a2a047fd084309205262
64.5 MB Download
md5:c43563cdd7188de8520cfb63dbdc9810
882.6 kB Preview Download
md5:895607f1a26b018761e7e99aa40873b1
1.5 GB Download
md5:59ba4ad3d759e8548b5c8b8755583c8f
436.9 MB Download
md5:fb0e114dbee84a89044b0c35f17e5f34
729.4 MB Download

Additional details

Additional titles

Alternative title
The source data of supplementary figure 13 in the accompanying article table has been found to have issues, which were identified as a result of improper Excel operation. Here, we have uploaded the correct data table