ecDNA machine learning modeling
Description
1. Today (2024-06-27), we discovered an issue with the labeling of sample groups in one of the supplementary figures (Supplementary Figure 14c) in our published article. We have corrected the figure and present it here, and we extend our apologies to all readers for any confusion this may have caused (although no report received).
2. The source data of supplementary figure 13 in the accompanying article table has been found to have issues, which were identified as a result of improper Excel operation. Here, we have uploaded the correct data table
--------------------------------------------------
1. ecDNA_cargo_gene_modeling_data.csv.gz
The dataset contains features from 386 TCGA tumors for modeling ecDNA cargo gene prediction. It was converted from R data format with the following code. NOTE: columns 'sample' and 'gene_id' are not used for actual modeling but for identifying, and sampling purposes.
library(data.table)
data = readRDS("~/../Downloads/ecDNA_cargo_gene_modeling_data.rds")
colnames(data)[3] = "total_cn"
data.table::fwrite(data, file = "~/../Downloads/ecDNA_cargo_gene_modeling_data.csv.gz", sep = ",")
2. gcap_pcawg_WGS_result.tar.gz
GCAP analysis results for PCAWG allele-specific copy number profiles derived from WGS.
3. gcap_tcga_snp6_result.tar.gz
GCAP analysis results for TCGA allele-specific copy number profiles derived from SNP6 array.
4. gcap_Changkang_WES_result.tar.gz
GCAP analysis results for SYSUCC Changkang allele-specific copy number profiles derived from tumor-normal paired WES.
5. tcga_overlap_gene_wgs.rds, tcga_overlap_gene_snp.rds and tcga_overlap_gene_wes.rds
These datasets contain TCGA gene-level copy number results in R data format from overlapping samples (dataset above). WGS from PCAWG, SNP array, and WES from GDC portal.
6. cellline-batch1.zip & cellline-batch1.zip
GCAP results of cell line batch 1 and batch 2.
7. AA_cellline_wgs.zip
AA software results for cell line batch 1.
8. Batch2_AA_summary.xlsx
AA software results for cell line batch 2.
9. FISH-for-supp-file.zip
Extended raw FISH images from 12 CRC samples.
10. SNU216.zip
Extended AA and GCAP analysis on SNU216.
11. aa_ffpe.zip and AA_summary_table_of_6_erbb2_ffpe_samples.xlsx
Extended AA running files (all results) and result summary data for 6 GCAP predicted ERBB2 amp clinical samples.
12. source data of fig.4
13. source data of supp fig.2 subplots
13. source data of supp fig.15
14. GCAP result data objects for three ICB cohorts. Both gene-level and sample-level data included.
15. PDX-P68: processed (AA and CNV) data of P68 from WGS and WES data.
16. source data of supp fig.13
17. updated supplementary figure 14
Files
AA_cellline_wgs.zip
Files
(5.9 GB)
| Name | Size | Download all |
|---|---|---|
|
md5:269c2038c3bb2639ef05e1265857c479
|
71.0 MB | Preview Download |
|
md5:b0968f6ef55f121039af34981d7cbc6d
|
17.7 MB | Preview Download |
|
md5:da49912400f7679a5d320509f948390a
|
12.0 kB | Download |
|
md5:ed17bb64c60476db3b10b3cc19d61107
|
53.0 kB | Download |
|
md5:02fc5af26802bcd1e6ef79254d87c06a
|
2.7 kB | Preview Download |
|
md5:cfbbd32bf5747a73ebe7efc0298a2870
|
3.5 kB | Preview Download |
|
md5:34d4c0a74f234e66f79bf9db3a28d805
|
60.0 MB | Download |
|
md5:e63fe6ebd4056628ebfc8e9a3747d7ad
|
74.0 MB | Preview Download |
|
md5:0d5c5cc83ff5406d1c7af5ce32bcd22f
|
213.3 MB | Download |
|
md5:67719caa3057a0165aee72e7139f51ed
|
613.5 MB | Download |
|
md5:4bedfba907336dadd6869feac26463cf
|
2.0 GB | Download |
|
md5:541fb3a1871133c64472a94b3945fcc6
|
428.8 kB | Download |
|
md5:25c0fb1faebd77aeebf866267e222d86
|
80.8 kB | Download |
|
md5:b3a6ca42cebb7247e49ada317fb48999
|
59.6 kB | Download |
|
md5:5949d83230ca9db24f822c4c3379a61d
|
18.7 MB | Preview Download |
|
md5:57324054b0fcd715cffd3620215358cc
|
788.8 kB | Preview Download |
|
md5:54ad3d9a9461fedeb09cb27708103c72
|
147.8 kB | Preview Download |
|
md5:1bfdcd7b319bd91d2783f8becb85a7c9
|
16.7 MB | Preview Download |
|
md5:e6385491527f7bbe3772c9362288d169
|
27.4 MB | Preview Download |
|
md5:8166e8fea498294ea390492c6e228300
|
17.3 MB | Preview Download |
|
md5:7dd4c51758a0a2a047fd084309205262
|
64.5 MB | Download |
|
md5:c43563cdd7188de8520cfb63dbdc9810
|
882.6 kB | Preview Download |
|
md5:895607f1a26b018761e7e99aa40873b1
|
1.5 GB | Download |
|
md5:59ba4ad3d759e8548b5c8b8755583c8f
|
436.9 MB | Download |
|
md5:fb0e114dbee84a89044b0c35f17e5f34
|
729.4 MB | Download |
Additional details
Additional titles
- Alternative title
- The source data of supplementary figure 13 in the accompanying article table has been found to have issues, which were identified as a result of improper Excel operation. Here, we have uploaded the correct data table