Single cell copy number and spatial transcriptomics of ovarian cancer specimens
Description
This data is supplementary to the report "Integrated single-cell whole genome sequencing and spatial transcriptomics reveal intra-tumoral heterogeneity in ovarian cancer" by Bassiouni et al.
Background
Ovarian tumors often comprise clonal mixtures arising from chromosomal instability. To characterize intra-tumoral heterogeneity, we performed single cell whole genome sequencing (scWGS) of five ovarian cencer specimens using the 10x Genomics Chromium Single Cell DNA Reagent Kit. For four of these, we also performed spatial transcriptomics using the 10x Genomics Visium FFPE Spatial Gene Expression Kit. Sample OV773 was not subject to spatial transcriptomics due to sample scarcity.
Archived here are the processed data files (ie, 10x Genomics pipeline outputs) for these experiments. Further details on the files includes are below.
Single cell copy number
Libraries comprising individually-barcoded and amplified single cell DNA were sequenced on an Illumina NovaSeq 6000 instrument. Sequencing data was then processed with the 10x Genomics Cell Ranger DNA pipeline (v. 1.0.0) to generate processed single cell copy number data and associated metrics. Archived here are the following pipeline output files for each sample:
- cnv_data.h5
- node_cnv_calls.bed
- mappable_regions.bed
- per_cell_summary_metrics.csv
Detailed information about these outputs can be found on the 10x Genomics website: https://support.10xgenomics.com/single-cell-dna/software/pipelines/latest/output/overview
Corresponding scWGS BAM files are archived with controlled access at the European Genome-Phenome Archive (EGA) at accession EGAD50000002102. Access to the dataset is controlled by the Data Access Committee EGAC50000000834 and governed by policy EGAP50000000783.
Also archived here are the following R scripts for analysing and plotting the single cell copy number data:
- scDNA_cleanup_and_clustering.R [for cleaning up the data, clustering, and plotting heatmaps]
- cluster_lineplots.R [for creating cluster-average copy number line plots]
- BAF_with_ideogram.R [for binning and plotting B allele frequencies per cluster]
For convenience, we also include the following Python script for splitting single-cell copy number BAM files, which are archived in the European-Genome Phenome Archive under controlled access.
- template-single-cell_splitBam.py [for splitting the BAM file by cell barcodes]
Spatial transcriptomics
DNA libraries representing spatially barcoded, probe-captured RNA molecules were sequenced on an Illumina NextSeq 2000 instrument. Sequencing data was processed with the 10x Genomics Space Ranger pipelines (v. 3.0.1) to generate processed spatially-resolved gene expression data. Archived here are the following pipeline output files for each sample:
- cloupe.cloupe
- filtered_feature_bc_matrix.h5
- scalefactors_json.json
- tissue_hires_image.png
- tissue_positions.csv
Detailed information about these files can be found on the 10x Genomics website: https://www.10xgenomics.com/support/software/space-ranger/latest/analysis/outputs/output-overview.
Corresponding spatial transcriptomics BAM files are archived in the European Nucleotide Archive with project accession number PRJEB102074.
Files
OV150_scCN_data-files.zip
Files
(14.3 GB)
| Name | Size | Download all |
|---|---|---|
|
md5:99af7724be597283baacabdb6c06ef5e
|
1.1 GB | Preview Download |
|
md5:fadd49eae64fc4d67049024ad77dd74d
|
1.5 GB | Preview Download |
|
md5:fd9058833ba3f4f3d89b1e8bbb953782
|
915.0 MB | Preview Download |
|
md5:bdf6f54d7d6c2028cc8d08667fc5175e
|
3.3 GB | Preview Download |
|
md5:634ad2bd42f734aa0a1d8839d629aabf
|
1000.0 MB | Preview Download |
|
md5:6e86d94c18111a0a14b3a35ef2c5bea0
|
3.1 GB | Preview Download |
|
md5:27596f2a0da0eb96d841b783c34ad714
|
1.2 GB | Preview Download |
|
md5:06884d0e7fc8419c9bb9ca471e7c592c
|
1.8 GB | Preview Download |
|
md5:9eb82be4e36e3724772d3aee9e3af4e2
|
472.3 MB | Preview Download |
|
md5:48a5d2b275062b7c069cf9caa1268171
|
15.7 kB | Preview Download |