Published March 2, 2021 | Version v1
Dataset Open

Rcwl/RcwlPipelines workshop dataset for scRNA-seq pre-processing

Creators

  • 1. Roswell Park Comprehensive Cancer Center

Description

The scRNA-seq data source is the 1k PBMCs dataset from 10x Genomics (the source files are provided in the Zenodo data repository). The dataset used in this workshop is sub-sampled from the source files to contain only 15 cells instead of 1000. The data were curated for demonstration purposes only, so that the Rcwl scRNA-seq preprocessing tools or pipeline can be run in R within 1 to 2 minutes.

The "*.fastq" files were curated to include only reads on chromosome 21.

"subset15_demo_barcode.txt" contains the known cell barcodes used for mapping; only 15 barcodes are included.

"Homo_sapiens.GRCh37.75.21.gtf" is the hg19 GTF annotation file used to annotate reads, restricted to chromosome 21.

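The chromosome 21 restriction of the GTF file described above can be sketched in a few lines of Python. This is an illustration only, not the procedure the dataset authors used: the function name `filter_gtf_by_chromosome` is hypothetical, and the sketch assumes an Ensembl-style GTF in which the first tab-separated column holds the sequence name without a "chr" prefix (so chromosome 21 appears as `21`).

```python
from pathlib import Path


def filter_gtf_by_chromosome(src: Path, dst: Path, chromosome: str = "21") -> int:
    """Copy header comments and records on `chromosome` from src to dst.

    Hypothetical helper for illustration. Returns the number of
    annotation records (non-comment lines) written to dst.
    """
    kept = 0
    with open(src, "r", encoding="utf-8") as fin, \
            open(dst, "w", encoding="utf-8") as fout:
        for line in fin:
            # Header/comment lines start with '#'; keep them unchanged.
            if line.startswith("#"):
                fout.write(line)
                continue
            # Column 1 of a GTF record is the sequence (chromosome) name.
            seqname = line.split("\t", 1)[0]
            if seqname == chromosome:
                fout.write(line)
                kept += 1
    return kept
```

Called on a full-genome annotation file, the function writes only the chromosome 21 records (plus any header comments) to the output path and reports how many records it kept.
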
Files

Files (63.9 MB)

Checksum                              Size
md5:d1e854b1e90b1a1abe0d56aa84d06077  49.1 MB
md5:5fb6db0052e98a378ff6df92e1de155f  8.4 MB
md5:e6808012ee4b16939894d7f1a8191818  1.0 MB
md5:e1f4c24431ec6a231677c6447cfabacd  2.2 MB
md5:a902a1d3f9fca018ee94b67245db7f1f  1.0 MB
md5:8b8cdc67748760d1248c8718e83baf8e  2.2 MB
md5:45f0911858889d89f6f9780a0adefec6  255 Bytes
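
The md5 checksums listed above can be used to confirm that a download is intact. The following is a minimal sketch using Python's standard `hashlib` module; the helper names `md5_of_file` and `matches_listed_checksum` are hypothetical and are not part of Rcwl or Zenodo tooling.

```python
import hashlib
from pathlib import Path


def md5_of_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hexadecimal MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large files need not fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path: Path, listed: str) -> bool:
    """Compare a file against a checksum written as 'md5:<hex>' or '<hex>'."""
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

For example, `matches_listed_checksum(Path(downloaded_file), "md5:45f0911858889d89f6f9780a0adefec6")` would return `True` only if `downloaded_file` (a placeholder for a local path) has the same contents as the 255-byte file in the listing.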