Published 2024 | Version v2
Dataset Open

Beyond benchmarking and towards predictive models of dataset-specific single-cell RNA-seq pipeline performance

  • 1. Lunenfeld-Tanenbaum Research Institute, Toronto, Canada
  • 2. Program in Bioinformatics and Computational Biology, University of Toronto, Toronto, Canada
  • 3. Vector Institute, Toronto, Canada
  • 4. Departments of Molecular Genetics, Statistical Sciences, Computer Science, University of Toronto, Toronto, Canada
  • 5. Ontario Institute for Cancer Research, Toronto Canada

Description

This repository includes:

  1. clusters.zip
    • .Once unzipped, the top-level directory is named clusters/.
      • Within this directory, there are 86 sub-directories, one for each of the scRNA-seq datasets used in the paper. These sub-directories are named with the corresponding EBI Single Cell Atlas IDs.
        • Each sub-directory contains CSV files containing clustering results for each of the pipelines run on that dataset.
  2. *_unscaled.csv 
    • CSV files containing raw performance metrics (CH, DB, SIL, GSEA) computed on each pipeline and dataset combination
  3. *CorrectedImputed.csv and gseaScaledImputed.csv
    • CSV files containing performance metrics corrected for the number of clusters and missing values imputed in the case of CH, SIL, and DB. For GSEA, only scaling and imputation was performed.
  4. pipelineParams.csv and datasetFeatures.csv
    • CSV files containing parameters used for each of the pipelines run, and dataset summary statistics for each of the scRNA-seq datasets.

Files with the prefix "large_samples" correspond to data associated with the 6 scRNA-seq datasets containing >100k cells. The large_samples_clusters.zip file contains the same structure as cluster.zip, but with 6 sub-directories. 

 

Files

ch_unscaled.csv

Files (3.8 GB)

Name Size Download all
md5:200f97f26b69688fdae885e7f00bba68
464.7 kB Preview Download
md5:6a6fbbe465fb32b3ad85de663af01019
569.8 kB Preview Download
md5:b04c33517ac9ef33203fa3647626d465
2.8 GB Preview Download
md5:4c5a706c55cb0f8e5c55e9fefeb0e6f5
62.4 kB Preview Download
md5:5bc86f92bf32bd991adef84e6cee78a1
464.5 kB Preview Download
md5:b4aabd784ef09fe448546b4f633adcdb
561.5 kB Preview Download
md5:07a9bf70113c6a86eec3e9091c617e4d
486.3 kB Preview Download
md5:13b9e233f0d48e4fdd36d74c2bdeca68
553.7 kB Preview Download
md5:316949fbd1fd250060584a6754a342ec
98.2 kB Preview Download
md5:b3cf95a99992c8f62a9a107b2c7dab48
98.1 kB Preview Download
md5:d861321d389cd638dfaae54d731adbaa
101.9 kB Preview Download
md5:39170e097345174b9cd4987ff93c5abf
411.0 kB Preview Download
md5:ab7a3317ce335ac83c122ebea3b591c2
1.0 GB Preview Download
md5:1c8a9c765591a698a37acd4a94f45896
265.9 kB Preview Download
md5:3845a58464d4868bfbbdc49f1e030069
265.8 kB Preview Download
md5:e96386afaa92ce7be39b61ae262d8702
268.1 kB Preview Download
md5:3d5fe008e2e0b4953f2e3efb692229c8
79.6 kB Preview Download
md5:c6b9ed79e93409becb58f1376dd9027b
518.2 kB Preview Download
md5:31823fa25f56cf6954a71535e010a747
563.6 kB Preview Download