Published June 30, 2020 | Version 0.1
Dataset Open

CCLid: A toolkit to authenticate the genotype and stability of cancer cell lines

  • 1. University of Toronto

Description

CCLid (Cancer Cell Line identification) is designed as a toolkit to address the lack of a publicly available resource for genotype-based cell line authentication. We developed this resource to allow for genotype-matching of any given cancer cell line to the 1,204 unique cell lines found in the CCLE dataset, with support to include additional SNP array datasets. Using the B-allele frequencies (BAFs) for all SNPs found in common between the input data and reference datasets, this tool will allow for a genotype matching operation that trains and uses a logistic model to calculate the probability of the best  cell line matches. This is followed by a measure of genetic drift between isogenic lines by look for segments of the genome that have significantly different BAF values.

 

This zenodo dataset contains the (sample x probeset) BAF matrix for the CCLE dataset, as well as supporting datasets to allow mapping of SNP probesets and genotype correction between SNP array technologies (i.e. Affymetrix SNP 6.0 and Illumina HumanOmni 2.5M). This also contains all the metadata for cell line identities in CCLE, GDSC, and gCSI as well their corresponding cellosaurus unique identifies.

Files

Files (5.0 GB)

Name Size Download all
md5:083d42aa191ee8ada3f5475c04d61e7f
6.9 MB Download
md5:60b047842ca35c7283edf3c886fadf12
6.0 MB Download
md5:e37ea6f05b4aa84ba36adc6243eb8f41
4.5 MB Download
md5:395bcdb1302211d872660c89ebd01ccc
4.1 MB Download
md5:ec367a8a651b88f2d123527d0ac53444
6.5 MB Download
md5:e163a58d21f3e6530e705ff485f992c7
4.8 MB Download
md5:2367364ee6a90bb0b587ac1f2e939c37
4.2 MB Download
md5:2ce2239f292faef18898fe37b5aec29c
9.3 MB Download
md5:63cb9746a5c608a887d71e5b039afc73
991 Bytes Download
md5:d6567b6f66ddb1bef3351e28a3e0c0de
17.3 kB Download
md5:e2f6f77bc9dd7c3b57e1f09f008b4932
1.8 MB Download
md5:457ff003845cdeca25044057c9e46778
36.9 kB Download
md5:571ad9f660cbe5b9bbf6bff149f5d83a
4.9 GB Download
md5:1f37be37599dbc28de20683daa616d87
67.5 kB Download
md5:83a1a5f729e4f6dba567345c08422bee
1.4 MB Download
md5:ec4476a2562a00518ae054b92bb58c06
33.2 MB Download

Additional details

Related works

Is source of
Journal article: 10.1038/nature11003 (DOI)