Published September 9, 2016 | Version v1
Dataset Open

R data set: The Cancer Genome Atlas Gene Expression data

  • 1. National Institute for Genomic Medicine, Mexico

Description

This compound data set comprises the following information from The Cancer Genome Atlas (TCGA):

  • RNA-Seq counts for 60483 genes across 11093 samples
  • HuEx 1.0 ST gene expression data for 18632 genes across 1211 samples
  • clinical indicators for 11160 patients

All gene expression data are annotated with ENSEMBL identifiers, ENTREZ identifiers and gene symbols. Samples are identified by their TCGA barcodes.
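Because samples are labelled with TCGA barcodes while the clinical indicators are per patient, it can help to split a barcode into its parts. The following is a minimal sketch, not part of the data set: `parse_tcga_barcode` is a hypothetical helper, and it assumes the standard barcode layout (project, tissue source site, participant, sample and vial, ...), in which the first 12 characters identify the patient and sample type codes 01 to 09 denote tumor samples. Whether this data set stores full or truncated barcodes is not stated here.

```r
# Hypothetical helper: derive patient ID and sample type from TCGA barcodes.
# Assumes the standard layout, e.g. "TCGA-02-0001-01C-01D-0182-01".
parse_tcga_barcode <- function(barcode) {
  parts <- strsplit(barcode, "-", fixed = TRUE)
  # Extract the i-th dash-separated field, or NA if the barcode is too short
  field <- function(i) {
    vapply(parts,
           function(p) if (length(p) >= i) p[i] else NA_character_,
           character(1))
  }
  # Fourth field is sample type (two digits) plus vial letter, e.g. "01C"
  sample_code <- substr(field(4), 1, 2)
  code_num <- suppressWarnings(as.integer(sample_code))
  data.frame(
    barcode          = barcode,
    patient          = substr(barcode, 1, 12),  # project-TSS-participant
    sample_type_code = sample_code,
    is_tumor         = !is.na(code_num) & code_num >= 1 & code_num <= 9,
    stringsAsFactors = FALSE
  )
}

parse_tcga_barcode(c("TCGA-02-0001-01C-01D-0182-01", "TCGA-A1-A0SB-11A"))
```

The `patient` column could then serve as a key for matching samples against patient-level clinical records, assuming those records are keyed by the 12-character patient barcode.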

To read the data set into R (requires 6 GB of RAM) use:

tcga <- readRDS("tcga.rds")
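Since the file is large, it may be worth checking the download against the MD5 checksum listed under Files before loading it. A minimal sketch follows; `read_verified_rds` is a hypothetical helper, and the file is assumed to be saved as `tcga.rds` in the working directory. The internal layout of the loaded object is not described here, so the sketch only prints a top-level overview.

```r
# Hypothetical helper: verify the MD5 checksum, then load the RDS file.
read_verified_rds <- function(path, expected_md5) {
  actual_md5 <- unname(tools::md5sum(path))  # NA if the file does not exist
  if (is.na(actual_md5)) {
    stop("File not found: ", path)
  }
  if (!identical(tolower(actual_md5), tolower(expected_md5))) {
    stop("MD5 mismatch: expected ", expected_md5, ", got ", actual_md5)
  }
  readRDS(path)
}

# Assumption: the download was saved as "tcga.rds" in the working directory.
if (file.exists("tcga.rds")) {
  tcga <- read_verified_rds("tcga.rds", "43b877265533481bd2171b821ef1d61b")
  str(tcga, max.level = 1)  # top-level structure only
}
```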

Files

Files (1.1 GB)

md5:43b877265533481bd2171b821ef1d61b (1.1 GB)