UPDATE: Zenodo migration postponed to Oct 13 from 06:00-08:00 UTC. Read the announcement.

Dataset Open Access

Pre-Processed Cancer Multi-Omic Data from TCGA and Synthetic Data

Zhandos Sembay

ABSTRACT 

It contains the data of four omic profiles (CNV, mRNA, miRNA, and protein) obtained for BRCA, LGG, and LUAD obtained from the TCGA project. 

In addition, we provide synthetic data for a mixture of isotropic distributions.

Instructions: 

Cancer data are identified by cancer type (LGG: low-grade glioma, BRCA: breast cancer, and LUAD: lung cancer). The data are scaled by using the minima and maxima of each column so that the values are between 0 and 1. In these files, the columns are the features and the rows correspond to the patients.

The summary data contains only the numerical values. The columns are the features and the rows are the observations.

Inspiration:

This dataset uploaded to U-BRITE for "AI against CANCER DATA SCIENCE HACKATHON"

https://cancer.ubrite.org/hackathon-2021/

Acknowledgements

Diego Salazar, June 20, 2021, "Pre-processed Cancer multi-omic data from TCGA and synthetic data", IEEE Dataport, doi: https://dx.doi.org/10.21227/pjb8-d090.

https://ieee-dataport.org/documents/pre-processed-cancer-multi-omic-data-tcga-and-synthetic-data

U-BRITE last update date: 07/21/2021

U-BRITE location: /data/project/ubrite/cancer-hackathon/org/ieee-dataport/pre-processed-cancer-multi-omic-data-tcga-and-synthetic-data
Files (35.8 MB)
Name Size
DastasetFiles.zip
md5:c1a73e54862a2657482af85e357e432f
35.8 MB Download
147
15
views
downloads
All versions This version
Views 147147
Downloads 1515
Data volume 536.8 MB536.8 MB
Unique views 135135
Unique downloads 1515

Share

Cite as