Dataset Open Access

Pre-Processed Cancer Multi-Omic Data from TCGA and Synthetic Data

Zhandos Sembay

JSON-LD Export

{
  "description": "<p><strong>ABSTRACT&nbsp;</strong></p>\n\n<p>This dataset contains four omic profiles (CNV, mRNA, miRNA, and protein) for BRCA, LGG, and LUAD, obtained from the TCGA project.&nbsp;</p>\n\n<p>In addition, we provide synthetic data for a mixture of isotropic distributions.</p>\n\n<p><strong>Instructions:&nbsp;</strong></p>\n\n<p>Cancer data are identified by cancer type (LGG: low-grade glioma, BRCA: breast cancer, and LUAD: lung adenocarcinoma). Each column is scaled using its minimum and maximum so that its values lie between 0 and 1. In these files, the columns are the features and the rows correspond to the patients.</p>\n\n<p>The summary data contains only the numerical values. The columns are the features and the rows are the observations.</p>\n\n<p><strong>Inspiration:</strong></p>\n\n<p>This dataset was uploaded to U-BRITE for the &quot;AI against CANCER DATA SCIENCE HACKATHON&quot;.</p>\n\n<p><strong>Acknowledgements</strong></p>\n\n<p>Diego Salazar, June 20, 2021, &quot;Pre-processed Cancer multi-omic data from TCGA and synthetic data&quot;, IEEE Dataport, doi:</p>\n\n<p><strong>U-BRITE last update date:</strong>&nbsp;07/21/2021</p>",
  "license": "",
  "creator": [
    {
      "@type": "Person",
      "name": "Zhandos Sembay"
    }
  ],
  "url": "",
  "datePublished": "2021-07-21",
  "keywords": [],
  "@context": "",
  "distribution": [
    {
      "contentUrl": "",
      "encodingFormat": "zip",
      "@type": "DataDownload"
    }
  ],
  "identifier": "",
  "@id": "",
  "@type": "Dataset",
  "name": "Pre-Processed Cancer Multi-Omic Data from TCGA and Synthetic Data"
}
                  All versions   This version
Views             150            150
Downloads         16             16
Data volume       572.5 MB       572.5 MB
Unique views      138            138
Unique downloads  16             16
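
The dataset description states that each column is scaled by its own minimum and maximum so that values fall between 0 and 1, with rows as patients and columns as features. A minimal sketch of that column-wise min-max scaling follows. The function name `min_max_scale`, the toy matrix, and the choice to map constant columns to 0 are assumptions made for illustration. This is not the pipeline used to produce the published files.

```python
def min_max_scale(rows):
    """Scale each column of a row-major table to the range [0, 1].

    rows: list of equal-length lists (one list per patient/observation).
    """
    columns = list(zip(*rows))          # transpose to iterate by feature
    lows = [min(col) for col in columns]
    highs = [max(col) for col in columns]
    scaled = []
    for row in rows:
        scaled.append([
            # Assumption: a constant column (max == min) is mapped to 0.0
            # to avoid dividing by zero.
            (value - lo) / (hi - lo) if hi > lo else 0.0
            for value, lo, hi in zip(row, lows, highs)
        ])
    return scaled


# Hypothetical toy matrix: 3 patients (rows) x 2 features (columns).
toy = [[1.0, 10.0], [2.0, 10.0], [3.0, 10.0]]
scaled = min_max_scale(toy)
print(scaled)
```

Because the published files are already scaled, a step like this would only be needed when combining them with new, unscaled measurements. In that case the minima and maxima would have to come from the same reference data, which this page does not provide.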


Cite as