Published July 19, 2023 | Version v5
Dataset Open

brca_tcga_pyg_dataset

  • 1. National Resource For Network Biology
  • 2. Harvard Medical School.
  • 3. University of Minnesota

Description

This is a dataset that was generated by integrating the breast cancer (BRCA TCGA) dataset from the cBioPortal (cbioportal.org) and a biological network for node connections from Pathway Commons (www.pathwaycommons.org).

Data was preprocessed to form one dataset that could be converted to PyTorch Geometric data objects.

This data was retrieved in the CSV format, then processed to form a graph-based dataset for use with Graph Neural Networks (GNN).

The dataset contains the gene features of each patient and the overall survival time (in months) of each patient, which are the labels.

Files

brca_tcga.zip

Files (265.0 MB)

Name Size Download all
md5:a9df9ada5a52908671fe59a66f046df2
29.3 MB Preview Download
md5:a9784229aa891567613d38af85e23ae5
337.2 kB Preview Download
md5:1978fce78144be72a51671871c93be30
200.4 kB Preview Download
md5:6d13aa0e8edea993ada8da33420d68be
154.7 MB Preview Download
md5:9e1244c018334050d846afa2deeaeddb
4.3 MB Download
md5:a0b8a3bd40eca9565da0981fe07a38e7
1.2 MB Download
md5:7f8611c01cd4423038f0f778bde2871b
15.1 MB Preview Download
md5:9a73847dc1223a65a00f75bb18963851
44.8 MB Preview Download
md5:371b4d60007ead6449eefcafcf9e92cd
15.0 MB Preview Download
md5:edd9ef6e5865b6ab0356daafadb9dbdf
5.4 kB Preview Download
md5:0dae74a0b6946e892e8859997e29c204
16.2 kB Preview Download
md5:0e2eaea9102a26392c610a53864a9a46
5.4 kB Preview Download