Published October 27, 2020 | Version _v.01
Dataset Open

Data for: Machine learning identifies robust matrisome markers and regulatory mechanisms in cancer

Description

The expression and regulation of matrisome genes - the ensemble of extracellular matrix, ECM, ECM-associated proteins and regulators as well as cytokines, chemokines and growth factors - is of paramount importance for the many biological processes and signals within the tumor microenvironment. The availability of large and diverse multi-omics data enables mapping and understanding the regulatory circuitry governing the tumor matrisome to an unprecedented level, though such a volume of information requires robust approaches to data analysis and integration. In this study, we show that combining Pan-Cancer expression data from The Cancer Genome Atlas (TCGA) with genomics, epigenomics and microenvironmental features from TCGA and other sources enables the identification of “landmark” matrisome genes and machine learning-based reconstruction of their regulatory networks in 74 clinical and molecular subtypes of human cancers and approx. 6700 patients. These results, enriched for prognostic genes and cross-validated markers at the protein level, unravel the role of genetic and epigenetic programs in governing the tumor matrisome and allow the prioritization of tumor-specific matrisome genes (and their regulators) for the development of novel therapeutic approaches.

Notes

This study was supported by the Academy of Finland grant 329742

Files

Files (73.0 MB)

Name Size Download all
md5:8b5a1265285c38119698fca965834b34
51.7 MB Download
md5:604540b38aa61b1dec1773b26517f707
21.3 MB Download