Published March 7, 2022 | Version v1
Dataset Open

COLONOMICS - predictive models for normal colon gene expression and DNA methylation for TWAS and MWAS

  • 1. Catalan Institute of Oncology, IDIBELL, UB, CIBERESP
  • 2. IDIBELL, CIBERESP

Description

We provide significant SNP prediction models derived from the COLONOMICS data (https://www.colonomics.org). Genotypes were obtained by Affymetrix 6.0 array, imputed to TopMed panel. Gene expression was obtained from Affymetrix U219 array, DNA methylation was obtained with Illuminan 450K array and miRNA expression was obtained by NGS. We provide SNP prediction models for 1,758 genes, 30,530 CpG probes and 38 miRNAs obtained from colon normal biopsy samples. These features can be predicted from SNPs located within ±1Mb, which we assumed they act through cis mechanisms. We include the model’s summary statistics and corresponding SNP weights in SQLite objects. Models were trained using the elastic net procedure employed in the PredictDB pipeline (https://predictdb.org), according to which only models with a predictive performance p-value < 0.05 and R2 > 0.1 are considered significant. We adjusted the models by basic covariates, i.e., sex, age, tissue type and colon anatomic location where biopsies were collected (left and right colon). Genome coordinates refer to GRCh37/hg19.

Files

Files (261.9 MB)

Name Size Download all
md5:ed25b7a38f0a306c8ca9a3d0e103f312
11.2 MB Download
md5:02d5e2539610f730c66833d6fe9e635a
2.3 MB Download
md5:78f40d03eff4ba0dfb57505ad3c540a3
234.8 MB Download
md5:b34532c3b6b0a2a8b15876eda72bf02d
13.4 MB Download
md5:a961074fbf3d3381791146616b6d7c2c
116.2 kB Download
md5:378ca765f4b774c31a9763afdf976ccd
39.7 kB Download

Additional details

Related works

Is derived from
Dataset: 10.34810/data169 (DOI)
Is described by
Dataset: https://www.colonomics.org (URL)

Funding

HIPERDART – Development of High Performance Diagnostic Array Replication Technology 223378
European Commission