Dataset Open Access
Moreno, Victor; Diez-Obrero, Virginia; Diaz-Villanueva, Anna; Sanz-Pamplona, Rebeca
We provide significant SNP prediction models derived from the COLONOMICS data (https://www.colonomics.org). Genotypes were obtained with the Affymetrix 6.0 array and imputed to the TOPMed panel. Gene expression was obtained with the Affymetrix U219 array, DNA methylation with the Illumina 450K array, and miRNA expression by NGS. We provide SNP prediction models for 1,758 genes, 30,530 CpG probes and 38 miRNAs obtained from normal colon biopsy samples. These features can be predicted from SNPs located within ±1 Mb, which we assume act through cis mechanisms. We include each model's summary statistics and the corresponding SNP weights in SQLite objects. Models were trained using the elastic net procedure employed in the PredictDB pipeline (https://predictdb.org), according to which only models with a predictive performance p-value < 0.05 and R² > 0.1 are considered significant. We adjusted the models for basic covariates, i.e., sex, age, tissue type and the anatomical location in the colon where biopsies were collected (left or right colon). Genome coordinates refer to GRCh37/hg19.
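As a rough illustration of how the SNP weights in one of the SQLite objects might be used, the sketch below computes a predicted feature value as the weighted sum of effect-allele dosages. It assumes a PredictDB-style layout, i.e. a table named `weights` with columns `gene`, `rsid`, `ref_allele`, `eff_allele` and `weight`; these names are an assumption and should be checked against the actual files. The feature IDs, rsIDs, weights and dosages in `demo` are invented toy values, not data from this record.

```python
import sqlite3


def load_weights(conn, feature_id):
    """Return {rsid: (eff_allele, weight)} for one feature (gene, CpG or miRNA).

    Assumes a PredictDB-style `weights` table; verify the schema of the real files.
    """
    rows = conn.execute(
        "SELECT rsid, eff_allele, weight FROM weights WHERE gene = ?",
        (feature_id,),
    ).fetchall()
    return {rsid: (allele, weight) for rsid, allele, weight in rows}


def predict(weights, dosages):
    """Weighted sum of effect-allele dosages (0 to 2) for one sample.

    SNPs absent from `dosages` are skipped.
    """
    return sum(
        weight * dosages[rsid]
        for rsid, (_, weight) in weights.items()
        if rsid in dosages
    )


def demo():
    # Toy in-memory database standing in for one of the SQLite objects.
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE weights "
        "(gene TEXT, rsid TEXT, ref_allele TEXT, eff_allele TEXT, weight REAL)"
    )
    conn.executemany(
        "INSERT INTO weights VALUES (?, ?, ?, ?, ?)",
        [
            ("FEATURE_A", "rs1", "A", "G", 0.5),
            ("FEATURE_A", "rs2", "C", "T", -0.25),
            ("FEATURE_B", "rs3", "G", "A", 0.1),
        ],
    )
    weights = load_weights(conn, "FEATURE_A")
    dosages = {"rs1": 2.0, "rs2": 1.0}  # hypothetical dosages for one sample
    conn.close()
    return predict(weights, dosages)


if __name__ == "__main__":
    print(demo())
```

With real data, dosages must count the same effect allele as the model and use GRCh37/hg19 coordinates, so allele and strand matching is needed before applying the weights.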
| |All versions|This version|
|Data volume|589.1 MB|589.1 MB|