MyND RNAseq and eQTL results

Navarro, Elisa; Raj, Towfique

doi:10.5281/zenodo.4715907

Published July 22, 2020 | Version v1

Dataset Open

MyND RNAseq and eQTL results

1. Icahn School of Medicine at Mount Sinai

This dataset is part of the manuscript: "Dysregulation of mitochondrial and proteo-lysosomal genes in Parkinson's disease myeloid cells", by Navarro E, Udine E, et al.

Description of files:

MyND_monocyte.cis_eqtl_nominal.txt.gz - Full nominal eQTL summary statistics (gzip-compressed)
MyND_monocyte.cis_eqtl_permuted.txt.gz - Full permuted eQTL summary statistics (gzip-compressed)
MyND_monocyte.cis_sqtl_nominal.txt.gz - Full nominal sQTL summary statistics (gzip-compressed)
MyND_monocyte.cis_sqtl_permuted.txt.gz - Full permuted sQTL summary statistics (gzip-compressed)
gencode.v30.primary_assembly.annotation.txt.gz - Gencode (v30) gene annotations used in the analysis (gzip-compressed)
monocyte_counts_matrix.txt.gz - RSEM counts from monocytes samples (230 samples) (gzip-compressed)
monocyte_tpms_matrix.txt.gz -RSEM TPMs from monocytes samples (230 samples) (gzip-compressed)
microglia_counts_matrix.txt.gz - RSEM counts from microglia samples (128 samples - 55 donors) (gzip-compressed)
microglia_tpms_matrix.txt.gz - RSEM TPMs from microglia samples (128 samples - 55 donors) (gzip-compressed)
processed_seurat_obj.RDS - Seurat R data object file containing single-cell RNA-seq results (14,827 features, 19,144 cells, 10 donors)

Table columns are formatted as follows:

Nominal eQTL results include all SNP-gene pairs tested (using a 1Mb window from each side of the transcription start site (TSS) of a gene). Table columns are formatted as follows:

"pheno_id" - The phenotype ID
"pheno_chr" - The chromosome ID of the phenotype
"pheno_start" - The start position of the phenotype
"pheno_end" - The end position of the phenotype
"pheno_strand" - The strand orientation of the phenotype
"num_var" - The total number of variants tested in cis
"distance" - The distance between the phenotype and the tested variant (accounting for strand orientation)
"snp_id" - The ID of the tested variant
"snp_chr" - The chromosome ID of the variant
"snp_start" - The start position of the variant
"snp_end" - The end position of the variant
"nominal_pval" - The nominal P-value of association between the variant and the phenotype
"slope" - The corresponding regression slope
"lead_snp" - A binary flag equal to 1 is the variant is the top variant in cis

Permuted eQTL results include only the top SNP-gene association for each gene (1000 permutations). Table columns are formatted as follows:

"gene_id" - The phenotype ID
"gene_chr" - The chromosome ID of the phenotype
"gene_start" - The start position of the phenotype
"gene_end" - The end position of the phenotype
"gene_strand" - The strand orientation of the phenotype
"num_var" - The total number of variants tested in cis
"distance" - The distance between the phenotype and the tested variant (accounting for strand orientation)
"snp_id" - The ID of the top variant
"snp_chr" - The chromosome ID of the top variant
"snp_start" - The start position of the top variant
"snp_end" - The end position of the top variant
"degree_of_freedom" - The number of degrees of freedom used to compute the P-values
"dummy" - Dummy
"bval1" - The first parameter value of the fitted beta distribution
"bval2" - The second parameter value of the fitted beta distribution (it also gives the effective number of independent tests in the region)
"nominal_pval" - The nominal P-value of association between the phenotype and the top variant in cis
"slope" - The corresponding regression slope
"empirical_pval" - The P-value of association adjusted for the number of variants tested in cis given by the direct method (i.e. empirircal P-value)
"beta_dist_pval" - The P-value of association adjusted for the number of variants tested in cis given by the fitted beta distribution. We strongly recommend to use this adjusted P-value in any downstream analysis

Nominal sQTL results include all SNP-junction pairs tested (using a 100kb window from the center of each intron cluster). Table columns are formatted as follows:

"pheno_id" - The phenotype ID
"pheno_chr" - The chromosome ID of the phenotype
"pheno_start" - The start position of the phenotype
"pheno_end" - The end position of the phenotype
"pheno_strand" - The strand orientation of the phenotype
"num_var" - The total number of variants tested in cis
"distance" - The distance between the phenotype and the tested variant (accounting for strand orientation)
"snp_id" - The ID of the tested variant
"snp_chr" - The chromosome ID of the variant
"snp_start" - The start position of the variant
"snp_end" - The end position of the variant
"nominal_pval" - The nominal P-value of association between the variant and the phenotype
"slope" - The corresponding regression slope
"lead_snp" - A binary flag equal to 1 is the variant is the top variant in cis

Permuted sQTL results include only the top SNP-junction association by gene (1000 permutations). Table columns are formatted as follows:

"pheno_id" - The phenotype group ID (here a gene ID)
"pheno_chr" - The chromosome ID of the phenotype group
"pheno_start" - The start position of the phenotype group
"pheno_end" - The end position of the phenotype group
"pheno_strand" - The strand orientation of the phenotype group
"pheno_id" - The top phenotype in the group (here an exon ID)
"num_pheno" - The total number of phenotypes in the group (i.e. #exons)
"num_var" - The total number of variants tested in cis
"distance" - The distance between the phenotype group and the tested variant (accounting for strand orientation)
"snp_id" - The ID of the top variant
"snp_chr" - The chromosome ID of the top variant
"snp_start" - The start position of the top variant
"snp_end" - The end position of the top variant
"degree_of_freedom” - The number of degrees of freedom used to compute the P-valuesm"
"dummy" - Dummy
"bval1" - The first parameter value of the fitted beta distribution
"bval2" - The second parameter value of the fitted beta distribution (it also gives the effective number of independent tests in the region)
"nominal_pval" - The nominal P-value of association between the top phenotype and the top variant in cis
"slope" - The corresponding regression slope
"empirical_pval" - The P-value of association adjusted for the number of variants and phenotypes tested in cis given by the direct method (i.e. empirircal P-value)
"beta_dist_pval" - The P-value of association adjusted for the number of variants and phenotypes tested in cis given by the fitted beta distribution. We strongly recommend to use this adjusted P-value in any downstream analysis

NOTE: The effect sizes of eQTLs and sQTL are defined as the effect of the alternative allele (ALT) relative to the reference (REF) allele in the human genome reference (GRCh38).

Files

Files (7.4 GB)

Name	Size	Download all
gencode.v30.primary_assembly.annotation.txt.gz md5:25de740377d384bad292b90196eac490	1.2 MB	Download
microglia_counts_matrix.txt.gz md5:023e7c02b977e9280b028311c8a67f93	7.2 MB	Download
microglia_tpms_matrix.txt.gz md5:d94e70def73144102df98173332c1751	9.1 MB	Download
monocyte_counts_matrix.txt.gz md5:e3a9418cd4bae5e5bec165a5de1bebb8	12.0 MB	Download
monocyte_tpms_matrix.txt.gz md5:5c99e2d6e5ad908921d68283bbfaa3a6	10.9 MB	Download
MyND_monocyte.cis_eqtl_nominal.txt.gz md5:0d6496a584b7943d68c28d261bab86ef	1.7 GB	Download
MyND_monocyte.cis_eqtl_permuted.txt.gz md5:7f9ddae2ad3f831f93731cabc9307d2e	1.1 MB	Download
MyND_monocyte.cis_sqtl_nominal.txt.gz md5:8668e9d2278a2d1b0183be7cb6f2f818	5.3 GB	Download
MyND_monocyte.cis_sqtl_permuted.txt.gz md5:720d6bcb20bfc315e9acecdc09648c5a	745.2 kB	Download
processed_seurat_obj.RDS md5:d18a367e2de5d40594f01a1ba569d738	355.7 MB	Download

	All versions	This version
Views	699	695
Downloads	584	584
Data volume	648.0 GB	648.0 GB

MyND RNAseq and eQTL results

Authors/Creators

Description

Files

Files (7.4 GB)