Significant eQTLs and expression TWAS reference panels (AMP-AD brain and EADB Belgian LCL cohorts)

Fahri Küçükali; Kristel Sleegers

doi:10.5281/zenodo.5745927

Published December 7, 2021 | Version v1

Dataset Open

Significant eQTLs and expression TWAS reference panels (AMP-AD brain and EADB Belgian LCL cohorts)

1. VIB Center for Molecular Neurology

This dataset is part of the manuscript "New insights into the genetic etiology of Alzheimer’s disease and related dementias" by Bellenguez, Küçükali, et al. Nature Genetics 2022.

Publication link: https://www.nature.com/articles/s41588-022-01024-z

GitHub repository of all QTL/TWAS data shared for this study: https://github.com/SleegersLab-VIBCMN/EADB_GWAS_NatureGenetics_QTL_TWAS

For details, please see the publication. For any questions, please contact Fahri Küçükali (fahri.kucukali@uantwerpen.vib.be) and Kristel Sleegers (Kristel.Sleegers@uantwerpen.vib.be).

Significant eQTL catalogues are compressed with gzip and tar achieve of expression TWAS reference panels are compressed with bzip2.

eQTL catalogues

The files show significant eQTL - gene pairs mapped in AMP-AD brain and EADB Belgian LCL cohorts. The catalogues are in hg38/GRCh38 human genome build. Most of the columns in the files are based on FastQTL output (http://fastqtl.sourceforge.net/).

eQTL file columns:

variant_id - ID of the significant eQTL variant based on the dbSNPv151 rsID annotation or hg38/GRCh38 CHR_POS_REF_ALT ID if rsID not available.
gene_id - eGene ENSG gene ID based on GENCODEv24 (AMP-AD) or GENCODEv32 (EADB Belgian)
tss_distance - Genomic distance between eQTL variant and eGene
ma_samples - Number of samples carrying the minor allele
ma_count - Total count of minor alleles
maf - Minor allele frequency
pval_nominal - Nominal P-value of the association
slope - Slope of the association with respect to alternative (ALT) allele indicated on column 14
slope_se - Standard error of the slope
pval_nominal_threshold - Nominal P-value significant threshold for this eGene
min_pval_nominal - Most significant P-value observed for this eGene
pval_beta - permutation P-value obtained via beta approximation and later used to calculate Storey q-values
gene_name - eGene name based on GENCODEv24 (AMP-AD) and GENCODEv32 (EADB Belgian)
GRCh38_chr_pos - Genomic position of the variant, separated by underscore
ref_alt - Reference (REF) and alternative (ALT) allele of the variant, separated by ">" sign. ALT is the tested (A1) allele

Of note, we also mapped the significant sQTLs in the same datasets (please see the data availability section of the manuscript or the GitHub repository).

Expression TWAS reference panels

Custom expression TWAS reference panels prepared using FUSION pipeline (http://gusevlab.org/projects/fusion/) in AMP-AD brain and EADB Belgian LCL cohorts. All data in hg38/GRCh38 genome build. In each directory, you will find ".pos", ."profile", and ".profile.err" files. These are explained in the FUSION website as well, but briefly these are:

.pos: This is a position file that describes the 1Mb extended gene start and end coordinates for each calculated weight file for each gene expression phenotype. Used for scanning the variants in those coordinates for TWAS.
.profile: This informs about all prediction weights calculated, in terms of number of variants in the model, heritability information, and R2 info for each prediction model used (top1, blup, enet, bslmm, lasso; bslmm was not used therefore has NA values).
.profile.err: This summarizes the reference panel in terms of average hsq (with SD), and which model is the best performing.

Each TWAS weight is provided in a .RDat file under All_Expression_Weights, and in this data we included all calculated functional weights independent of the fact that they are heritable features or not. In our TWAS analyses, we included the heritable functional weights at a hsq P-value ≤ 0.05 level.

Please also see the splicing TWAS reference panels we prepared in the same datasets (see the data availability section of the manuscript or the GitHub repository). If you need an LD reference data in hg38/GRCh38 genome build based on 1000 Genomes NFE samples (whose variant ID annotation are matching to these functional weights), suitable for running the TWAS/FUSION pipeline, please contact us.

Files

Files (19.8 GB)

Name	Size	Download all
EADB_Belgian_LCL_n70_eQTLs_signifpairs.tsv.gz md5:dcb1c02a1dc459ebd9643a6ea230e1a9	2.9 MB	Download
EADBBelgian_LCL_n70_eTWAS_weights.tar.bz2 md5:2cc88df5daf27f15afc575831ba4b85a	1.4 GB	Download
MayoRNASeq_TCX_n259_eQTLs_signifpairs.tsv.gz md5:d4c00740c8b08920117cbfe5fdf656a3	58.8 MB	Download
MayoRNASeq_TCX_n259_eTWAS_weights.tar.bz2 md5:c202fd7b875b192e9c7f315d2169ec31	3.1 GB	Download
MSBB_BA10_n207_eQTLs_signifpairs.tsv.gz md5:e171dfd1140d7fa0767613e7c90091ec	25.1 MB	Download
MSBB_BA10_n207_eTWAS_weights.tar.bz2 md5:597771f00aa674132c229ca31aacf9bf	3.1 GB	Download
MSBB_BA22_n186_eQTLs_signifpairs.tsv.gz md5:5ed8fe2b785ce40321c14cd77f47c7db	17.8 MB	Download
MSBB_BA22_n186_eTWAS_weights.tar.bz2 md5:b99ff7c6b52efd5eee9739029a66b58a	2.9 GB	Download
MSBB_BA36_n162_eQTLs_signifpairs.tsv.gz md5:baafa95757cba715c8c0dea2ae5b806a	14.4 MB	Download
MSBB_BA36_n162_eTWAS_weights.tar.bz2 md5:5209d754ea6a7c93da5147d13b7a79fb	2.7 GB	Download
MSBB_BA44_n178_eQTLs_signifpairs.tsv.gz md5:a2b26e3de322bc5b3bfa6bb5afa4e376	21.7 MB	Download
MSBB_BA44_n178_eTWAS_weights.tar.bz2 md5:88652bd5e30b286c990919324c2665d8	3.0 GB	Download
ROSMAP_DLPFC_n560_eQTLs_signifpairs.tsv.gz md5:143db13f8c4398e9b74469e364965de0	145.7 MB	Download
ROSMAP_DLPFC_n560_eTWAS_weights.tar.bz2 md5:ef5c97fa4c433a690273c22e3193dcfd	3.4 GB	Download

	All versions	This version
Views	2,005	1,990
Downloads	1,276	1,276
Data volume	2.1 TB	2.1 TB

Significant eQTLs and expression TWAS reference panels (AMP-AD brain and EADB Belgian LCL cohorts)

Authors/Creators

Description

Files

Files (19.8 GB)