Published December 7, 2021 | Version v1
Dataset Open

Significant eQTLs and expression TWAS reference panels (AMP-AD brain and EADB Belgian LCL cohorts)

  • 1. VIB Center for Molecular Neurology

Description

This dataset is part of the manuscript "New insights into the genetic etiology of Alzheimer’s disease and related dementias" by Bellenguez, Küçükali, et al. Nature Genetics 2022.

Publication link: https://www.nature.com/articles/s41588-022-01024-z

GitHub repository of all QTL/TWAS data shared for this study: https://github.com/SleegersLab-VIBCMN/EADB_GWAS_NatureGenetics_QTL_TWAS

For details, please see the publication. For any questions, please contact Fahri Küçükali (fahri.kucukali@uantwerpen.vib.be) and Kristel Sleegers (Kristel.Sleegers@uantwerpen.vib.be).

Significant eQTL catalogues are compressed with gzip and tar achieve of expression TWAS reference panels are compressed with bzip2.

eQTL catalogues

The files show significant eQTL - gene pairs mapped in AMP-AD brain and EADB Belgian LCL cohorts. The catalogues are in hg38/GRCh38 human genome build. Most of the columns in the files are based on FastQTL output (http://fastqtl.sourceforge.net/).

eQTL file columns:

  1. variant_id - ID of the significant eQTL variant based on the dbSNPv151 rsID annotation or hg38/GRCh38 CHR_POS_REF_ALT ID if rsID not available.
  2. gene_id - eGene ENSG gene ID based on GENCODEv24 (AMP-AD) or GENCODEv32 (EADB Belgian)
  3. tss_distance - Genomic distance between eQTL variant and eGene
  4. ma_samples - Number of samples carrying the minor allele
  5. ma_count - Total count of minor alleles
  6. maf - Minor allele frequency
  7. pval_nominal - Nominal P-value of the association
  8. slope - Slope of the association with respect to alternative (ALT) allele indicated on column 14
  9. slope_se - Standard error of the slope
  10. pval_nominal_threshold - Nominal P-value significant threshold for this eGene
  11. min_pval_nominal - Most significant P-value observed for this eGene
  12. pval_beta - permutation P-value obtained via beta approximation and later used to calculate Storey q-values
  13. gene_name - eGene name based on GENCODEv24 (AMP-AD) and GENCODEv32 (EADB Belgian)
  14. GRCh38_chr_pos - Genomic position of the variant, separated by underscore
  15. ref_alt - Reference (REF) and alternative (ALT) allele of the variant, separated by ">" sign. ALT is the tested (A1) allele

Of note, we also mapped the significant sQTLs in the same datasets (please see the data availability section of the manuscript or the GitHub repository).

Expression TWAS reference panels

Custom expression TWAS reference panels prepared using FUSION pipeline (http://gusevlab.org/projects/fusion/) in AMP-AD brain and EADB Belgian LCL cohorts. All data in hg38/GRCh38 genome build. In each directory, you will find ".pos", ."profile", and ".profile.err" files. These are explained in the FUSION website as well, but briefly these are:

  1. .pos: This is a position file that describes the 1Mb extended gene start and end coordinates for each calculated weight file for each gene expression phenotype. Used for scanning the variants in those coordinates for TWAS.
  2. .profile: This informs about all prediction weights calculated, in terms of number of variants in the model, heritability information, and R2 info for each prediction model used (top1, blup, enet, bslmm, lasso; bslmm was not used therefore has NA values).
  3. .profile.err: This summarizes the reference panel in terms of average hsq (with SD), and which model is the best performing.

Each TWAS weight is provided in a .RDat file under All_Expression_Weights, and in this data we included all calculated functional weights independent of the fact that they are heritable features or not. In our TWAS analyses, we included the heritable functional weights at a hsq P-value ≤ 0.05 level.

Please also see the splicing TWAS reference panels we prepared in the same datasets (see the data availability section of the manuscript or the GitHub repository). If you need an LD reference data in hg38/GRCh38 genome build based on 1000 Genomes NFE samples (whose variant ID annotation are matching to these functional weights), suitable for running the TWAS/FUSION pipeline, please contact us.

Files

Files (19.8 GB)

Name Size Download all
md5:dcb1c02a1dc459ebd9643a6ea230e1a9
2.9 MB Download
md5:2cc88df5daf27f15afc575831ba4b85a
1.4 GB Download
md5:d4c00740c8b08920117cbfe5fdf656a3
58.8 MB Download
md5:c202fd7b875b192e9c7f315d2169ec31
3.1 GB Download
md5:e171dfd1140d7fa0767613e7c90091ec
25.1 MB Download
md5:597771f00aa674132c229ca31aacf9bf
3.1 GB Download
md5:5ed8fe2b785ce40321c14cd77f47c7db
17.8 MB Download
md5:b99ff7c6b52efd5eee9739029a66b58a
2.9 GB Download
md5:baafa95757cba715c8c0dea2ae5b806a
14.4 MB Download
md5:5209d754ea6a7c93da5147d13b7a79fb
2.7 GB Download
md5:a2b26e3de322bc5b3bfa6bb5afa4e376
21.7 MB Download
md5:88652bd5e30b286c990919324c2665d8
3.0 GB Download
md5:143db13f8c4398e9b74469e364965de0
145.7 MB Download
md5:ef5c97fa4c433a690273c22e3193dcfd
3.4 GB Download