Dataset Open Access

GWAS and GTEx QTL integration

Barbeira, Alvaro Numa; Bonazzola, Rodrigo; Gamazon, Eric R; Liang, Yanyu; Park, YoSon; Kim-Hellmuth, Sarah; Wang, Gao; Jiang, Zhuoxun; Zhou, Dan; Hormozdiari, Farhad; Liu, Boxiang; Rao, Abhiram; Hamel, Andrew R; Pividori, Milton D; Aguet, François; Bastarache, Lisa; Jordan, Daniel M; Verbanck, Marie; Do, Ron; Stephens, Matthew; Ardlie, Kristin; McCarthy, Mark; Montgomery, Stephen B; Segré, Ayellet V; Brown, Christopher D; Lappalainen, Tuuli; Wen, Xiaoquan; Im, Hae Kyung


MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nmm##2200000uu#4500</leader>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">GWAS</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">genomics</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">QTL</subfield>
  </datafield>
  <controlfield tag="005">20200124192617.0</controlfield>
  <datafield tag="500" ind1=" " ind2=" ">
    <subfield code="a">This data was analyzed in:
https://www.biorxiv.org/content/early/2019/10/22/814350.full.pdf</subfield>
  </datafield>
  <controlfield tag="001">3518299</controlfield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">The University of Chicago</subfield>
    <subfield code="a">Bonazzola, Rodrigo</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Vanderbilt University</subfield>
    <subfield code="0">(orcid)0000-0003-4204-8734</subfield>
    <subfield code="a">Gamazon, Eric R</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">The University of Chicago</subfield>
    <subfield code="0">(orcid)0000-0001-8893-6505</subfield>
    <subfield code="a">Liang, Yanyu</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">University of Pennsylvania</subfield>
    <subfield code="0">(orcid)0000-0002-0465-4744</subfield>
    <subfield code="a">Park, YoSon</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">New York Genome Center</subfield>
    <subfield code="0">(orcid)0000-0001-8791-5729</subfield>
    <subfield code="a">Kim-Hellmuth, Sarah</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">The University of Chicago</subfield>
    <subfield code="0">(orcid)0000-0001-9336-402X</subfield>
    <subfield code="a">Wang, Gao</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">The University of Chicago</subfield>
    <subfield code="a">Jiang, Zhuoxun</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Vanderbilt University</subfield>
    <subfield code="a">Zhou, Dan</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Harvard University</subfield>
    <subfield code="0">(orcid)0000-0002-5617-6174</subfield>
    <subfield code="a">Hormozdiari, Farhad</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="a">Liu, Boxiang</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Stanford University</subfield>
    <subfield code="a">Rao, Abhiram</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Harvard University</subfield>
    <subfield code="0">(orcid)0000-0001-6420-9573</subfield>
    <subfield code="a">Hamel, Andrew R</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">The University of Chicago</subfield>
    <subfield code="0">(orcid)0000-0002-3035-4403</subfield>
    <subfield code="a">Pividori, Milton D</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Harvard University</subfield>
    <subfield code="0">(orcid)0000-0001-9414-300X</subfield>
    <subfield code="a">Aguet, François</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Vanderbilt University</subfield>
    <subfield code="0">(orcid)0000-0003-3020-447X</subfield>
    <subfield code="a">Bastarache, Lisa</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Icahn School of Medicine at Mount Sinai</subfield>
    <subfield code="0">(orcid)0000-0002-5318-8225</subfield>
    <subfield code="a">Jordan, Daniel M</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Université de Paris</subfield>
    <subfield code="a">Verbanck, Marie</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Icahn School of Medicine at Mount Sinai</subfield>
    <subfield code="0">(orcid)0000-0002-3144-3627</subfield>
    <subfield code="a">Do, Ron</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">The University of Chicago</subfield>
    <subfield code="0">(orcid)0000-0002-5200-3903</subfield>
    <subfield code="a">Stephens, Matthew</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Harvard University</subfield>
    <subfield code="a">Ardlie, Kristin</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">The University of Oxford</subfield>
    <subfield code="0">(orcid)0000-0002-3785-5008</subfield>
    <subfield code="a">McCarthy, Mark</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Stanford University</subfield>
    <subfield code="0">(orcid)0000-0002-5200-3903</subfield>
    <subfield code="a">Montgomery, Stephen B</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Harvard University</subfield>
    <subfield code="0">(orcid)0000-0001-6806-5845</subfield>
    <subfield code="a">Segré, Ayellet V</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">University of Pennsylvania</subfield>
    <subfield code="0">(orcid)0000-0002-3785-5008</subfield>
    <subfield code="a">Brown, Christopher D</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">New York Genome Center</subfield>
    <subfield code="0">(orcid)0000-0002-7746-8109</subfield>
    <subfield code="a">Lappalainen, Tuuli</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">University of Michigan</subfield>
    <subfield code="0">(orcid)0000-0001-8990-2737</subfield>
    <subfield code="a">Wen, Xiaoquan</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">The University of Chicago</subfield>
    <subfield code="0">(orcid)0000-0003-0333-5685</subfield>
    <subfield code="a">Im, Hae Kyung</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="4">rtm</subfield>
    <subfield code="a">The GTEx Consortium</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">6903951305</subfield>
    <subfield code="z">md5:4c0aa72f7c0802e672cdfe3fef6a2063</subfield>
    <subfield code="u">https://zenodo.org/record/3518299/files/coloc_enloc_priors_eqtl.tar.gz</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">191294150</subfield>
    <subfield code="z">md5:5dd54bab4591c93b5e41a191a96e9c50</subfield>
    <subfield code="u">https://zenodo.org/record/3518299/files/enloc_eqtl_eur.tar.gz</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">1015891613</subfield>
    <subfield code="z">md5:1eb5b20242f060a9b88c88acb0cdf423</subfield>
    <subfield code="u">https://zenodo.org/record/3518299/files/enloc_sqtl_eur.tar.gz</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">13251</subfield>
    <subfield code="z">md5:900e4a7d3a14ae87de25ee48f7083dba</subfield>
    <subfield code="u">https://zenodo.org/record/3518299/files/eur_ld.bed.gz</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">34851462</subfield>
    <subfield code="z">md5:dda0eedeb842cfc272e76ad432753d73</subfield>
    <subfield code="u">https://zenodo.org/record/3518299/files/gtex_v8_expression_mashr_snp_smultixcan_covariance.txt.gz</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">12221486</subfield>
    <subfield code="z">md5:443d7912cfa248fa7bb06e9b988217ed</subfield>
    <subfield code="u">https://zenodo.org/record/3518299/files/gtex_v8_splicing_mashr_snp_smultixcan_covariance.txt.gz</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">262092800</subfield>
    <subfield code="z">md5:87f3470bf2676043c748b684fb35fa7d</subfield>
    <subfield code="u">https://zenodo.org/record/3518299/files/mashr_eqtl.tar</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">668825600</subfield>
    <subfield code="z">md5:fa9167cfd2a9699f9f58ad04781e9576</subfield>
    <subfield code="u">https://zenodo.org/record/3518299/files/mashr_sqtl.tar</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">7479</subfield>
    <subfield code="z">md5:8ae69f25c12e164f4edea27ea1eb555c</subfield>
    <subfield code="u">https://zenodo.org/record/3518299/files/README.md</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">3452</subfield>
    <subfield code="z">md5:59629de83131dd66b20018f57520d5cf</subfield>
    <subfield code="u">https://zenodo.org/record/3518299/files/SMR_gtex_v8_README.txt</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">280309893</subfield>
    <subfield code="z">md5:02788f284ea888ded8313f868c81b860</subfield>
    <subfield code="u">https://zenodo.org/record/3518299/files/smultixcan_eqtl.tar.gz</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">1290688752</subfield>
    <subfield code="z">md5:a9d4271727d7c00621ee05bc99df9fe7</subfield>
    <subfield code="u">https://zenodo.org/record/3518299/files/smultixcan_sqtl.tar.gz</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">4253551993</subfield>
    <subfield code="z">md5:c0474256186dc58ed41705475455ebee</subfield>
    <subfield code="u">https://zenodo.org/record/3518299/files/spredixcan_eqtl.tar.gz</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">10793783819</subfield>
    <subfield code="z">md5:ca31861069134046d361e54b53adf7fe</subfield>
    <subfield code="u">https://zenodo.org/record/3518299/files/spredixcan_sqtl.tar.gz</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2019-10-24</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">openaire_data</subfield>
    <subfield code="o">oai:zenodo.org:3518299</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="4">
    <subfield code="p">Widespread dose-dependent effects of RNA expression and splicing on complex diseases and traits</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="u">The University of Chicago</subfield>
    <subfield code="0">(orcid)0000-0002-9153-6120</subfield>
    <subfield code="a">Barbeira, Alvaro Numa</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">GWAS and GTEx QTL integration</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u">https://creativecommons.org/licenses/by/4.0/legalcode</subfield>
    <subfield code="a">Creative Commons Attribution 4.0 International</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2">opendefinition.org</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;# Data usage policy&lt;/p&gt;

&lt;p&gt;When using this data, you must acknowledge the source by citing the publication &amp;quot;Widespread dose-dependent effects of RNA expression and splicing on complex diseases and traits&amp;quot; (https://doi.org/10.1101/814350).&lt;/p&gt;

&lt;p&gt;&amp;nbsp;&lt;/p&gt;

&lt;pre&gt;&lt;em&gt;# GTEx GWAS integration
&lt;/em&gt;
This package contains the results of applying several GWAS-QTL integration methods.
The results were analyzed in [this preprint](&lt;em&gt;https://www.biorxiv.org/content/10.1101/814350v1&lt;/em&gt;)
on the application of GTEx v8 to several GWAS traits.
 
&lt;em&gt;``` 
&lt;/em&gt;&lt;em&gt;.
&lt;/em&gt;&lt;em&gt;|-- colocalization
&lt;/em&gt;&lt;em&gt;|   |-- coloc
&lt;/em&gt;&lt;em&gt;|   |   `-- coloc_enloc_priors_eqtl.tar.gz
&lt;/em&gt;&lt;em&gt;|   |-- enloc
&lt;/em&gt;&lt;em&gt;|   |   |-- enloc_eqtl_eur.tar.gz
&lt;/em&gt;&lt;em&gt;|   |   `-- enloc_sqtl_eur.tar.gz
&lt;/em&gt;&lt;em&gt;|   `-- eur_ld.bed.gz
&lt;/em&gt;&lt;em&gt;|-- prediction_models
&lt;/em&gt;&lt;em&gt;|   |-- gtex_v8_expression_mashr_snp_smultixcan_covariance.txt.gz
&lt;/em&gt;&lt;em&gt;|   |-- gtex_v8_splicing_mashr_snp_smultixcan_covariance.txt.gz
&lt;/em&gt;&lt;em&gt;|   |-- mashr_eqtl.tar
&lt;/em&gt;&lt;em&gt;|   `-- mashr_sqtl.tar
&lt;/em&gt;&lt;em&gt;|-- smr
&lt;/em&gt;&lt;em&gt;|   |-- SMR_gtex_v8_README.txt
&lt;/em&gt;&lt;em&gt;|   `-- SMRresults_GTEx_v8_peQTL5e-08.tar.gz
&lt;/em&gt;&lt;em&gt;|-- smultixcan
&lt;/em&gt;&lt;em&gt;|   |-- smultixcan_eqtl.tar.gz
&lt;/em&gt;&lt;em&gt;|   `-- smultixcan_sqtl.tar.gz
&lt;/em&gt;&lt;em&gt;`-- spredixcan
&lt;/em&gt;&lt;em&gt;    |-- spredixcan_eqtl.tar.gz
&lt;/em&gt;&lt;em&gt;    `-- spredixcan_sqtl.tar.gz
&lt;/em&gt;
&lt;em&gt; ```
&lt;/em&gt;&lt;em&gt; 
&lt;/em&gt;You can uncompress gzipped tarball packages &lt;em&gt;`*.tar.gz` &lt;/em&gt;in a UNIX command line with an instruction such as:
&lt;em&gt;```bash
&lt;/em&gt;&lt;em&gt;tar -xzvpf smultixcan_eqtl.tar.gz
&lt;/em&gt;&lt;em&gt;```
&lt;/em&gt;The tar packages (&lt;em&gt;`*.tar`&lt;/em&gt;) can be unpacked with an analogous instruction:
&lt;em&gt;```bash
&lt;/em&gt;&lt;em&gt;tar -xvpf mashr_eqtl.tar
&lt;/em&gt;&lt;em&gt;```
&lt;/em&gt;
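
Before unpacking, it may be worth checking each download against the MD5 checksum listed for it in the Zenodo record.
The following is a minimal Python sketch, not part of the release; the function names are hypothetical.

```python
import hashlib

def md5_of_file(path, chunk_size=1024 * 1024):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops at the empty bytes object returned at EOF
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()

def matches_checksum(path, expected_md5):
    """True when the file's MD5 equals the published checksum."""
    return md5_of_file(path) == expected_md5.lower()
```

For example, `matches_checksum("smultixcan_eqtl.tar.gz", "02788f284ea888ded8313f868c81b860")` compares a local copy with the checksum published for that file.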

&lt;em&gt;## Preliminaries
&lt;/em&gt;
&lt;strong&gt;**&lt;/strong&gt;Finemapping&lt;strong&gt;** &lt;/strong&gt;results are contained in a separate release due to size constraints.

GWAS summary statistics for 114 traits were harmonized and imputed to GTEx v8 variants with MAF&amp;gt;0.01 using only European samples
(summary imputation software [here](&lt;em&gt;https://github.com/hakyimlab/summary-gwas-imputation&lt;/em&gt;)).
Some of the following analyses used the full set of 114 traits,
while some focused only on 87 traits whose imputed associations showed no deflation
(the imputation algorithm is conservative, and studies with too few available variants have a depleted distribution of association p-values after imputation).

The harmonized and imputed GWAS summary statistics are contained in a separate release due to size constraints. 
For completeness&amp;#39; sake, the imputed summary statistics look like:
&lt;em&gt;```
&lt;/em&gt;&lt;em&gt;variant_id panel_variant_id   chromosome position   effect_allele  non_effect_allele  current_build  frequency  sample_size    zscore pvalue effect_size    standard_error imputation_status  n_cases
&lt;/em&gt;&lt;em&gt;rs554008981    chr1_13550_G_A_b38 chr1   13550  A  G  hg38   0.017316017316017316   336474 -2.2919929353647097    0.021906050841240293   NA NA imputed    NA
&lt;/em&gt;&lt;em&gt;rs201055865    chr1_14671_G_C_b38 chr1   14671  C  G  hg38   0.012987012987012988   336474 -0.9559192804440632    0.33911301727494103    NA NA imputed    NA
&lt;/em&gt;&lt;em&gt;...
&lt;/em&gt;&lt;em&gt;```
&lt;/em&gt;
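As a rough illustration of the format above, the Python sketch below parses one of the example rows.
It assumes the columns are whitespace-delimited and that `NA` marks a missing value.
It also recomputes a two-sided normal p-value from `zscore`; the example rows look consistent with that relationship, but this is an observation about the sample, not a documented guarantee.

```python
import math

HEADER = ("variant_id panel_variant_id chromosome position effect_allele "
          "non_effect_allele current_build frequency sample_size zscore pvalue "
          "effect_size standard_error imputation_status n_cases").split()

# Columns converted to numbers; every other column stays a string.
NUMERIC = {"position": int, "frequency": float, "sample_size": int,
           "zscore": float, "pvalue": float, "effect_size": float,
           "standard_error": float, "n_cases": int}

def parse_row(line):
    """Split one whitespace-delimited row; 'NA' becomes None."""
    record = dict(zip(HEADER, line.split()))
    for name, cast in NUMERIC.items():
        record[name] = None if record[name] == "NA" else cast(record[name])
    return record

def two_sided_pvalue(zscore):
    """Two-sided p-value of a standard normal z-score."""
    return math.erfc(abs(zscore) / math.sqrt(2.0))

row = parse_row("rs554008981 chr1_13550_G_A_b38 chr1 13550 A G hg38 "
                "0.017316017316017316 336474 -2.2919929353647097 "
                "0.021906050841240293 NA NA imputed NA")
print(row["panel_variant_id"], row["pvalue"], two_sided_pvalue(row["zscore"]))
```
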
The GWAS were split into approximately independent LD regions (Berisa-Pickrell).
GWAS regions are defined in &lt;em&gt;`eur_ld.bed.gz` &lt;/em&gt;(note that a few of them are ill-defined in hg38 and were ignored; only completely defined regions were used). 

&lt;em&gt;## Colocalization
&lt;/em&gt;
&lt;em&gt;### Enloc
&lt;/em&gt;
ENLOC ([see software here](&lt;em&gt;https://github.com/xqwen/integrative&lt;/em&gt;))
was run for sQTLs and eQTLs using individuals of European ancestry and DAP-G QTL enrichment results on 87 traits.
Result files are included in &lt;em&gt;`enloc_eqtl_eur.tar.gz` &lt;/em&gt;and &lt;em&gt;`enloc_sqtl_eur.tar.gz`&lt;/em&gt;.
Each file contains results for a particular tissue-trait combination.
Each row details colocalization between a GWAS region (Berisa-Pickrell) and a gene&amp;#39;s or intron&amp;#39;s cis-window.

A region might overlap multiple genes/introns or vice versa.
Each ENLOC file contains the following columns:

&lt;strong&gt;* &lt;/strong&gt;gwas_locus: GWAS LD region
&lt;strong&gt;* &lt;/strong&gt;molecular_qtl_trait: gene or intron
&lt;strong&gt;* &lt;/strong&gt;locus_gwas_pip: posterior inclusion probability of variants in the GWAS LD region
&lt;strong&gt;* &lt;/strong&gt;locus_rcp: regional colocalization probability (main colocalization measure)
&lt;strong&gt;* &lt;/strong&gt;lead_coloc_SNP: SNP with the highest RCP
&lt;strong&gt;* &lt;/strong&gt;lead_snp_rcp: RCP of the lead colocalized SNP
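
Because a gene or intron can appear in several rows (one per overlapping GWAS region), one common summary is to keep the row with the largest `locus_rcp` for each `molecular_qtl_trait`.
The Python sketch below shows one way to do that. It assumes a tab-delimited file with a header line (pass another `delimiter` if needed), and the rows are hypothetical.

```python
import csv
import io

def best_rcp_per_trait(handle, delimiter="\t"):
    """Map each molecular_qtl_trait to its row with the largest locus_rcp."""
    best = {}
    for row in csv.DictReader(handle, delimiter=delimiter):
        row["locus_rcp"] = float(row["locus_rcp"])
        key = row["molecular_qtl_trait"]
        # keep whichever of the stored row and the new row has the larger RCP
        best[key] = max(best.get(key, row), row, key=lambda r: r["locus_rcp"])
    return best

# Hypothetical rows for illustration only; not taken from the release.
example = io.StringIO(
    "gwas_locus\tmolecular_qtl_trait\tlocus_rcp\n"
    "Loc1\tGENE_A\t0.10\n"
    "Loc2\tGENE_A\t0.62\n"
    "Loc2\tGENE_B\t0.03\n"
)
for trait, row in best_rcp_per_trait(example).items():
    print(trait, row["gwas_locus"], row["locus_rcp"])
```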


&lt;em&gt;### Coloc
&lt;/em&gt;
Coloc ([see software here](&lt;em&gt;https://cran.r-project.org/web/packages/coloc/index.html&lt;/em&gt;))
was run using prior probabilities estimated from QTL enrichment of GWAS variants (computed via ENLOC).
Results for eQTL are available in &lt;em&gt;`coloc_enloc_priors_eqtl.tar.gz`&lt;/em&gt;. 
Each file contains results for a trait-tissue combination. Columns are:
&lt;strong&gt;* &lt;/strong&gt;gene_id: gene or intron id
&lt;strong&gt;* &lt;/strong&gt;p0: probability that neither QTL nor GWAS contains a causal variant
&lt;strong&gt;* &lt;/strong&gt;p1: probability that only GWAS contains a causal variant
&lt;strong&gt;* &lt;/strong&gt;p2: probability that only QTL contains a causal variant
&lt;strong&gt;* &lt;/strong&gt;p3: probability that GWAS and QTL each have a causal variant and the two variants are distinct
&lt;strong&gt;* &lt;/strong&gt;p4: probability that GWAS and QTL share the same causal variant (main colocalization measure)
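
The five hypotheses are mutually exclusive and exhaustive, so `p0` to `p4` should sum to approximately 1 in each row.
The Python sketch below checks that and ranks rows by `p4`. The tolerance is an arbitrary assumption to allow for rounding in the files, and the rows are hypothetical.

```python
import math

def check_and_rank(rows, tolerance=1e-3):
    """Verify p0..p4 sum to 1 per row, then sort by p4, largest first."""
    for row in rows:
        total = sum(row[name] for name in ("p0", "p1", "p2", "p3", "p4"))
        if not math.isclose(total, 1.0, abs_tol=tolerance):
            raise ValueError("posteriors do not sum to 1 for " + row["gene_id"])
    return sorted(rows, key=lambda row: row["p4"], reverse=True)

# Hypothetical values for illustration only; not taken from the release.
rows = [
    {"gene_id": "GENE_A", "p0": 0.01, "p1": 0.04, "p2": 0.05, "p3": 0.10, "p4": 0.80},
    {"gene_id": "GENE_B", "p0": 0.20, "p1": 0.30, "p2": 0.30, "p3": 0.15, "p4": 0.05},
    {"gene_id": "GENE_C", "p0": 0.00, "p1": 0.02, "p2": 0.03, "p3": 0.05, "p4": 0.90},
]
print([row["gene_id"] for row in check_and_rank(rows)])
```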

&lt;em&gt;## PrediXcan
&lt;/em&gt;
&lt;em&gt;`mashr_eqtl.tar` &lt;/em&gt;and &lt;em&gt;`mashr_sqtl.tar` &lt;/em&gt;contain prediction models 
(trained on expression or splicing data respectively, for 49 GTEx tissues) and LD compilations 
to be used with PrediXcan, S-PrediXcan, MultiXcan and S-MultiXcan.

For every tissue, the &lt;em&gt;`mashr_{tissue}.db` &lt;/em&gt;file is a SQLite file with the prediction model definitions.
&lt;em&gt;`mashr_{tissue}.txt.gz` &lt;/em&gt;is a gzipped-text file with the upper triangular matrices of covariance between snps
within a gene/intron prediction model.

Many variants in these models don&amp;#39;t have an rsid. To fully leverage the information in these models, 
it is advised to at least harmonize to GTEx variants, and if possible impute as we did [here](&lt;em&gt;https://github.com/hakyimlab/summary-gwas-imputation&lt;/em&gt;).
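
As a hedged sketch of how a model file might be inspected from Python, the example below queries the weights of one gene/intron.
The table and column names (`weights`, `gene`, `rsid`, `varID`, `ref_allele`, `eff_allele`, `weight`) are assumptions about the SQLite layout and should be checked against the actual `.db` files; only `varID` is named in this README, as the SNP key passed to S-PrediXcan.
An in-memory database with hypothetical values stands in for a real model.

```python
import sqlite3

def model_weights(connection, gene):
    """Return (varID, weight) pairs for one gene/intron model."""
    cursor = connection.execute(
        "SELECT varID, weight FROM weights WHERE gene = ? ORDER BY varID",
        (gene,),
    )
    return cursor.fetchall()

# In-memory stand-in with the ASSUMED table layout and hypothetical values.
connection = sqlite3.connect(":memory:")
connection.execute(
    "CREATE TABLE weights (gene TEXT, rsid TEXT, varID TEXT, "
    "ref_allele TEXT, eff_allele TEXT, weight REAL)"
)
connection.executemany(
    "INSERT INTO weights VALUES (?, ?, ?, ?, ?, ?)",
    [("GENE_A", None, "chr1_13550_G_A_b38", "G", "A", 0.12),
     ("GENE_A", "rs201055865", "chr1_14671_G_C_b38", "G", "C", -0.05)],
)
print(model_weights(connection, "GENE_A"))
```

With a real file, `sqlite3.connect("mashr_{tissue}.db")` (with the tissue name filled in) would replace the in-memory connection.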

&lt;em&gt;### S-PrediXcan
&lt;/em&gt;
S-PrediXcan was run for the 114 harmonized and imputed traits, on eQTL and sQTL mashr prediction models.
All of the GWAS traits had the same format, so that the following format parameters were used with S-PrediXcan:

&lt;em&gt;```
&lt;/em&gt;&lt;em&gt;--snp_column panel_variant_id --effect_allele_column effect_allele --non_effect_allele_column non_effect_allele --zscore_column zscore \
&lt;/em&gt;&lt;em&gt;--keep_non_rsid --additional_output --model_db_snp_key varID \
&lt;/em&gt;&lt;em&gt;```
&lt;/em&gt;
Each file is a CSV, with each row containing a gene/intron association at a given trait-tissue combination:
&lt;strong&gt;* &lt;/strong&gt;gene: Ensembl ID or intron id
&lt;strong&gt;* &lt;/strong&gt;gene_name: HUGO name or intron id
&lt;strong&gt;* &lt;/strong&gt;zscore: predicted association z-score
&lt;strong&gt;* &lt;/strong&gt;effect_size: estimated effect size
&lt;strong&gt;* &lt;/strong&gt;pvalue: association p-value
&lt;strong&gt;* &lt;/strong&gt;var_g: estimated variance of predicted expression or splicing
&lt;strong&gt;* &lt;/strong&gt;pred_perf_r2: prediction model cross-validated performance (R2)
&lt;strong&gt;* &lt;/strong&gt;pred_perf_pval: p-value of the prediction model cross-validated performance
&lt;strong&gt;* &lt;/strong&gt;pred_perf_qval: deprecated, empty field left for compatibility
&lt;strong&gt;* &lt;/strong&gt;n_snps_used: number of snps in the intersection of GWAS and model
&lt;strong&gt;* &lt;/strong&gt;n_snps_in_cov: number of snps in the LD compilation
&lt;strong&gt;* &lt;/strong&gt;n_snps_in_model: number of snps in the model
&lt;strong&gt;* &lt;/strong&gt;best_gwas_p: smallest p-value across GWAS snps used in this model
&lt;strong&gt;* &lt;/strong&gt;largest_weight: largest prediction model weight
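
A minimal Python sketch of reading one such CSV follows. It picks the most significant association and computes a Bonferroni threshold for the number of tested genes/introns.
The rows are hypothetical, and the skipping of empty or `NA` p-values is an assumption about how missing results are encoded.

```python
import csv
import io

def load_associations(handle):
    """Read an S-PrediXcan CSV, skipping rows without a usable p-value."""
    return [row for row in csv.DictReader(handle)
            if row["pvalue"] not in ("", "NA")]

def top_association(rows):
    """Return the row with the smallest p-value."""
    return min(rows, key=lambda row: float(row["pvalue"]))

def bonferroni_threshold(n_tests, alpha=0.05):
    """Per-test significance threshold for n_tests hypotheses."""
    return alpha / n_tests

# Hypothetical rows for illustration only; not taken from the release.
example = io.StringIO(
    "gene,gene_name,zscore,effect_size,pvalue\n"
    "GENE_A,NAME_A,1.2,0.05,0.23\n"
    "GENE_B,NAME_B,-5.1,-0.40,3.4e-07\n"
    "GENE_C,NAME_C,NA,NA,NA\n"
)
rows = load_associations(example)
print(top_association(rows)["gene"], bonferroni_threshold(len(rows)))
```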

&lt;em&gt;### S-MultiXcan
&lt;/em&gt;
S-MultiXcan results were generated from the above S-PrediXcan results. Each file contains multi-tissue associations for a given trait:


&lt;strong&gt;* &lt;/strong&gt;gene: Ensembl ID or intron id
&lt;strong&gt;* &lt;/strong&gt;gene_name: HUGO name or intron id
&lt;strong&gt;* &lt;/strong&gt;pvalue: multi-tissue association p-value
&lt;strong&gt;* &lt;/strong&gt;n: number of models available for this gene/intron
&lt;strong&gt;* &lt;/strong&gt;n_indep: number of independent components of variation in predicted expression/splicing (surviving principal components) 
&lt;strong&gt;* &lt;/strong&gt;p_i_best: best (smallest) single-tissue p-value (S-PrediXcan) 
&lt;strong&gt;* &lt;/strong&gt;t_i_best: tissue with the best p-value
&lt;strong&gt;* &lt;/strong&gt;p_i_worst: worst (largest) single-tissue p-value (S-PrediXcan)
&lt;strong&gt;* &lt;/strong&gt;t_i_worst: tissue with the worst p-value
&lt;strong&gt;* &lt;/strong&gt;eigen_max: maximum eigenvalue of SVD
&lt;strong&gt;* &lt;/strong&gt;eigen_min: minimum eigenvalue of SVD
&lt;strong&gt;* &lt;/strong&gt;eigen_min_kept: smallest eigenvalue retained after discarding smallest variations
&lt;strong&gt;* &lt;/strong&gt;z_min: minimum single-tissue z-score
&lt;strong&gt;* &lt;/strong&gt;z_max: maximum single-tissue z-score
&lt;strong&gt;* &lt;/strong&gt;z_mean: mean single-tissue z-score
&lt;strong&gt;* &lt;/strong&gt;z_sd: standard deviation of the single-tissue z-scores
&lt;strong&gt;* &lt;/strong&gt;tmi: trace of M * M_i where M is predicted expression/splicing covariance across tissues for a gene, and M_i is its SVD pseudo-inverse
&lt;strong&gt;* &lt;/strong&gt;status: computation status, 0 if no errors

&lt;em&gt;## SMR
&lt;/em&gt;
See &lt;em&gt;`SMR_gtex_v8_README.txt` &lt;/em&gt;for details.&lt;/pre&gt;

&lt;p&gt;&amp;nbsp;&lt;/p&gt;

&lt;p&gt;&amp;nbsp;&lt;/p&gt;

&lt;p&gt;# Disclaimer&lt;/p&gt;

&lt;p&gt;The data is provided &amp;quot;as is&amp;quot;, and the authors assume no responsibility for errors or omissions. &amp;nbsp;&lt;br&gt;
The User assumes the entire risk associated with its use of these data. &amp;nbsp;&lt;br&gt;
The authors shall not be held liable for any use or misuse of the data described and/or contained herein. &amp;nbsp;&lt;br&gt;
The User bears all responsibility in determining whether these data are fit for the User&amp;#39;s intended use. &amp;nbsp;&lt;/p&gt;

&lt;p&gt;The information contained in these data is no better than the original sources from which they were derived,&lt;br&gt;
and both scale and accuracy may vary across the data set. &amp;nbsp;&lt;br&gt;
These data may not have the accuracy, resolution, completeness, timeliness, or other characteristics&lt;br&gt;
appropriate for applications that potential users of the data may contemplate. &amp;nbsp;&lt;br&gt;
&amp;nbsp;&lt;br&gt;
The user is responsible for complying with any data usage policy from the original GWAS studies;&lt;br&gt;
refer to the list of traits described [here](https://www.biorxiv.org/content/10.1101/814350v1)&lt;br&gt;
to identify their respective Consortia&amp;#39;s requirements.&lt;/p&gt;

&lt;p&gt;&lt;br&gt;
THE DATA IS PROVIDED WITHOUT WARRANTY OF ANY KIND,&lt;br&gt;
EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,&lt;br&gt;
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT.&lt;br&gt;
IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY,&lt;br&gt;
WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,&lt;br&gt;
OUT OF OR IN CONNECTION WITH THE DATA OR THE USE OR OTHER DEALINGS IN THE DATA.&lt;/p&gt;</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isSourceOf</subfield>
    <subfield code="a">10.1101/814350</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isVersionOf</subfield>
    <subfield code="a">10.5281/zenodo.3518298</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.5281/zenodo.3518299</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">dataset</subfield>
  </datafield>
</record>