MAGE: Multi-ancestry Analysis of Gene Expression
Description
MAGE comprises RNA-seq data from lymphoblastoid cell lines derived from 731 individuals from the 1000 Genomes Project (1KGP), representing 26 globally-distributed populations across five continental groups. These data offer a large, geographically diverse, open access resource to facilitate studies of the distribution, genetic underpinnings, and evolution of variation in human transcriptomes and include data from several ancestry groups that were poorly represented in previous studies.
Briefly, this repo contains the following data:
- Sample metadata and sequencing metrics
- Gene expression and splicing matrices used for e/sQTL mapping and analyses of global trends of expression/splicing diversity
- cis-e/sQTL mapping results, including aFC estimates for cis-eQTLs
- Functional annotations of cis-e/sQTLs
- Results of colocalization analysis between MAGE e/sQTLs and complex trait GWAS from the PAGE study
- Results of analyses of global trends of expression/splicing diversity
- Jointly-generated top genotype PCs for samples in MAGE and other resources with paired WGS/RNA-seq data (Geuvadis, GTEx, AFGR)
READMEs are provided for all data in the repo.
Files
MAGE.v1.0.data.zip
Files
(59.5 GB)
Name | Size | Download all |
---|---|---|
md5:9b32d1e24aa883b3dc57359598420203
|
59.5 GB | Preview Download |
Additional details
Related works
- Is published in
- Preprint: 10.1101/2023.11.04.565639 (DOI)
Funding
- National Institutes of Health
- Functional and fitness consequences of human genetic variation R35GM133747
Dates
- Available
-
2024-01-19