Published January 8, 2025 | Version v1
Dataset Open

Linkage-Disequilibrium (LD) matrices for six continental ancestry groups from the UK Biobank

Description

This dataset contains Linkage Disequilibrium (LD) matrices for six ancestry groups from the UK Biobank.

LD matrices record the SNP-by-SNP correlations in a given sample of individuals from the general population. In this case, we threshold the matrices so that we only record the correlations between variants in the same LD block (defined by LDetect). The continental ancestry groups are defined by the Pan-UKB initiative as:

  • EUR = European ancestry (N=362446)
  • CSA = Central/South Asian ancestry (N=8284)
  • AFR = African ancestry (N=6255)
  • EAS = East Asian ancestry (N=2700)
  • MID = Middle Eastern ancestry (N=1567)
  • AMR = Admixed American ancestry (N=987)

The sample sizes here are restricted to unrelated individuals in the UK Biobank. The matrices were computed using  magenpy and quantized to int8 data type for better compressibility. The standard matrices (EUR.tar.gz, AFR.tar.gz, ...) contain pairwise correlations for 1.4 million HapMap3+ variants. For European samples, we also provide LD matrices that record pairwise correlations for up to 18 million variants (EUR_18m_variants.tar.gz)

For more details on how these matrices were computed, please consult our manuscript:

Towards whole-genome inference of polygenic scores with fast and memory-efficient algorithms
Shadi Zabad, Chirayu Anant Haryan, Simon Gravel, Sanchit Misra, Yue Li

To access these matrices, consult the codebase of magenpy, our custom python package with special data structures for processing these LD matrices.

Files

Files (22.4 GB)

Name Size Download all
md5:77e9c6c62ea36f88c894694e68611d99
228.5 MB Download
md5:18c8130e167ce8cc404693524c74571e
260.7 MB Download
md5:38958331db0aba28edbd0eeec924d92a
357.7 MB Download
md5:783ad30af1557875acc1ab6e7c32897e
272.3 MB Download
md5:41826edf74f9cc14b3e97024119ad2e6
309.6 MB Download
md5:e935cb6f5ede6f9cc20ef386eaea8152
20.6 GB Download
md5:79a8ece6420d2a0d579d4fef1626a953
359.4 MB Download