Published November 23, 2025 | Version v2
Dataset Open

LD reference panels - All of Us and UK Biobank

Authors/Creators

  • 1. ROR icon Michigan State University

Description

All of Us Reference Panels

LD reference panels construction

  • Select genetic variants that are available in the All of Us (CDRv7, Controlled Tier) array genotype dataset.
  • The following variant filtering/removing criteria were applied: (a) located on sex chromosomes; (b) missing value frequency > 0.03; and (c) MAF < 0.01.
  • LD blocks were defined according to the LD block definition in PRS-CSx [Ref] (the start and end base pair position of the blocks of the LD reference panels used in PRS-CSx).
  • LD blocks were saved in matrices with the "dgCMatrix" class and in "RDS" file format, together compressed with map files including the variant information.

Sample size

  • African American: 62,331
  • Hispanic: 59,640

Number of variants

  • African American: 993,783
  • Hispanic: 875,221

 

UK Biobank Reference Panels

LD reference panels construction

  • Select genetic variants that are available in both 1000 Genomes and the UK Biobank genotyped dataset.
  • The following variant filtering/removing criteria were applied: (a) located on sex chromosomes; (b) missing value frequency > 0.02; and (c) MAF < 0.05.
  • LD blocks were saved in matrices with the "dgCMatrix" class and in "RDS" file format, together compressed with map files including the variant information.

Sample size

  • AFR:7,507
  • AMR: 687

Number of variants

  • AFR: 1,234,911
  • AMR: 1,183,556

Files

Files (15.6 GB)

Name Size Download all
md5:52d466430656c8d8d525ebb438ad9501
3.5 GB Download
md5:a83be4a12dcd67a897438337ecac7720
5.1 GB Download
md5:0c19bfce979a29eea79d1de7722dc12b
4.3 GB Download
md5:31252fdf4bea00686273525e39192855
2.7 GB Download