Published June 16, 2022 | Version v2
Dataset Open

UK Biobank release and systematic evaluation of optimised polygenic risk scores for 53 diseases and quantitative traits

Description

Summary-level GWAS data for 53 traits generated by Genomics plc as presented in:

Thompson D. et al. UK Biobank release and systematic evaluation of optimised polygenic risk scores for 53 diseases and quantitative traits (https://doi.org/10.1101/2022.06.16.22276246)

If you have any questions or comments regarding these files, please contact Genomics plc at research@genomicsplc.com

NOTES

These analyses were carried out using the full UK Biobank (UKB) imputation data release (v3b). After removal of exclusions and withdrawals, a subset of 337,151 UKB individuals, the White British Unrelated (WBU) subgroup, was defined as the intersection of two sample groups created by Bycroft et al 2018 (Nature 562, 203-209): the ‘White British ancestry’ group (UKB Data Field 22006) and the ‘used in genetic principal components’ group (UKB Data Field 22020), the latter being high quality samples that were filtered to avoid closely related individuals. All GWAS analyses were performed on the WBU subgroup.

Phenotypes were defined as described in Supplementary Table 1 ‘Phenotype definitions’ using a combination of Hospital Episode Statistics, Cancer Registry reports (where applicable) and self-report responses, with the exception of coronary artery disease (CAD). GWAS data was generated for both a “narrow” and a “broad” definition of CAD. The former was used as part of the training data for the Enhanced CAD PRS, the latter was used as part of the training data for the Enhanced CVD PRS. The phenotype definitions for “narrow” and a “broad” CAD are as follows:

Narrow CAD
(includes angina)
ICD10 codes (where .X indicates all subcodes) from both hospital and death records: I21, I22, I23, I24.1, I25.2, I20.X. ICD9 codes: 410-412, 42979, 413.X. OPCS-4 codes (K40.1–40.4, K41.1–41.4, K45.1–45.5,K49.1–49.2, K49.8–49.9, K50.2, K75.1–75.4, K75.8–75.9), self-reported heart attack (UKB codes 1075 in field 20002; code 1 in field 6150), self-reported coronary angioplasty (ptca) or coronary artery bypass graft (UKB codes 1070 and 1095  in field 20004), self-reported angina.
Broad CAD
(includes angina and all ischaemic heart disease)
As for Narrow CAD, plus ICD10 codes I24.X, I25X, and ICD9 codes 414.X (where .X indicates all subcodes).

Note that there is no GWAS for cardiovascular disease (CVD) per se. This is because the UKB training data for the Enhanced CVD PRS consisted of separate GWASs for “narrow” CAD and ischaemic stroke.

All analyses included Age at assessment, sex (for non-sex specific traits), genotyping chip, and 10 principal components as covariates.

GWAS summary statistics for each trait were generated by applying PLINK 2.0 to the WBU subgroup, using a logistic regression for disease traits, and a linear regression model for quantitative traits. For chromosome X variants males were treated as having 0 or 2 alternative alleles.

The results are not adjusted for genomic control.

DATA FILE CONTENT DESCRIPTION (DISEASE TRAITS)

cpra Variant ID in ‘CPRA’ format. Position reflects position in b37
chrom Chromosome
pos Position in base pairs (b37, 1-based)
alt Alternative allele (effect allele)
beta Effect size (log odds ratio)
standard_error Standard error of beta
minus_log10_p Minus log(base 10) of P-value
ref Reference allele (non-effect allele)
ncase Number of cases
ncontrol Number of controls

DATA FILE CONTENT DESCRIPTION (QUANTITATIVE TRAITS)

cpra Variant ID in ‘CPRA’ format. Position reflects position in b37
chrom Chromosome
pos Position in base pairs (b37, 1-based)
alt Alternative allele (effect allele)
beta Effect size
standard_error Standard error of beta
minus_log10_p Minus log(base 10) of P-value
ref Reference allele (non-effect allele)
ntotal Total sample size

FILE NAMES

The following is a list of traits and their corresponding file names.

DISEASE TRAITS

Age-related macular degeneration amd_strict_UKB_WBU.csv.gz
Alzheimer's disease alzheimers_disease_UKB_WBU.csv.gz
Asthma asthma_UKB_WBU.csv.gz
Atrial fibrillation atrial_fibrillation_UKB_WBU.csv.gz
Bipolar disorder bipolar_disorder_UKB_WBU.csv.gz
Bowel cancer CRC_UKB_WBU.csv.gz
Breast cancer BC_UKB_WBU_women.csv.gz
Coeliac disease celiac_disease_UKB_WBU.csv.gz
Narrow coronary artery disease NARROW_CAD_UKB_WBU.csv.gz
Broad coronary artery disease BROAD_CAD_UKB_WBU.csv.gz
Crohn's disease crohns_disease_UKB_WBU.csv.gz
Epithelial ovarian cancer OC_UKB_WBU.csv.gz
Hypertension HT_UKB_WBU.csv.gz
Ischaemic stroke IS_stroke_UKB_WBU.csv.gz
Melanoma melanoma_UKB_WBU.csv.gz
Multiple sclerosis multiple_sclerosis_UKB_WBU.csv.gz
Osteoporosis OP_WBU_training.csv.gz
Prostate cancer PC_UKB_WBU.csv.gz
Parkinson's disease parkinsons_disease_UKB_WBU.csv.gz
Primary open angle glaucoma POAG_WBU_training.csv.gz
Psoriasis psoriasis_UKB_WBU.csv.gz
Rheumatoid arthritis rheumatoid_arthritis_UKB_WBU.csv.gz
Schizophrenia schizophrenia_UKB_WBU.csv.gz
Systemic lupus erythematosus lupus_UKB_WBU.csv.gz
Type 1 diabetes t1d_UKB_WBU.csv.gz
Type 2 diabetes T2D_UKB_WBU.csv.gz
Ulcerative colitis ulcerative_colitis_UKB_WBU.csv.gz
Venous thromboembolic disease VTE_UKB_WBU.csv.gz

QUANTITATIVE TRAITS

Age at menopause age_at_menopause_UKB_WBU.csv.gz
Apolipoprotein A1 apolipoprotein_a1_UKB_WBU.csv.gz
Apolipoprotein B apolipoprotein_b_UKB_WBU.csv.gz
Body mass index bmi_UKB_WBU.csv.gz
Calcium calcium_UKB_WBU.csv.gz
Docosahexaenoic acid docosahexaenoic_acid_UKB_WBU.csv.gz
Estimated bone mineral density T-score BMD_WBU_training.csv.gz
Estimated glomerular filtration rate (creatinine based) egfr_UKB_WBU.csv.gz
Estimated glomerular filtration rate (cystatin based) egfr_cys_UKB_WBU.csv.gz
Glycated haemoglobin hba1c_UKB_WBU_nodiabetes.csv.gz
High density lipoprotein cholesterol hdl_cholesterol_UKB_WBU.csv.gz
Height height_UKB_WBU.csv.gz
Intraocular pressure iop_WBU_training.csv.gz
Low density lipoprotein cholesterol ldl_UKB_WBU_nostatins.csv.gz
Omega-6 fatty acids omega_6_fatty_acids_UKB_WBU.csv.gz
Omega-3 fatty acids omega_3_fatty_acids_UKB_WBU.csv.gz
Phosphatidylcholines phosphatidylcholines_UKB_WBU.csv.gz
Phosphoglycerides phosphoglycerides_UKB_WBU.csv.gz
Polyunsaturated fatty acids polyunsaturated_fatty_acids_UKB_WBU.csv.gz
Resting heart rate resting_heart_rate_UKB_WBU.csv.gz
Remnant cholesterol (Non-HDL, Non-LDL cholesterol) remnant_cholesterol__UKB_WBU.csv.gz
Sphingomyelins sphingomyelins_UKB_WBU.csv.gz
Total cholesterol total_cholesterol_UKB_WBU.csv.gz
Total fatty acids total_fatty_acids_UKB_WBU.csv.gz
Total triglycerides total_triglycerides_UKB_WBU.csv.gz

Files

README.txt

Files (17.9 GB)

Name Size Download all
md5:854ff3d6ec0c9baf44dd92facd2267cb
327.5 MB Download
md5:0af7d2dce8edb9853038c8dbc67f3b02
349.2 MB Download
md5:045b6b5b3bcd229660d2823518200076
349.3 MB Download
md5:97171cab82138cfd96bddb1de51adec2
320.3 MB Download
md5:5d407b782e3c2672f15241c0baebfeb8
328.8 MB Download
md5:36ae5d56278e579a5c3a8310a8b3eef9
367.0 MB Download
md5:72121041047710d1c927aa6609bca687
355.2 MB Download
md5:34c058db815f8fd332de0ffcdd43c352
350.4 MB Download
md5:78affa82bb7b3bb8b8f2c2e325c0b77f
350.5 MB Download
md5:6dd0a686af2f3eea480b325c19b4945b
313.1 MB Download
md5:94f5efed0abf48c16487b39a26bcb7f5
328.5 MB Download
md5:d7e19a3ec15f09d955f5d2f95309b94e
358.7 MB Download
md5:a9d341c5bbfeb6ddcbf4b96f2913786a
328.4 MB Download
md5:588c1a12fea975a8e78d37faab9ab086
351.8 MB Download
md5:ca5b568b6093d355f76f9b240a71adce
347.4 MB Download
md5:d592fbb7060fe1481c1e840cf82396cd
349.7 MB Download
md5:9af12e4e66597243c243960ce07e22e4
324.7 MB Download
md5:5f13889f34f47b953cb5ecd8bcedd4fb
328.5 MB Download
md5:c31a9f6ec5caa69645b4da2c2e6066c4
328.6 MB Download
md5:36f484ad07f150fa601e5d64e323bfe4
328.3 MB Download
md5:aa2d5a26bdcc44c05be7f1cf4834456d
328.3 MB Download
md5:160f2536266e4e7ce8510b8fb48be5c5
328.3 MB Download
md5:06fc656ea6c32363039a8c40218d7b4f
365.1 MB Download
md5:01068beccc939c4a48a9707469616d78
303.6 MB Download
md5:de83c76b78242b8f75f24f741e5401ce
347.1 MB Download
md5:63bf1f9d01dd0d451fc29955cbc69c97
327.9 MB Download
md5:ed7a7ff43578918b4ddae3f20a3c18b0
331.0 MB Download
md5:a3512e5d91e35509b07e4d330ca7a452
353.6 MB Download
md5:ee31ba99cd025a8c74748904c9161718
348.4 MB Download
md5:a1c36006329c0a95d17cd2e88eb34c71
358.0 MB Download
md5:ee3abff1bc2df1045e5d21f05be5d4de
336.3 MB Download
md5:1e52c244119be32f79996d52f5307272
320.3 MB Download
md5:096dbabfb1af66f98577a9a9750f0a2d
322.4 MB Download
md5:86c7d48708cb16e9a6dffa38320723db
347.1 MB Download
md5:af5c358413f349d1b255722d0469d0b1
351.2 MB Download
md5:78027e32588e2e471eb43764b65cb101
347.2 MB Download
md5:20d66841b4c4a9be6267c58e83f02845
320.7 MB Download
md5:a2bff1d2bd9f7d80f926213fb79c2e06
320.8 MB Download
md5:20567ebf092b13dc352ff297006ce09a
343.3 MB Download
md5:b66617d10645cae69c1e27bd13f59a67
322.8 MB Download
md5:e94bb7fae75940e222c002bcf3bf1a3f
347.6 MB Download
md5:c0c17bc07a1c34c847a461418c103285
8.1 kB Preview Download
md5:5735a0516839493290d47e19ccec03a0
321.0 MB Download
md5:da39300607c03f867b357f59960755bf
328.5 MB Download
md5:a13ca50a81bdbe6ef7b19a810e5e24ad
356.3 MB Download
md5:b18222dadf6b5ae2d6dcbec471cecc86
335.3 MB Download
md5:3020699deded250375e152c3aa338764
325.2 MB Download
md5:1449fbc9dfba0c4863eb33785e98d7f8
345.1 MB Download
md5:631034654c41dce276a44a24b85cc07a
356.0 MB Download
md5:6f25c0eb062311f317d1c87c2a64c3f9
323.4 MB Download
md5:2a0b6865bfca4d5bb553400429e0d574
321.5 MB Download
md5:1eecd00625a8dc23b2a067ba2b4b9a45
321.9 MB Download
md5:7886aea5222bbfbfa1e6ac5496ed6c23
353.1 MB Download
md5:966cdb79c86cef519e7a8e360f2d4379
360.6 MB Download

Additional details

Related works

Is supplement to
Preprint: 10.1101/2022.06.16.22276246 (DOI)