Published October 11, 2020 | Version v1.1.0
Software Open

Efficient phasing and imputation of low-coverage sequencing data using large reference panels

  • 1. University of Lausanne

Description

Low-coverage whole genome sequencing followed by imputation has been proposed as a cost-effective genotyping approach for disease and population genetics studies. However, its competitiveness against SNP arrays is undermined as current imputation methods are computationally expensive and unable to leverage large reference panels.

Here, we describe a method, GLIMPSE, for phasing and imputation of low-coverage sequencing datasets from modern reference panels. We demonstrate its remarkable performance across different coverages and human populations. It achieves imputation of a full genome for less than $1, outperforming existing methods by orders of magnitude, with an increased accuracy of more than 20% at rare variants. We also show that 1x coverage enables effective association studies and is better suited than dense SNP arrays to access the impact of rare variations. Overall, this study demonstrates the promising potential of low-coverage imputation and suggests a paradigm shift in the design of future genomic studies.

Files

GLIMPSE-1.1.0.zip

Files (140.5 MB)

Name Size Download all
md5:b73d4f5b43386a4789bdf85f09bb48d1
140.5 MB Preview Download

Additional details

Related works

Is documented by
Preprint: 10.1101/2020.04.14.040329 (DOI)

Funding

A systems genetics approach to characterize disease etiology PP00P3_176977
Swiss National Science Foundation