Published April 1, 2021 | Version v1.2020_04_01
Dataset Open

Modified GRCh38 genome with U2AF1 region fix

  • 1. Washington University in St Louis

Description

A modified version of the GRCh38 human reference genome that replaces the region chr21:6427259-6580181 with "N" characters. This creates a coordinate-compatible build that removes a problematic duplication region  This duplicated sequence in the original reference causes mismapping or multimapping of reads to a locus that includes the genes CBS, U2AF1, FRGCA, and CRYAA and hinders variant detection in those genes.  More details are available at https://www.ncbi.nlm.nih.gov/grc/human/issues/HG-2544

This reference is useful in applications where detection of U2AF1 mutations is critical, including studies of hematological cancers, or studies of spliceosome dysfunction. A publication with more details is forthcoming.

Files

Files (873.8 MB)

Name Size Download all
md5:8eb4e81beae3b9066535658dbed55071
873.8 MB Download