There is a newer version of the record available.

Published November 25, 2016 | Version 0.3.1.post1
Dataset Open

ReMM score

  • 1. Berlin Institute of Health (BIH)

Description

The Regulatory Mendelian Mutation (ReMM) score was created for relevance prediction of non-coding variations (SNVs and small InDels) in the human genome (hg19/hg38) in terms of Mendelian diseases.

 

Usage

The ReMM score is genome position wise (nucleotide changes are neglected). We precomputed all positions in the human genome (hg19 and hg38 release) and stored the values in a tabix file (1-based). The scores ranging from 0 (non-deleterious) to 1 (deleterious).

If you want to use the ReMM score together with the Genomiser, please have a look at the Exomiser framework manual

 

ReMM score changelog

0.3.1.post1:

  • New hg38 release. Completely retrained on the new genome build.
    • Training data:
      • Liftover positives. No change in size.
      • Negatives used from CADD v1.4 GRCh38 (human derived), filtered as described in the original paper (https://doi.org/10.1016/j.ajhg.2016.07.005). Size slightly different (hg38: 13,902,234; hg19: 14,755,199) .
    • Features
      • Same size as in hg19: 26 features.
      • We tried to use the same features as in hg19. Sometimes new versions of data have to be used (e.g. DGV, ISCA, dbVAR).
    • Training was done with the parSMURF implementation of hyperSMURF.
      • Same hg19 parameters are used.
    • Metrics via 10-fold cytoband cross-validation (same cytoband to fold map):
  • Scores for hg19 in this release are the same as version 0.3.1. Only the files have been renamed.

0.3.1:

  • Bugfix of region chr17:79759050-81195210. Region is missing in older versions.

0.3:

  • First official public version.
  • Values for positions in training data are computed by cytoband-aware 10 fold cross-validation.
  • Other position scores are compted by a generalized model of all training data.
  • This version was used in the Genomiser publication (Smeley et.al. A Whole-Genome Analysis Framework for Effective Identification of Pathogenic Regulatory Variants in Mendelian Disease. AHJG. 2016)

Files

Files (29.4 GB)

Name Size Download all
md5:5dd21718323e7919e18336cae14568fd
11.9 GB Download
md5:e9839b87e74cf9c57f0242a79ef8b120
2.5 MB Download
md5:02e8c6602dde71a235301bf550954843
17.5 GB Download
md5:96c1b335fd832075dde251e1efd022f1
2.8 MB Download
md5:cf409f7085ef416a8c2783984e74f50c
264 Bytes Download

Additional details

Related works

Is documented by
10.1016/j.ajhg.2016.07.005 (DOI)

References

  • Damian Smedley, Max Schubach, Julius OB Jacobsen, Sebastian Köhler, Tomasz Zemojtel, Malte Spielmann, Marten Jäger, Harry Hochheiser, Nicole L Washington, Julie A McMurry, Melissa A Haendel, Christopher J Mungall, Suzanna E Lewis, Tudor Groza, Giorgio Valentini, Peter N Robinson. (2016). A Whole-Genome Analysis Framework for Effective Identification of Pathogenic Regulatory Variants in Mendelian Disease. The American Journal of Human Genetics, 99(3), 595–606. http://doi.org/10.1016/j.ajhg.2016.07.005