Published May 12, 2015 | Version v1
Dataset Open

Prediction of nucleosome dyads for the K562 cell line in the hg19 genome assembly

  • 1. CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences

Description

Predicting dyads from MNase-seq data

I needed the location of nucleosomal dyads in the K562 cell line (ENCODE tier 1 line). Surprisingly, although plenty of MNase-seq data for that cell line is available, no nucleosome and dyad prediction exists.

Running NuMap

I found the NuMap software by Anton Valouev to do exactly what I intended.

Since it is in a somewhat obscure page and this seemed to be the only place where this software was, I have uploaded it into a Github repository for the sake of preservation (https://github.com/orphancode/NuMap).

Predicting dyads from MNase-seq data with NuMap seemed trivial: I downloaded the K562 MNase-seq data set (11 replicates ~85Gb!!), combined all replicates and ran NuMap on the data(instructions on the Github README).

From NuMap output there are dyad positions in bed format and you can also produce several metrics to evaluate how good the prediction was.

Distograms & phasograms

Valouev describes two measurements of the frequencies of distances between MNase-seq reads. The frequency of distances between reads mapping to opposite strands can be used to build a “distogram”, which ilustrates the expected nucleosome length (147 bp) - this is consistent across most eukaryotic cells. The frequency of distances between reads mapping to the same strand gives a measurement of the distance between nucleosomes, as they’re separated by some linker DNA - (Valouev calls this plot a “phasogram”). This measurement, on the other hand tends to be species and cell-type specific.

K562 predictions:

The expected 147 bp nucleosome length in K562 cells.

The average distance between dyads in K562 cells seems to be 185 bp.

References:

Valouev, A., Johnson, S. M., Boyd, S. D., Smith, C. L., Fire, A. Z., Sidow, A. (2011). Determinants of nucleosome organization in primary human cells. Nature, 474(7352), 516–520. http://doi.org/10.1038/nature10002

Notes

Initially posted as a blogpost at https://andre-rendeiro.com/2015/05/12/predicting_dyads_from_mnase

Files

Files (53.1 MB)

Name Size Download all
md5:8c8acd712f0fc1044fbd2e4cb55704ae
53.1 MB Download