Dataset Open Access

Simulated data for "Spot-On: robust model-based analysis of single-particle tracking experiments"

Anders Sejr Hansen; Maxime Woringer; Jonathan B Grimm; Luke D Lavis; Robert Tjian; Xavier Darzacq

Generation of simulated data

To systematically evaluate the performance of Spot-On as well as other common analysis tools such as MSDi and vbSPT, we considered a comprehensive set of 3480 realistic SPT simulations spanning the range of plausible dynamics. The simulations were performed using simSPT, which is freely available at GitLab: The simulation methods are described in detail at GitLab. A full description of the parameters which allows exact reproduction of the simulations is available together with the data (see Data Availability section). Briefly, we parameterized simSPT to consider that particles diffuse inside a sphere (the nucleus) of 8 µm diameter illuminated using HiLo illumination (assuming a HiLo beam width of 4 µm), with an axial detection range of ~700 nm, centered at the middle of the HiLo beam. Molecules are assumed to have a half-life of 4 frames (when inside the HiLo beam) and of 40 frames when outside the HiLo beam. The localization error was set to 25 nm and the simulation was run until 100000 in-focus trajectories were recorded. More specifically, the effect of the exposure time (1 ms, 4 ms, 7 ms, 13 ms, 20 ms), the free diffusion constant (from 0.5 µm²/s to 14.5 µm²/s in 0.5 µm²/s increments) and the fraction bound (from 0 % to 95 % in 5 % increments) were investigated, yielding a dataset consisting of 3480 simulations. The advantage of simulations is that the ground truth is known. This allows a quantitative assessment of which method works the best.

Content of the archives:

  1.  the code and instructions to reproduce the simulations
  2. 4um.tar.bz2 simulated data inside a 4 µm nucleus
  3. 20um.tar.bz2 simulated data inside a 20 µm nucleus, in which virtually no confinement occurs.
  4. subsampled.tar.bz2 is a set of subsampled datasets, containing either 99999, 30000, 10000, 3000, 1000, 300, 100 or 30 trajectories. Each subsampling was done 50 times, yielding 50 files per subsmpling.


The data is provided both in CSV and .mat formats. .mat files are provided in the following dataset: 10.5281/zenodo.835541

Files (29.8 GB)
Name Size
40.0 kB Download
14.5 GB Download
14.1 GB Download
416.0 MB Download
780.5 MB Download
All versions This version
Views 236236
Downloads 8080
Data volume 533.8 GB533.8 GB
Unique views 225225
Unique downloads 4343


Cite as