There is a newer version of the record available.

Published June 3, 2020 | Version 1.0.0
Dataset Open

Learning to embed lifetime social behavior from interaction dynamics - Data

  • 1. Department of Computer Science, Freie Universit ̈at, Berlin, Germany
  • 2. Department of Collective Behaviour, Max Planck Institute of Animal Behavior,Radolfzell, Germany

Description

Interaction matrices and metadata used in "Learning to embed lifetime social behavior from interaction dynamics"

The following files are included:

  • interactions_bn16_sparse.npz and interactions_bn19_sparse.npz: These are the interaction affinity matrices for the BN16 and BN19 datasets as described in the publication. The data is stored as compressed sparse tensors with time on the first, and the individuals on the second and third dimensions. The data was stored using the pydata/sparse library 0.9.1
  • alive_bn16.csv and alive_bn19.csv: These files contain the dates of emergence (also corresponding to the dates they were introduced into the colonies) and heuristically determined number days alive for all individuals in the interaction matrices. Death dates were determined using a bayesian changepoint model and the number of daily detections of each individual

  • rhythmicity_bn16.csv and rhythmicity_bn19.csv: These files contain the circadian rhythmicity values used in the evaluation of the method. The circadian rhythmicity is the \(R^2\) value of a sine with a 24 hour period fitted to the individuals' movement velocities over a three day window

  • indices_bn16.csv and indices_bn19.csv: These files contain the mapping between the original marker IDs used during the recording of the data (which has gaps, because not all markers were used) and the sequential indices used in the interaction matrices. These files can therefore be used to look up the original ID of an individual based on it's index in the interaction matrix and vice versa

  • time_spent_on_substrates.csv: This data was used for the mapping from factors to the proportion of time spent on various cell substrates (Figure 5). The positions of the individuals were accumulated by minute, and the column "location_descriptor_count" contains the total number of minutes on the respective day that the individual was detected

See 10.1101/2020.05.06.076943 for more details about the bayesian changepoint model, circadian rhythmicity calculation, and location mapping.

Files

alive_bn16.csv

Files (209.9 MB)

Name Size Download all
md5:acc920be5d8521e3642e723fa6380323
45.3 kB Preview Download
md5:d893c0cee617444747995fe11ae31f96
126.1 kB Preview Download
md5:5c060e9240a22f7dc993db5b32f6180d
22.9 kB Preview Download
md5:ed26f09d59a86e25c91389f1dfed936a
66.5 kB Preview Download
md5:bac9d5597216f02f60a775ec4942f09f
44.6 MB Download
md5:54c367473bc2f1499b202d83b433f115
157.2 MB Download
md5:527425d10964c3fdfc881f1d4d0f2345
1.1 MB Preview Download
md5:87961ba896a2ed54dcba04cd9d27234a
3.5 MB Preview Download
md5:3362501bd349673a6be521510af2048e
3.3 MB Preview Download