Optocoder: computational decoding of spatially indexed bead arrays
Authors/Creators
- 1. Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association
Description
Spatial transcriptomics technologies that can quantify gene expression in space are transforming contemporary biology research. Some of such methods use spatially barcoded bead arrays that are optically sequenced by a microscopy setup to detect bead barcodes in space which can be consecutively matched to cell barcodes from the respective single cell sequencing experiment. To have good quality barcodes and a high number of barcode matches in space, robust and efficient computational pipelines are needed to process raw microscopy images and call the bases of bead barcodes accurately. Here, we present Optocoder, a computational pipeline that takes raw optical sequencing microscopy images as input and outputs bead barcodes in space. Optocoder efficiently aligns images, detects beads, and corrects for confounding factors of the fluorescence signal such as crosstalk and phasing before base calling. Furthermore, we implement a machine learning pipeline that is trained using the signal from the beads that match to illumina barcodes in order to predict non-matching bead barcodes which can boost up the number of barcode matches. We benchmark Optocoder using data from an in-house spatial transcriptomics platform as well as data from the Slide-seq method and we show that it can efficiently process both datasets with minimal modification.
Here, the datasets deposited include the following:
optocoder_data: the imaging and illumina data that are used for Optocoder runs. Folder structure is as following:
- imaging: the images acquired via a two laser microscopy setup during the optical sequencing process . There are four pucks (P1, P2, P3, P4) and for every puck there are 12 images where every image corresponds to one cycle of optical sequencing. Every image is a 6-channel TIFF image.
- illumina: cell barcodes from library sequencing that are used for the matching
- external: bead optical barcodes for the Slide-Seq and Slide-SeqV2
optocoder_v0.1.1_output: these are the output files from Optocoder runs for both in-house and Slide-Seq samples , and are used to generate the figures in the publication. Scripts to generate the figures are deposited here: https://github.com/rajewsky-lab/optocoder_scripts
Files
optocoder_data.zip
Files
(14.3 GB)
| Name | Size | Download all |
|---|---|---|
|
md5:4c5f4f9a2c8d30aba5b4c881f82ce48c
|
6.0 GB | Preview Download |
|
md5:6c15cd376b0f12bce8b337cc18e4ed50
|
8.2 GB | Preview Download |