Published February 13, 2024 | Version v1
Dataset Open

Demo 10x Visium dataset for STQ

  • 1. The Jackson Laboratory For Genomic Medicine

Description

The small dataset includes FASTQ files with 13,043,592 reads from 65 Visium spots of sample WM4237_TE_S1_ST. See https://doi.org/10.1101/2023.07.27.550727 for the full datasets details. Below is the code used to subset FASTQ reads to the 65 spots selected in "subset.csv"

fastq1="SC2200092_WM4237-1-1337/fastq/SC2200092_GT22-04384_AGTTTCCTGG-CTGTGTGGCA_S2_L001_R1_001.fastq"
fastq2="SC2200092_WM4237-1-1337/fastq/SC2200092_GT22-04384_AGTTTCCTGG-CTGTGTGGCA_S2_L001_R2_001.fastq"

fgrep -B 1 -f subset.csv ${fastq1} | grep "@" | cut -d' ' -f1 > readnames.list

seqtk subseq fastq1<(catreadnames.list|cutd@f2)>subsetL001R1001.fastqseqtksubseq{fastq2} <(cat readnames.list | cut -d '@' -f2) > subset_L001_R2_001.fastq

gzip -kv subset_L001_R1_001.fastq
gzip -kv subset_L001_R2_001.fastq

The data includes output of STQ demo run.

Files

SC2200092.tiff

Files (4.8 GB)

Name Size Download all
md5:d6b741f5da1aa0a9d8394dc9ef745a45
756.8 MB Download
md5:086afe81cad16f6406f3e4c0555c187c
78.6 MB Download
md5:71652029a3b14d865df7ed1a46db6836
4.0 GB Preview Download

Additional details

Related works

Is source of
Preprint: 37546876 (PMID)

Software

Repository URL
https://github.com/TheJacksonLaboratory/STQ
Programming language
Nextflow, Python
Development Status
Active