Dataset Open Access

Training material for small RNA-seq data analysis (Galaxy Training Network tutorial)

Freeberg, Mallory

The data provided here are part of a Galaxy Training Network tutorial that analyzes small RNA-seq (sRNA-seq) data from a study published by Harrington et al. (DOI:10.1186/s12864-017-3692-8) to detect differential abundance of various classes of endogenous short interfering RNAs (esiRNAs). The goal of the study was to investigate "connections between differential retroTn and hp-derived esiRNA processing and cellular location, and to investigate the potential link between mRNA 3’ end cleavage and esiRNA biogenesis." To this end, sRNA-seq libraries were constructed from triplicate Drosophila tissue culture samples under one of two conditions: control RNAi, or RNAi knockdown of Symplekin, a factor involved in mRNA 3’ end processing.

This dataset (GEO Accession: GSE82128) consists of single-end, size-selected, non-rRNA-depleted sRNA-seq libraries. Because the large original files take a long time to process, we downsampled the original raw data files to include only reads that align to a subset of transcript features of interest: (1) transposable elements, (2) Drosophila piRNA clusters, (3) Symplekin, and (4) genes encoding mass spectrometry-defined protein binding partners of Symplekin, taken from Additional File 2 of the paper by Harrington et al. More details on features 1 and 2 can be found at https://github.com/bowhan/piPipes/blob/master/common/dm3/genomic_features (piRNA_Cluster, Trn). All features are from the Drosophila genome Apr. 2006 (BDGP R5/dm3) release.

Files (9.1 MB)

Name                                                 MD5                               Size
Blank_RNAi_sRNA-seq_rep1_downsampled.fastqsanger.gz  6638232f458ed3abbb642d2eb59a5c2b  1.1 MB
Blank_RNAi_sRNA-seq_rep2_downsampled.fastqsanger.gz  d9e71d0c98d7c3102a02c9ce69343f84  899.9 kB
Blank_RNAi_sRNA-seq_rep3_downsampled.fastqsanger.gz  782a05b6387f7d98372f75ac9033db1f  545.6 kB
dm3_miRNA_hairpin_sequences.fa.gz                    2c2c61d0c0ddee5bfda52e8a1872e4f6  11.1 kB
dm3_rRNA_sequences.fa.gz                             d6f423ec02f6e765e7efbcef25fc9592  2.4 kB
dm3_transcriptome_sequences_downsampled.fa.gz        df4f879580442650564f60d020e0d0e0  1.4 MB
dm3_transcriptome_Tx2Gene_downsampled.tab.gz         dd0cb1cb1b96f842e21a2c967a21eb66  2.1 kB
Symp_RNAi_sRNA-seq_rep1_downsampled.fastqsanger.gz   c9119dbc9d50ab654eb55dfc48548257  2.0 MB
Symp_RNAi_sRNA-seq_rep2_downsampled.fastqsanger.gz   c0ad66cf30bc5bd8056f86ea6efe52b2  1.5 MB
Symp_RNAi_sRNA-seq_rep3_downsampled.fastqsanger.gz   c12859e9a9f8ea88fe0e047751038b00  1.6 MB
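
The md5 checksums listed with the files can be used to confirm that each download arrived intact. The following is a minimal sketch, assuming the files have already been saved under their listed names into a single local directory. The helper names md5_of_file and verify_downloads are hypothetical and are not part of the tutorial or of Galaxy.

```python
import hashlib
from pathlib import Path

# File names and md5 checksums copied from the file listing of this record.
EXPECTED_MD5 = {
    "Blank_RNAi_sRNA-seq_rep1_downsampled.fastqsanger.gz": "6638232f458ed3abbb642d2eb59a5c2b",
    "Blank_RNAi_sRNA-seq_rep2_downsampled.fastqsanger.gz": "d9e71d0c98d7c3102a02c9ce69343f84",
    "Blank_RNAi_sRNA-seq_rep3_downsampled.fastqsanger.gz": "782a05b6387f7d98372f75ac9033db1f",
    "dm3_miRNA_hairpin_sequences.fa.gz": "2c2c61d0c0ddee5bfda52e8a1872e4f6",
    "dm3_rRNA_sequences.fa.gz": "d6f423ec02f6e765e7efbcef25fc9592",
    "dm3_transcriptome_sequences_downsampled.fa.gz": "df4f879580442650564f60d020e0d0e0",
    "dm3_transcriptome_Tx2Gene_downsampled.tab.gz": "dd0cb1cb1b96f842e21a2c967a21eb66",
    "Symp_RNAi_sRNA-seq_rep1_downsampled.fastqsanger.gz": "c9119dbc9d50ab654eb55dfc48548257",
    "Symp_RNAi_sRNA-seq_rep2_downsampled.fastqsanger.gz": "c0ad66cf30bc5bd8056f86ea6efe52b2",
    "Symp_RNAi_sRNA-seq_rep3_downsampled.fastqsanger.gz": "c12859e9a9f8ea88fe0e047751038b00",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex md5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(directory, expected=EXPECTED_MD5):
    """Map each expected file name to 'ok', 'mismatch', or 'missing'."""
    results = {}
    for name, expected_md5 in expected.items():
        path = Path(directory) / name
        if not path.is_file():
            results[name] = "missing"
        elif md5_of_file(path) == expected_md5:
            results[name] = "ok"
        else:
            results[name] = "mismatch"
    return results


if __name__ == "__main__":
    # Assumption: the downloads sit in the current working directory.
    for name, status in verify_downloads(".").items():
        print(f"{status:9s} {name}")
```

The checksum is computed on the compressed .gz file as downloaded, so the files should be verified before they are decompressed.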
  • Afgan, E et al. The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2016 update. Nucleic Acids Research 44, W3–W10 (2016).
  • Harrington, AW et al. Drosophila melanogaster retrotransposon and inverted repeat-derived endogenous siRNAs are differentially processed in distinct cellular locations. BMC Genomics 18, 304 (2017).