Published July 13, 2017 | Version v1
Dataset Open

Training material for small RNA-seq data analysis (Galaxy Training Network tutorial)

  • 1. Johns Hopkins University


The data provided here are part of a Galaxy Training Network tutorial that analyzes small RNA-seq (sRNA-seq) data from a study published by Harrington et al. (DOI:10.1186/s12864-017-3692-8) to detect differential abundance of various classes of endogenous short interfering RNAs (esiRNAs). The goal of this study was to investigate "connections between differential retroTn and hp-derived esiRNA processing and cellular location, and to investigate the potential link between mRNA 3’ end cleavage and esiRNA biogenesis." To this end, sRNA-seq libraries were constructed from triplicate Drosophila tissue culture samples under conditions of either control RNAi or RNAi knockdown of a factor involved in mRNA 3’ end processing, Symplekin. This dataset (GEO Accession: GSE82128) consists of single-end, size-selected, non-rRNA-depleted sRNA-seq libraries. Because of the long processing time for the large original files, we have downsampled the original raw data files to include only reads that align to a subset of interesting transcript features including: (1) transposable elements, (2) Drosophila piRNA clusters, (3) Symplekin, and (4) genes encoding mass spectrometry-defined protein binding partners of Symplekin from Additional File 2 in the indicated paper by Harrington et al. More details on features 1 and 2 can be found here: (piRNA_Cluster, Trn). All features are from the Drosophila genome Apr. 2006 (BDGP R5/dm3) release.


Files (9.1 MB)

Name Size Download all
1.1 MB Download
899.9 kB Download
545.6 kB Download
11.1 kB Download
2.4 kB Download
1.4 MB Download
2.1 kB Download
2.0 MB Download
1.5 MB Download
1.6 MB Download

Additional details

Related works


  • Harrington, AW et al. Drosophila melanogaster retrotransposon and inverted repeat-derived endogenous siRNAs are differentially processed in distinct cellular locations. 18, 304 (2017).
  • Afgan, E et al. The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2016 update. 44, W3–W10 (2016).