Published March 8, 2024 | Version v2.0.0
Other Open

ZARP: Supplementary materials

Description

This record supplements the F1000Research article on the ZARP workflow and its ZARP-cli command-line interface.

The following data are included:

  • zarp_use_cases.zip: Includes instructions (zarp_use_cases.md) and input data required to reproduce the use cases described in the manuscript.
  • mouse_sarcopenia_example_outputs.zip: Contains the cross-sample outputs (gene/transcript expression tables, PCA, MultiQC report and Snakemake report) produced by a ZARP run for the described 20-sample mouse sarcopenia RNA-seq experiment (not including indexes), as well as the sample-specific outputs for the smallest of the 20 samples.
  • zarp_cli_example_outputs.zip: Contains a representative fraction of the outputs produced by the described ZARP-cli runs; in particular, it contains the artifacts generated by the SRA download workflow for a single C. elegans sample (SRR21711080), the outputs produced by the HTSinfer workflow for all 25 samples (20 from the mouse sarcopenia dataset and 5 from the metadata inference demonstration run), the genome resources created by genomepy for the C. elegans genome WBcel235, and the C. elegans-specific ZARP workflow results, including indexes.

We would further like readers to note the following:

  • With respect to the latter two archives, data were selected in an effort to keep the record at an acceptable size, while still providing a representative and largely complete overview of ZARP and ZARP-cli outputs for reference and validation.
  • In all cases where files contained absolute file paths, we have manually replaced the real path prefixes with a dummy value (PATH/TO/MY/ANALYSIS), in order not to expose potentially sensitive details about the file system the analyses were run on.
  • We have deliberately not included ZARP, ZARP-cli and machine logs, as we feel that these may be too system-specific and potentially sensitive; we do, however, retain all logs in our archive, and will be willing to share them in response to reasonable requests.

Notes

Licensing information

Note that all script files in zarp_use_cases.zip (i.e., those with file extension .sh) are licensed under Apache 2.0, whereas all other files in that archive (file extensions .yaml and .tsv) are licensed under CC BY 4.0.

The mouse_sarcopenia_example_outputs.zip and zarp_cli_example_outputs.zip archives contain sample data from SRX7031689 and SRX17708425, respectively, and files derived from these and other data sets (including other RNA-seq libraries from the Sequencing Read Archive and genome resources provided by Ensembl) through the application of the ZARP pipeline and software tools contained therein. Unfortunately, we do not know exactly what this means with respect to how these data can be (re)used, so we would kindly ask you to only use them for your own personal needs, e.g., to validate the results you have obtained by reproducing the use cases described in zarp_use_cases.zip.

Notes

MK and MBak  were supported by the "Biozentrum PhD Fellowships" program.

Files

mouse_sarcopenia_example_outputs.zip

Files (6.4 GB)

Name Size Download all
md5:2c1772b178424243dbce66709c88d94d
3.2 GB Preview Download
md5:497f4a4624521b849d2ea25a4320d303
3.2 GB Preview Download
md5:fcc05c67cfc6030c23cd02b57402ddd7
9.4 kB Preview Download

Additional details

Related works

Is published in
Preprint: 10.1101/2021.11.18.469017 (DOI)

Funding

Swiss National Science Foundation
Cell type-specific expression of 3’ untranslated region isoforms: quantification, modeling, and prediction of functional impact 189063
Swiss National Science Foundation
NCCR RNA & disease: The role of RNA biology in disease mechanisms (phase II) 182880