Published May 6, 2020 | Version v10
Dataset Open

Datasets for the Carpentry-style RNA-seq lesson

Authors/Creators

  • 1. University of Amsterdam

Description

Lesson files

To uncompress any of the .tar.gz archives, open a shell and run `tar -xzvf myarchive.tar.gz`, replacing `myarchive.tar.gz` with the name of the archive.
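For readers who prefer to unpack the archives from a script rather than the shell, the same step can be sketched in Python with the standard-library `tarfile` module. The function name `extract_archive` and the destination directory are illustrative choices, not part of the lesson material.

```python
import tarfile
from pathlib import Path


def extract_archive(archive_path, dest_dir="."):
    """Extract a gzip-compressed tar archive into dest_dir and return its member names."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    with tarfile.open(archive_path, "r:gz") as archive:
        names = archive.getnames()
        # Use the safer "data" extraction filter when this Python version provides it.
        if hasattr(tarfile, "data_filter"):
            archive.extractall(dest, filter="data")
        else:
            archive.extractall(dest)
    return names
```

For example, `extract_archive("tutorial.tar.gz", "lesson_data")` would unpack the gene count files into a hypothetical `lesson_data` directory, assuming the archive has been downloaded to the working directory.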

1) Bioinformatic files: bioinformatic_tutorial_files.tar.gz 

This archive contains the following datasets:

FASTQ files from Arabidopsis leaf RNA-seq:

  • Arabidopsis_sample3.fq.gz
  • Arabidopsis_sample1.fq.gz
  • Arabidopsis_sample4.fq.gz
  • Arabidopsis_sample2.fq.gz

Arabidopsis thaliana genome assembly and genome annotation:

  • AtChromosome1.fa.gz
  • ath_annotation.gff3.gz

Sequencing adapter sequences are provided in adapters.fasta.

2) Gene counts usable with DESeq2 and R: tutorial.tar.gz

This archive contains the following datasets:

  • raw_counts.csv: a dataframe of the raw counts per sample. It is a comma-separated values file, so fields are separated by commas (',').
  • samples_to_conditions.csv: a dataframe that indicates the correspondence between samples and experimental conditions (e.g. control, treated).  
  • differential_genes.csv: a dataframe that contains the results of the DESeq2 analysis, obtained by specifying this contrast in the `DESeq2::results()` function: `contrast = c("infected", "Pseudomonas_syringae_DC3000", "mock")`.
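Because DESeq2 expects the columns of the count matrix to correspond to the rows of the sample table, it can help to compare the two files before starting the analysis. The sketch below does this with Python's standard `csv` module. It assumes, without having checked the actual files, that the first column of raw_counts.csv holds gene identifiers (so the remaining header fields are sample names) and that the first column of samples_to_conditions.csv holds sample names below a header row. The function names are hypothetical.

```python
import csv


def header_samples(counts_path):
    """Return sample names from a counts CSV header (assumed: every column but the first)."""
    with open(counts_path, newline="") as handle:
        header = next(csv.reader(handle))
    return header[1:]


def listed_samples(design_path):
    """Return first-column values of a sample-to-condition CSV, skipping its header row."""
    with open(design_path, newline="") as handle:
        rows = list(csv.reader(handle))
    return [row[0] for row in rows[1:] if row]


def check_samples(counts_path, design_path):
    """Report sample names that appear in only one of the two files."""
    counts = set(header_samples(counts_path))
    design = set(listed_samples(design_path))
    return {
        "only_in_counts": sorted(counts - design),
        "only_in_design": sorted(design - counts),
    }
```

Calling `check_samples("raw_counts.csv", "samples_to_conditions.csv")` returns two empty lists when the sample names agree. If the real files use a different layout, the column indices need adjusting.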

The raw_counts.csv file was obtained by running version `v0.1.1` of an RNA-Seq bioinformatic pipeline on the mRNA-Seq sequencing files from Vogel et al. (2016): https://www.ebi.ac.uk/ena/data/view/PRJEB13938.

Please read the original study (Vogel et al. 2016): https://nph.onlinelibrary.wiley.com/doi/full/10.1111/nph.14036

 

====

Exercise files

1) NASA spaceflight

The NASA GeneLab experiment GLDS-38 performed transcriptomics and proteomics of Arabidopsis seedlings in microgravity by sending seedlings to the International Space Station (ISS).

The raw counts, scaled counts and sample-to-condition files are available in the ZIP archive.

2) Deforges 2019 hormone treatments: deforges_2019.tar.gz

This archive contains:

  • arabidopsis_root_hormones_raw_counts.csv
  • arabidopsis_shoot_hormones_raw_counts.csv
  • arabidopsis_root_hormones_sample2condition.csv
  • arabidopsis_shoot_hormones_sample2condition.csv
  • dataset01_IAA_arabidopsis_root_raw_counts.csv
  • dataset02_ABA_arabidopsis_root_raw_counts.csv
  • dataset03_ACC_arabidopsis_root_raw_counts.csv
  • dataset04_MeJA_arabidopsis_root_raw_counts.csv

The arabidopsis_root_hormones_raw_counts.csv file contains the gene counts for all hormone treatments. For convenience, a separate dataset was also made for each hormone.

Files

Araport11_genes_ko.txt

Files (138.4 MB)

Checksum and size:

  • md5:3194e27d3339ab39470be50a7994c4dd, 736.5 kB
  • md5:56eaeb64d39ec90605c378409310246b, 104.2 MB
  • md5:ced2a569fc52c31653be573161ddee1a, 3.1 MB
  • md5:dac4bf0e6cb0d09b0ddad99550bf6562, 28.1 MB
  • md5:649076fe36315124923e573b69481190, 2.2 MB
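
The md5 values listed above can be used to confirm that a download is complete and uncorrupted. A minimal sketch with Python's standard `hashlib` module follows. The helper names are illustrative, and which checksum belongs to which file has to be read from the record page, since the listing here does not pair them.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected):
    """Compare a file's MD5 digest with an expected value, with or without an 'md5:' prefix."""
    expected = expected.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[4:]
    return md5_of_file(path) == expected
```

A call such as `matches_checksum("deforges_2019.tar.gz", "md5:...")`, with the checksum copied from the record for that file, returns `True` when the download is intact.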